<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<meta http-equiv="content-type" content="text/html; charset=ISO-8859-1">
</head>
<body bgcolor="#ffffff" text="#000000">
<font face="Arial, sans-serif"><font style="font-size: 10pt;"
size="2">Dear
list</font></font>
<p style="margin-bottom: 0cm;"><font face="Arial, sans-serif"><font
style="font-size: 10pt;" size="2">I
have to read some data from an ASCII text file, filter it, and
then
export it to a .dbf file. Basically a straightforward task...<br>
My
problem is that the input files contain some special national
(Danish) characters, and it appears that I have to do
something
special to handle these in Python.<br>
The
Danish language contains three letters not in the English
alphabet:
æ, ø and å.<br>
E.g.
the Danish city name 'SOLRØD' is read by Python as '<font
color="#000000">SOLR\xc3\x98D'<br>
The
three letters, in lower and upper case, seem to be
translated as
follows:</font></font></font></p>
<p style="margin-bottom: 0cm;"><font face="Arial, sans-serif"><font
style="font-size: 10pt;" size="2"><font color="#000000">æ
= \xc3\xa6<br>
ø
= \xc3\xb8<br>
å
= \xc3\xa5<br>
Æ
= \xc3\x86<br>
Ø
= \xc3\x98<br>
Å
= \xc3\x85</font></font></font></p>
<p style="margin-bottom: 0cm;"><font face="Arial, sans-serif"><font
style="font-size: 10pt;" size="2"><font color="#000000">Question:<br>
What
is this, how do I get my Danish letters back?</font></font></font></p>
<p style="margin-bottom: 0cm;"><font face="Arial, sans-serif"><font
style="font-size: 10pt;" size="2"><font color="#000000">Best
Regards<br>
Martin</font></font></font></p>
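<p style="margin-bottom: 0cm;"><font face="Arial, sans-serif"><font
style="font-size: 10pt;" size="2">A possible explanation, offered as a
sketch rather than a definitive answer: the escapes listed above match the
two-byte UTF-8 encodings of the six Danish letters, so the input file is
probably UTF-8 encoded and Python is showing the raw bytes. Assuming that
is the case, decoding the bytes as UTF-8 should give the letters back. The
variable names below are made up for illustration, the syntax is Python 3,
and Latin-1 as the target encoding for the .dbf file is only an assumption,
since the post does not say which code page the file should use.</font></font></p>
<pre>
```python
# Assumption: the input file is UTF-8 encoded. The byte string below is
# the value quoted in the post for the city name.
raw = b"SOLR\xc3\x98D"

# Decode the raw bytes into text; the two bytes C3 98 become one letter.
city = raw.decode("utf-8")
print(city)

# The six byte sequences listed in the post, decoded one by one.
sequences = [b"\xc3\xa6", b"\xc3\xb8", b"\xc3\xa5",
             b"\xc3\x86", b"\xc3\x98", b"\xc3\x85"]
letters = [seq.decode("utf-8") for seq in sequences]
print(letters)

# Hypothetical export step: re-encode the text for the .dbf file.
# Latin-1 is an assumed target; use whatever code page the file expects.
dbf_bytes = city.encode("latin-1")
```
</pre>
<p style="margin-bottom: 0cm;"><font face="Arial, sans-serif"><font
style="font-size: 10pt;" size="2">When reading from a file rather than a
byte string, the same decoding can be done while opening it, for example
with codecs.open(filename, encoding="utf-8") from the standard
library.</font></font></p>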
</body>
</html>