Looking for an appropriate encoding standard that supports all languages

Steven D'Aprano steve-REMOVE-THIS at cybersource.com.au
Thu Aug 19 04:48:06 EDT 2010


On Thu, 19 Aug 2010 01:29:51 -0700, ata.jaf wrote:

> On Aug 17, 11:55 pm, Thomas Jollans <tho... at jollybox.de> wrote:
>> On Tuesday 17 August 2010, it occurred to ata.jaf to exclaim:
>>
>> > I am developing a little program in Mac with wxPython. But I have
>> > problems with the characters that are not in ASCII. Like some special
>> > characters in French or Turkish. So I am looking for a way to solve
>> > this. Like an encoding standard that supports all languages. Or some
>> > other way.
>>
>> Anything that supports all of Unicode will do. Like UTF-8. If your text
>> is mostly Latin, then just go for UTF-8, if you use other alphabets
>> extensively, you might want to consider UTF-16, which might the use a
>> little less space.
> 
> OK, I used UTF-8.
> I write a line of strings in the source code 


Do you have a source code encoding line at the start of your script?


http://www.python.org/dev/peps/pep-0263/


> and I want my program to
> show that as an output on GUI. And this line of strings includes a
> character like "ü". But I see that in GUI this character is replaced
> with another strange characters. I mean it doesn't work. And when I try
> to use UTF-16, I get an syntax error that declares "UTF-16 stream does
> not start with BOM".


What GUI are you using?


Please COPY AND PASTE (do not retype) the EXACT error message you get, 
including the entire traceback.




-- 
Steven



More information about the Python-list mailing list