[Python-Dev] Unicode byte order mark decoding

Nicholas Bastin nbastin at opnet.com
Thu Apr 7 16:19:37 CEST 2005


On Apr 7, 2005, at 5:07 AM, M.-A. Lemburg wrote:

>> The current implementation of the utf-16 codecs makes for some
>> irritating gymnastics to write the BOM into the file before reading it
>> if it contains no BOM, which seems quite like a bug in the codec.
>
> The codec writes a BOM in the first call to .write() - it
> doesn't write a BOM before reading from the file.

Yes, see, I read a *lot* of UTF-16 that comes from other sources.  It's
not a matter of writing with Python and reading with Python.
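
To make the case concrete, here is a minimal sketch of reading UTF-16
produced elsewhere, where a BOM may or may not be present.  The helper
name decode_utf16 and its default_order parameter are hypothetical,
made up for illustration and not part of the codecs module; only the
codecs.BOM_UTF16_LE / codecs.BOM_UTF16_BE constants and the
"utf-16-le" / "utf-16-be" codec names are real.  It uses current
Python syntax and assumes the caller knows which byte order to fall
back on when no BOM is found.

```python
import codecs


def decode_utf16(data: bytes, default_order: str = "big") -> str:
    """Decode UTF-16 bytes, honoring a BOM if one is present.

    `default_order` ("big" or "little") is an assumption supplied by the
    caller; it is only used when the data carries no BOM.
    """
    # A leading BOM decides the byte order and is stripped from the result.
    if data.startswith(codecs.BOM_UTF16_LE):
        return data[len(codecs.BOM_UTF16_LE):].decode("utf-16-le")
    if data.startswith(codecs.BOM_UTF16_BE):
        return data[len(codecs.BOM_UTF16_BE):].decode("utf-16-be")

    # No BOM: fall back to the byte order the caller asked for.
    codec = "utf-16-be" if default_order == "big" else "utf-16-le"
    return data.decode(codec)
```

With something like this the reader never has to write a BOM into the
data first; the byte order for BOM-less input is simply an explicit
argument.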

--
Nick


