[Python-Dev] Unicode byte order mark decoding
Nicholas Bastin
nbastin at opnet.com
Thu Apr 7 16:19:37 CEST 2005
On Apr 7, 2005, at 5:07 AM, M.-A. Lemburg wrote:
>> The current implementation of the utf-16 codecs makes for some
>> irritating gymnastics to write the BOM into the file before reading it
>> if it contains no BOM, which seems quite like a bug in the codec.
>
> The codec writes a BOM in the first call to .write() - it
> doesn't write a BOM before reading from the file.
Yes, see, I read a *lot* of UTF-16 that comes from other sources. It's
not a matter of writing with python and reading with python.
--
Nick
More information about the Python-Dev
mailing list