[Python-Dev] Internationalization Toolkit
Tim Peters
tim_one@email.msn.com
Tue, 16 Nov 1999 01:19:16 -0500
[MAL]
> sys.bom should return the byte order mark (BOM) for the format used
> internally. The unicodec module should provide symbols for all
> possible values of this variable:
>
> BOM_BE: '\376\377'
> (corresponds to Unicode 0x0000FEFF in UTF-16
> == ZERO WIDTH NO-BREAK SPACE)
>
> BOM_LE: '\377\376'
> (corresponds to Unicode 0x0000FFFE in UTF-16
> == illegal Unicode character)
>
> BOM4_BE: '\000\000\377\376'
> (corresponds to Unicode 0x0000FEFF in UCS-4)
Should be
BOM4_BE: '\000\000\376\377'
> BOM4_LE: '\376\377\000\000'
> (corresponds to Unicode 0x0000FFFE in UCS-4)
Should be
BOM4_LE: '\377\376\000\000'