Sniffing encoding type by looking at file BOM header

Irmen de Jong irmen-NOSPAM- at
Fri Mar 26 00:21:34 CET 2010

On 26-3-2010 0:16, Lawrence D'Oliveiro wrote:
> In message<mailman.1139.1269442366.23598.python-list at>,
> python at wrote:
>> BOM_UTF8     = '\xef\xbb\xbf'
> Since when does UTF-8 need a BOM?

It doesn't, but one is allowed. It is not recommended, though.
Unfortunately, several tools, such as notepad.exe, tend to add it
silently when saving files.
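
Since such files do turn up in practice, here is a minimal sketch of
sniffing the BOM from the first bytes of a file. It assumes Python 3
bytes objects and uses the BOM constants from the standard codecs
module; the function name sniff_bom and the "utf-8" fallback are my own
choices for illustration, not something from the thread.

```python
import codecs

# Longer BOMs go first: BOM_UTF32_LE begins with the same two bytes
# as BOM_UTF16_LE, so the order of the checks matters.
_BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32"),
    (codecs.BOM_UTF32_BE, "utf-32"),
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF16_LE, "utf-16"),
    (codecs.BOM_UTF16_BE, "utf-16"),
]


def sniff_bom(data, default="utf-8"):
    """Return a codec name based on the BOM at the start of `data`.

    `data` is the first few bytes of the file. The returned codecs
    ("utf-8-sig", "utf-16", "utf-32") consume the BOM when decoding.
    If no BOM is found, `default` is returned (an assumption, not a
    detection result).
    """
    for bom, name in _BOMS:
        if data.startswith(bom):
            return name
    return default
```

If you already know the file is UTF-8 and only want to get rid of an
optional BOM, decoding with "utf-8-sig" directly is enough: it strips
the BOM when present and behaves like plain UTF-8 otherwise.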

