[Python-Dev] Improve open() to support reading file starting with an unicode BOM

Victor Stinner victor.stinner at haypocalc.com
Sat Jan 9 14:38:43 CET 2010


Le samedi 09 janvier 2010 12:18:33, Walter Dörwald a écrit :
> > Good idea, I choosed open(filename, encoding="BOM").
> 
> On the surface this looks like there's an encoding named "BOM", but
> looking at your patch I found that the check is still done in
> TextIOWrapper. IMHO the best approach would to the implement a *real*
> codec named "BOM" (or "sniff"). This doesn't require *any* changes to
> the IO library. It could even be developed as a standalone project and
> published in the Cheeseshop.

Why not, this is another solution to the point (2) (Check for a BOM while 
reading or detect it before?). Which encoding would be used if there is not 
BOM? UTF-8 sounds like a good choice.

-- 
Victor Stinner
http://www.haypocalc.com/



More information about the Python-Dev mailing list