[Python-Dev] Improve open() to support reading file starting with a Unicode BOM
Victor Stinner
victor.stinner at haypocalc.com
Sat Jan 9 14:38:43 CET 2010
On Saturday 09 January 2010 12:18:33, Walter Dörwald wrote:
> > Good idea, I chose open(filename, encoding="BOM").
>
> On the surface this looks like there's an encoding named "BOM", but
> looking at your patch I found that the check is still done in
> TextIOWrapper. IMHO the best approach would be to implement a *real*
> codec named "BOM" (or "sniff"). This doesn't require *any* changes to
> the IO library. It could even be developed as a standalone project and
> published in the Cheeseshop.
Why not, this is another solution to point (2) (check for a BOM while
reading, or detect it beforehand?). Which encoding would be used if there is
no BOM? UTF-8 sounds like a good choice.
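
To make the "detect it beforehand" variant concrete, here is a minimal sketch.
It is neither the patch nor a real "BOM" codec: the names sniff_encoding and
open_sniffed are hypothetical, and the UTF-8 fallback is only the assumption
suggested above.

```python
import codecs

# UTF-32 BOMs are checked before UTF-16 ones, because the UTF-32-LE BOM
# begins with the same two bytes as the UTF-16-LE BOM.
_BOMS = (
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF32_LE, "utf-32"),
    (codecs.BOM_UTF32_BE, "utf-32"),
    (codecs.BOM_UTF16_LE, "utf-16"),
    (codecs.BOM_UTF16_BE, "utf-16"),
)


def sniff_encoding(head, default="utf-8"):
    """Return a codec name for the BOM at the start of `head` (bytes).

    Hypothetical helper: falls back to `default` when no BOM is found.
    The returned codecs ("utf-8-sig", "utf-16", "utf-32") consume the BOM
    themselves while decoding.
    """
    for bom, name in _BOMS:
        if head.startswith(bom):
            return name
    return default


def open_sniffed(filename, default="utf-8"):
    """Open `filename` in text mode, choosing the encoding from its BOM."""
    # Read at most 4 bytes: the longest BOM (UTF-32) is 4 bytes long.
    with open(filename, "rb") as f:
        head = f.read(4)
    return open(filename, encoding=sniff_encoding(head, default))
```

This reads the first bytes once in binary mode and then reopens the file, so
it only works for real files, not for pipes or sockets; a codec that checks
for the BOM while reading would not have that limitation.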
--
Victor Stinner
http://www.haypocalc.com/