[Python-Dev] Improve open() to support reading file starting with an unicode BOM

Antoine Pitrou solipsis at pitrou.net
Sat Jan 9 16:05:45 CET 2010


Walter Dörwald <walter <at> livinglogic.de> writes:
> 
> On the surface this looks like there's an encoding named "BOM", but 
> looking at your patch I found that the check is still done in 
> TextIOWrapper. IMHO the best approach would be to implement a *real* 
> codec named "BOM" (or "sniff"). This doesn't require *any* changes to 
> the IO library. It could even be developed as a standalone project and 
> published in the Cheeseshop.

Sorry but this is missing the point. The point here is to improve the open()
function. I'm sure people who know about encodings are able to install the
chardet library or even whip up their own BOM detection routine...
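
For illustration, a hand-rolled BOM detection routine of the kind mentioned
above might look roughly like the sketch below. The names detect_bom_encoding
and open_sniffed are hypothetical; they are not part of the patch under
discussion or of the standard library. Only the codecs.BOM_* constants and the
"utf-8-sig", "utf-16" and "utf-32" codec names are assumed to exist, which
they do in the standard codecs module.

```python
import codecs

# Check the 4-byte UTF-32 BOMs first: the UTF-32 LE BOM (FF FE 00 00)
# begins with the UTF-16 LE BOM (FF FE).
_BOMS = (
    (codecs.BOM_UTF32_LE, "utf-32"),
    (codecs.BOM_UTF32_BE, "utf-32"),
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF16_LE, "utf-16"),
    (codecs.BOM_UTF16_BE, "utf-16"),
)


def detect_bom_encoding(path, default="utf-8"):
    """Return a codec name based on the file's BOM, or `default` if none."""
    with open(path, "rb") as f:
        head = f.read(4)  # the longest BOM is 4 bytes
    for bom, encoding in _BOMS:
        if head.startswith(bom):
            # These codecs consume the BOM themselves when decoding.
            return encoding
    return default


def open_sniffed(path, default="utf-8"):
    """Open `path` for text reading with the BOM-detected encoding."""
    return open(path, "r", encoding=detect_bom_encoding(path, default))
```

One known ambiguity in this sketch: a UTF-16 LE file whose first character is
U+0000 starts with the same four bytes as a UTF-32 LE BOM and would be
reported as "utf-32".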


Antoine.




