[Python-Dev] Encoding detection in the standard library?

Terry Reedy tjreedy at udel.edu
Wed Apr 23 07:26:22 CEST 2008

"Martin v. Löwis" <martin at v.loewis.de> wrote in message
news:480EC376.8070406 at v.loewis.de...
|> I certainly agree that if the target set of documents is small enough it
| Ok. What advantage would you (or somebody working on a similar project)
| gain if chardet was part of the standard library? What if it was not
| chardet, but some other algorithm?

It seems to me that since there is no 'correct' algorithm, only 
competing heuristics, encoding detection modules should be made available 
via PyPI and considered for the stdlib only after a best-of-breed module 
emerges with community support.
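
To illustrate why only heuristics are possible, here is a minimal sketch, not taken from chardet or any other detector. The sample bytes, the candidate list, and the helper name decodings are all assumptions made for illustration. It shows one byte string that several encodings accept, each yielding different text, so validity alone cannot pick a winner:

```python
# Illustrative sketch only: the sample bytes, the candidate encodings, and the
# helper name `decodings` are assumptions, not part of any detection library.

data = "café".encode("latin-1")  # the bytes b'caf\xe9'

candidates = ["utf-8", "latin-1", "cp1252", "koi8_r", "cp437"]


def decodings(raw, encodings):
    """Return {encoding: decoded text} for every encoding that accepts raw."""
    results = {}
    for name in encodings:
        try:
            results[name] = raw.decode(name)
        except UnicodeDecodeError:
            pass  # this encoding rejects the bytes outright
    return results


results = decodings(data, candidates)
for name, text in results.items():
    # ascii() keeps the output readable on any terminal
    print(name, ascii(text))
```

UTF-8 rejects these bytes, but the remaining candidates all decode them without error and disagree about the last character. A detector has to rank the survivors by some statistical guess about plausible text, and different guesses are what make the heuristics compete.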
