[Python-Dev] Encoding detection in the standard library?
tjreedy at udel.edu
Wed Apr 23 07:26:22 CEST 2008
""Martin v. Löwis"" <martin at v.loewis.de> wrote in message
news:480EC376.8070406 at v.loewis.de...
|> I certainly agree that if the target set of documents is small enough it
| Ok. What advantage would you (or somebody working on a similar project)
| gain if chardet was part of the standard library? What if it was not
| chardet, but some other algorithm?
It seems to me that since there is not a 'correct' algorithm but only
competing heuristics, encoding detection modules should be made available
via PyPI and only be considered for stdlib after a best of breed emerges
with community support.
More information about the Python-Dev