[Python-ideas] Py3 unicode impositions

Paul Moore p.f.moore at gmail.com
Tue Feb 14 22:10:45 CET 2012


On 14 February 2012 14:38, Chris Rebert <pyideas at rebertia.com> wrote:
> On Tue, Feb 14, 2012 at 2:37 AM, Christopher Reay
> <christopherreay at gmail.com> wrote:
>> Web browsers can parse pages with multiple encodings seemingly perfectly
>> into the correct display characters. A quick copy and paste produces UTF-8
>> encoded text in the clip board. (on linux)
>>
>> HOW DO THEY DO IT.. can we have their libraries? :)
>
> The "chardet" package is in fact a port of Mozilla's encoding guessing code.

It seems to be Python 2 only. "Dive into Python 3" describes porting
it to Python 3, but I don't know of an actual Python 3 version.

Paul



More information about the Python-ideas mailing list