[Python-Dev] urllib unicode handling

"Martin v. Löwis" martin at v.loewis.de
Wed May 7 07:16:12 CEST 2008


> Thanks for any thoughts on this,

The proper way to implement this would be IRIs (RFC 3987),
in particular section 3.1. This is not as simple as just
encoding it as UTF-8, as you might have to apply IDNA to
the host part.

Code doing so just hasn't been contributed yet.
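
For illustration only, here is a rough sketch of what the mapping in
RFC 3987 section 3.1 could look like. It is not an existing urllib
API: the name iri_to_uri and the chosen "safe" character sets are
assumptions made for this example. It applies IDNA to the host and
percent-encodes the UTF-8 bytes of everything else. It does not do
the Unicode normalization step, and Python's built-in "idna" codec
implements the older IDNA 2003 rules.

```python
from urllib.parse import urlsplit, urlunsplit, quote

# Assumption: leave RFC 3986 reserved characters and "%" untouched, so
# existing percent-escapes are not encoded a second time.
_SAFE = "/?:@!$&'()*+,;=%~"
_SAFE_USERINFO = ":!$&'()*+,;=%~"


def iri_to_uri(iri):
    """Hypothetical helper: map an IRI string to an ASCII-only URI."""
    scheme, netloc, path, query, fragment = urlsplit(iri)

    # Split "user:pw@host:port" into userinfo and host[:port].
    userinfo, at, hostport = netloc.rpartition("@")

    if hostport.startswith("["):
        # IPv6 literal (possibly with a port): already ASCII, keep as-is.
        ascii_hostport = hostport
    else:
        host, colon, port = hostport.partition(":")
        if host:
            # The host part needs IDNA, not UTF-8 percent-encoding.
            host = host.encode("idna").decode("ascii")
        ascii_hostport = host + colon + port

    netloc = quote(userinfo, safe=_SAFE_USERINFO) + at + ascii_hostport

    # Every other component: UTF-8 encode, then percent-encode non-ASCII.
    path = quote(path, safe=_SAFE)
    query = quote(query, safe=_SAFE)
    fragment = quote(fragment, safe=_SAFE)

    return urlunsplit((scheme, netloc, path, query, fragment))
```

With this sketch, iri_to_uri("http://bücher.example/straße") gives
"http://xn--bcher-kva.example/stra%C3%9Fe": the host goes through
IDNA while the path is percent-encoded UTF-8, which is why encoding
the whole string as UTF-8 is not enough.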

Regards,
Martin

