[Python-Dev] PEP 383 update: utf8b is now the error handler

Antoine Pitrou solipsis at pitrou.net
Thu May 7 01:27:00 CEST 2009


Martin v. Löwis <martin <at> v.loewis.de> writes:
> py> b'\xed\xa0\x80'.decode("utf-8","surrogates")
> '\ud800'

The point is, "surrogates" does not mean anything intuitive for an /error
handler/. You seem to be the only one who finds this name explicit enough,
perhaps because you chose it.
Most other handlers' names have verbs in them ("ignore", "replace",
"xmlcharrefreplace", etc.).
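
For illustration, here is a minimal sketch of what those existing handlers do in Python 3. The sample string and variable names are my own assumptions and do not come from the thread; the point is only that each handler's name describes the action taken on an error.

```python
# Sample input chosen for illustration only (not from the thread).
text = "caf\u00e9"
raw = text.encode("utf-8")  # b'caf\xc3\xa9': two non-ASCII bytes

# "ignore": silently drop the bytes that cannot be decoded.
ignored = raw.decode("ascii", "ignore")

# "replace": substitute U+FFFD for each undecodable byte.
replaced = raw.decode("ascii", "replace")

# "xmlcharrefreplace" (encoding only): emit an XML character reference.
xmlref = text.encode("ascii", "xmlcharrefreplace")

print(repr(ignored), repr(replaced), xmlref)
```

In each case the name reads as a verb applied to the offending data, which is the property the proposed "surrogates" name lacks.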

Regards

Antoine.



