[Python-ideas] Support WHATWG versions of legacy encodings

Antoine Pitrou solipsis at pitrou.net
Thu Jan 11 08:42:24 EST 2018


On Thu, 11 Jan 2018 05:18:43 -0800
Nathaniel Smith <njs at pobox.com> wrote:
> I'm not an expert here or anything, but from what we've been hearing it
> sounds like it must be used by all standard-compliant HTML parsers. I don't
> *like* the standard much, but I don't think that the stdlib should refuse
> to handle standard-compliant HTML, or help users handle standard-compliant
> HTML correctly, just because the HTML standard has unfortunate things in
> it. We're not going to convince them to change the standard or anything.
> And this whole thread started with someone saying that their mojibake fixing
> library is having trouble because of this, so clearly mojibake does still
> exist.

This is true.  The other question is what the bar is for admitting new
encodings into the standard library.  I don't know much about the history
of past practices there, so I will happily leave the decision to other
people such as Marc-André.

Regards

Antoine.



