[Python-ideas] Add "htmlcharrefreplace" error handler
Serhiy Storchaka
storchaka at gmail.com
Fri Jun 14 17:00:17 CEST 2013
14.06.13 02:37, Ezio Melotti написав(ла):
> On Tue, Jun 11, 2013 at 5:49 PM, Serhiy Storchaka <storchaka at gmail.com> wrote:
>> I propose to add "htmlcharrefreplace" error handler which is similar to
>> "xmlcharrefreplace" error handler but use html entity names if possible.
>>
>>>>> '∀ x∈ℜ'.encode('ascii', 'xmlcharrefreplace')
>> b'∀ x∈ℜ'
>>>>> '∀ x∈ℜ'.encode('ascii', 'htmlcharrefreplace')
>> b'∀ x∈ℜ'
>>
>
> Do you have any use cases for this, or is it just for completeness
> since we already have xmlcharrefreplace?
In fact, there is no *need* in the "htmlentityreplace" error handler.
"xmlcharrefreplace" is enough in most cases, it is faster and its scope
is wider. "htmlentityreplace" is only desired for more human readable html.
Perhaps it is not worth to register this error handler by default, but I
see some people desire it in the stdlib.
With regard to non utf-8 encodings of html, of course there are reasons
for their use.
More information about the Python-ideas
mailing list