[Python-Dev] cpython (2.7): #14538: HTMLParser can now parse correctly start tags that contain a bare /.

Georg Brandl g.brandl at gmx.net
Tue Apr 24 21:02:43 CEST 2012


On 24.04.2012 20:34, Benjamin Peterson wrote:
> 2012/4/24 Georg Brandl <g.brandl at gmx.net>:
>> On 19.04.2012 03:36, ezio.melotti wrote:
>>> http://hg.python.org/cpython/rev/36c901fcfcda
>>> changeset:   76413:36c901fcfcda
>>> branch:      2.7
>>> user:        Ezio Melotti <ezio.melotti at gmail.com>
>>> date:        Wed Apr 18 19:08:41 2012 -0600
>>> summary:
>>>   #14538: HTMLParser can now parse correctly start tags that contain a bare /.
>>
>>> diff --git a/Misc/NEWS b/Misc/NEWS
>>> --- a/Misc/NEWS
>>> +++ b/Misc/NEWS
>>> @@ -50,6 +50,9 @@
>>>  Library
>>>  -------
>>>
>>> +- Issue #14538: HTMLParser can now parse correctly start tags that contain
>>> +  a bare '/'.
>>> +
>>
>> I think that's misleading: there's no way to "correctly" parse malformed HTML.
> 
> There is in the sense that you can follow the HTML5 algorithm, which
> can "parse" any junk you throw at it.

Ah, good. Then I hope we are following the algorithm here (and are slowly
coming to use it for htmllib in general).
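
For illustration only (this is not part of the changeset): a minimal sketch
of feeding a start tag with a bare '/' to the parser. It assumes the Python 3
module name html.parser (on 2.7 the module is HTMLParser); TagCollector and
collect_start_tags are hypothetical helper names, and the exact attribute
list reported for junk input may differ between versions.

```python
from html.parser import HTMLParser  # Python 3 name; on 2.7 the module is HTMLParser


class TagCollector(HTMLParser):
    """Hypothetical helper that records every start tag with its attributes."""

    def __init__(self):
        super().__init__()
        self.start_tags = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs as reported by the parser
        self.start_tags.append((tag, attrs))


def collect_start_tags(markup):
    """Parse markup and return the recorded (tag, attrs) pairs."""
    parser = TagCollector()
    parser.feed(markup)
    parser.close()
    return parser.start_tags


# A start tag containing a bare '/' between the tag name and an attribute.
print(collect_start_tags('<a / href="x">link</a>'))
```

The point is only that the tag and its attribute are still reported instead
of the parser giving up on the malformed input.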

Georg


