htmllib: CR in CDATA

Mark Nottingham mnot at pobox.com
Mon Jun 21 22:55:24 EDT 1999


It appears that htmllib doesn't ignore returns in CDATA fields, as HTML 4.0
says it should:
http://www.w3.org/TR/REC-html40/types.html#type-cdata
http://www.w3.org/TR/REC-html40/sgml/dtd.html

As a result, htmllib improperly parses any CDATA element that wraps across a
line; this affects elements like

<A href="foo.
gif">

I'm happy to work up a patch, but I thought I'd ask around first. It may be
a bit involved to fix it properly; every CDATA should be handled this way,
which practically means almost every tag attribute.

Regards,


Mark Nottingham, Melbourne Australia
mnot at pobox.com  http://www.mnot.net/






More information about the Python-list mailing list