htmllib: CR in CDATA
Mark Nottingham
mnot at pobox.com
Mon Jun 21 22:55:24 EDT 1999
It appears that htmllib doesn't ignore returns in CDATA fields, as HTML 4.0
says it should:
http://www.w3.org/TR/REC-html40/types.html#type-cdata
http://www.w3.org/TR/REC-html40/sgml/dtd.html
As a result, htmllib improperly parses any CDATA element that wraps across a
line; this affects elements like
<A href="foo.
gif">
I'm happy to work up a patch, but I thought I'd ask around first. It may be
a bit involved to fix it properly; every CDATA should be handled this way,
which practically means almost every tag attribute.
Regards,
Mark Nottingham, Melbourne Australia
mnot at pobox.com http://www.mnot.net/
More information about the Python-list
mailing list