[issue2124] xml.sax and xml.dom fetch DTDs by default

A.M. Kuchling report at bugs.python.org
Sat Feb 23 18:52:58 CET 2008


A.M. Kuchling added the comment:

The solution of adding caching, If-Modified-Since, etc. is a good one,
but I quail in fear at the prospect of expanding the saxutils resolver
into a fully caching HTML agent that uses a cache across processes.  We
should really be encouraging people to use more capable libraries such
as httplib2 (http://code.google.com/p/httplib2/), but this is slightly
at war 
with the batteries-included philosophy.

So, I propose we:

* add warnings to the urllib, urllib2, saxutil module docs that parsing
can retrieve arbitrary resources over the network, and encourage the
user to use a smarter library such as httplib2.
* update the urllib2 HOWTO to mention this.

I'm willing to do the necessary writing.

----------
assignee:  -> akuchling
priority: urgent -> normal

__________________________________
Tracker <report at bugs.python.org>
<http://bugs.python.org/issue2124>
__________________________________


More information about the Python-bugs-list mailing list