[Python-bugs-list] [ python-Bugs-620705 ] websucker relative-URL errors
noreply@sourceforge.net
noreply@sourceforge.net
Wed, 09 Oct 2002 03:30:52 -0700
Bugs item #620705, was opened at 2002-10-09 12:30
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=105470&aid=620705&group_id=5470
Category: Demos and Tools
Group: Python 2.2.2
Status: Open
Resolution: None
Priority: 5
Submitted By: Alex Martelli (aleax)
Assigned to: Nobody/Anonymous (nobody)
Summary: websucker relative-URL errors
Initial Comment:
reproduce easily with, e.g.:
python websucker.py -v http://www.aleax.it
gives a series of error messages such as:
Check http://www.aleax.it/./py2.htm
Error ('http error', 404, 'Object Not Found')
HREF http://www.aleax.it/./py2.htm
from http://www.aleax.it/./Python/ (///./py2.htm)
Check http://www.aleax.it/p1.htm
Error ('http error', 404, 'Object Not Found')
HREF http://www.aleax.it/p1.htm
from http://www.aleax.it/./TutWin32/index.htm (///p1.htm)
but the relevant snippets of the HTML sources are e.g:
in Python/index.html:
<A href="./py2.htm">
in TutWin32/index.html:
<a href="p1.htm">
i.e. both relative URLs, so should resolve to the URLs
of the files that ARE present, Python/py2.htm and
TutWin32/p1.htm respectively.
And indeed /usr/bin/wget has no problem fetching
the whole small site.
Pls let me know if you want me to explore the bug further
and prepare a patch in time for 2.2.2 release -- otherwise
I think this shd at least be documented as a known bug
(making websucker close to unusable, alas).
Alex
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=105470&aid=620705&group_id=5470