[Python-bugs-list] [ python-Bugs-620705 ] websucker relative-URL errors

noreply@sourceforge.net noreply@sourceforge.net
Wed, 09 Oct 2002 03:30:52 -0700


Bugs item #620705, was opened at 2002-10-09 12:30
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=105470&aid=620705&group_id=5470

Category: Demos and Tools
Group: Python 2.2.2
Status: Open
Resolution: None
Priority: 5
Submitted By: Alex Martelli (aleax)
Assigned to: Nobody/Anonymous (nobody)
Summary: websucker relative-URL errors

Initial Comment:
reproduce easily with, e.g.:
python websucker.py -v http://www.aleax.it

gives a series of error messages such as:

Check http://www.aleax.it/./py2.htm
Error ('http error', 404, 'Object Not Found')
 HREF  http://www.aleax.it/./py2.htm
  from http://www.aleax.it/./Python/ (///./py2.htm)

Check http://www.aleax.it/p1.htm
Error ('http error', 404, 'Object Not Found')
 HREF  http://www.aleax.it/p1.htm
  from http://www.aleax.it/./TutWin32/index.htm (///p1.htm)

but the relevant snippets of the HTML sources are e.g:
in Python/index.html:
<A href="./py2.htm">
in TutWin32/index.html:
<a href="p1.htm">

i.e. both relative URLs, so should resolve to the URLs
of the files that ARE present, Python/py2.htm and
TutWin32/p1.htm respectively.

And indeed /usr/bin/wget has no problem fetching
the whole small site.

Pls let me know if you want me to explore the bug further
and prepare a patch in time for 2.2.2 release -- otherwise
I think this shd at least be documented as a known bug
(making websucker close to unusable, alas).


Alex





----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=105470&aid=620705&group_id=5470