[Python-bugs-list] [ python-Bugs-535285 ] urllib, fragment identifiers and 404s

noreply@sourceforge.net noreply@sourceforge.net
Tue, 26 Mar 2002 09:00:51 -0800


Bugs item #535285, was opened at 2002-03-26 17:00
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=105470&aid=535285&group_id=5470

Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Tadgh O'Leary (tadgher)
Assigned to: Nobody/Anonymous (nobody)
Summary: urllib, fragment identifiers and 404s

Initial Comment:
URLOpener raises a 404 IOError accessing a non-existent fragment 
identifier on certain web servers (in fact, all that I've tested, 
except Apache).

I couldn't find any user-agent guidelines, but 
every user-agent I've tested returns the document with a 200 
response code (including lynx).

To repeat:
Python 2.2 (#1, 
Jan 18 2002, 09:22:45) 
[GCC 2.95.3 20010315 (release) 
[FreeBSD]] on freebsd4
Type "help", "copyright", "credits" or 
"license" for more information.
>>> import urllib
>>> 
urllib.URLopener().open('http://www.apache.org/#fake')
<addinfourl 
at 136169196 whose fp = <open file '<socket>', mode 'rb' at 
0x81e7480>>
>>> 
urllib.URLopener().open('http://www.microsoft.com/#fake')
Traceback 
(most recent call last):
(snipped)
>>> 
urllib.URLopener().open('http://www.sun.com/#fake')
Traceback 
(most recent call last):
(snipped)
>>> 
urllib.URLopener().open('http://www.zeus.com/#fake')
Traceback 
(most recent call 
last):
(snipped)
urllib.URLopener().open('http://www.lotus.com/#fake')
Traceback 
(most recent call last):
(snipped)

----------------------------------------------------------------------

You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=105470&aid=535285&group_id=5470