[Python-bugs-list] [Bug #126351] urlparse.scheme_chars and string.letters

noreply@sourceforge.net noreply@sourceforge.net
Tue, 19 Dec 2000 08:49:44 -0800


Bug #126351, was updated on 2000-Dec-19 08:36
Here is a current snapshot of the bug.

Project: Python
Category: None
Status: Closed
Resolution: Fixed
Bug Group: None
Priority: 5
Submitted by: doerwalter
Assigned to : gvanrossum
Summary: urlparse.scheme_chars and string.letters

Details: urlparse.scheme_chars has the same bug as urllib.quote (see bug 111961),
because it used string.letters which includes more than the upper and 
lowercase letter, which results in scheme_chars being
'abcdefghijklmnopqrstuvwxyz\337\340\341\342\343\344\345\346\347\350\351\352\353\354\355\356\357\360\361\362\363\364\365\366\370\371\372\373\374\375\376\377ABCDEFGHIJKLMNOPQRSTUVWXYZ\300\301\302\303\304\305\306\307\310\311\312\313\314\315\316\317\320\321\322\323\324\325\326\330\331\332\333\334\335\3360123456789+-.'.

RFC 1738 Section 2.1 states the following:

   Scheme names consist of a sequence of characters. The lower case
   letters "a"--"z", digits, and the characters plus ("+"), period
   ("."), and hyphen ("-") are allowed. For resiliency, programs
   interpreting URLs should treat upper case letters as equivalent to
   lower case in scheme names (e.g., allow "HTTP" as well as "http").



Follow-Ups:

Date: 2000-Dec-19 08:49
By: gvanrossum

Comment:
Fixed with brute force.  urlparse.py rev. 1.26.

-------------------------------------------------------

For detailed info, follow this link:
http://sourceforge.net/bugs/?func=detailbug&bug_id=126351&group_id=5470