[Patches] [ python-Patches-522587 ] Fixes pydoc http/ftp URL matching

noreply@sourceforge.net
Sat, 23 Mar 2002 18:06:00 -0800


Patches item #522587, was opened at 2002-02-25 18:50
You can respond by visiting: 
http://sourceforge.net/tracker/?func=detail&atid=305470&aid=522587&group_id=5470

Category: Library (Lib)
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Brian Quinlan (bquinlan)
>Assigned to: Neil Schemenauer (nascheme)
Summary: Fixes pydoc http/ftp URL matching

Initial Comment:
The current URL matching pattern used by pydoc only 
excludes whitespace. My patch also excludes the 
following characters:

' and " - so the quotes in <a href="..."> are not matched
< and > - as stated in RFC 1738:

"""The characters "<" and ">" are unsafe because they 
are used as the delimiters around URLs in free text"""

We don't want to include the delimiters as part of the 
URL, and an unescaped "<" in an attribute value is not 
legal markup.
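
The effect of the extra exclusions can be sketched with two simplified 
patterns. These are hypothetical stand-ins written for illustration, not 
the exact expressions from pydoc or from the attached patch; the names 
WHITESPACE_ONLY, STRICTER and first_url are assumptions.

```python
import re

# Hypothetical, simplified patterns for illustration only; they are
# not copied from pydoc or from the attached patch.
WHITESPACE_ONLY = re.compile(r"(?:http|ftp)://\S+")
STRICTER = re.compile(r"""(?:http|ftp)://[^\s'"<>]+""")


def first_url(pattern, text):
    """Return the first URL the pattern finds in text, or None."""
    match = pattern.search(text)
    return match.group(0) if match else None


html = '<a href="http://www.python.org/">Python</a>'
bracketed = "See <ftp://ftp.python.org/pub/> for downloads"

# Excluding only whitespace swallows the closing quote and the markup.
print(first_url(WHITESPACE_ONLY, html))
# Also excluding quotes and angle brackets stops at the closing quote.
print(first_url(STRICTER, html))
# The same holds for a URL delimited by "<" and ">" in free text.
print(first_url(WHITESPACE_ONLY, bracketed))
print(first_url(STRICTER, bracketed))
```

With the stricter character class, the match ends at the first quote or 
angle bracket, so the delimiter never becomes part of the link target.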

The patch also removes the word boundary requirement 
for http/ftp URLs, because otherwise the trailing "/" 
would not be included in a URL such as 
"http://www.python.org/".
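
The word boundary problem can be reproduced with a small sketch. The two 
patterns below are simplified assumptions for illustration and are not 
the expressions used in pydoc or in the patch; a trailing \b can only 
match between a word character and a non-word character, so it cannot 
match right after a final "/".

```python
import re

# Hypothetical patterns for illustration; not taken from pydoc.
WITH_BOUNDARY = re.compile(r"\b(?:http|ftp)://\S+\b")
WITHOUT_BOUNDARY = re.compile(r"(?:http|ftp)://\S+")

text = "Docs live at http://www.python.org/ today"

# "/" followed by a space is not a word boundary, so the regex engine
# backtracks to the boundary between "g" and "/" and drops the slash.
with_boundary = WITH_BOUNDARY.search(text).group(0)

# Without the trailing \b the match runs up to the whitespace.
without_boundary = WITHOUT_BOUNDARY.search(text).group(0)

print(with_boundary)
print(without_boundary)
```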

Attached is the patch and some simple test code.


