better regular expression?

Vivek vivek.bhaskar at
Tue Dec 7 01:52:06 CET 2004


I am trying to construct a regular expression using the re module that
matches for
1. my hostname
2. absolute from the root URLs including just "/"
3. relative URLs.

Basically I want the attern to not match for URLs that are not on my

The following statement satisfies numbers 1 and 2, but not 3:

line =

An improvement that also partially satisfies number 3 is

line =

This is not complete because if the relative url is less than seven
characters, than it will not match.

Any suggestions?


More information about the Python-list mailing list