More regex help

Kirk Strauser kirk at
Wed Sep 24 18:42:03 CEST 2008

At 2008-09-24T16:25:02Z, "Support Desk" <support.desk.ipg at> writes:

> I am working on a python webcrawler, that will extract all links from an
> html page, and add them to a queue, The problem I am having is building
> absolute links from relative links, as there are so many different types of
> relative links. If I just append the relative links to the current url, some
> websites will send it into a never-ending loop. 

>>> import urllib
>>> urllib.basejoin('',
>>> urllib.basejoin('',

Kirk Strauser
The Day Companies

More information about the Python-list mailing list