Pulling out <TITLE></TITLE>
bac at OCF.Berkeley.EDU
Mon Nov 19 05:45:44 CET 2001
You could just read each page and use a regex to pull the title out.
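A minimal sketch of the regex approach, written for current Python 3 and not the code from the original message. The names `TITLE_RE`, `extract_title`, and `title_of_file` are made up for illustration, and the UTF-8 encoding is an assumption about the pages:

```python
import re

# Match the first <title>...</title> pair. IGNORECASE covers <TITLE>,
# and DOTALL lets the title text span more than one line.
TITLE_RE = re.compile(r"<title\b[^>]*>(.*?)</title>", re.IGNORECASE | re.DOTALL)


def extract_title(html):
    """Return the text of the first <title> element, or None if there is none."""
    match = TITLE_RE.search(html)
    if match is None:
        return None
    # Collapse runs of whitespace (including newlines) into single spaces.
    return " ".join(match.group(1).split())


def title_of_file(path):
    """Read one page from disk and return its title (hypothetical helper).

    Assumes the file is roughly UTF-8; undecodable bytes are replaced.
    """
    with open(path, encoding="utf-8", errors="replace") as handle:
        return extract_title(handle.read())
```

For a whole site you would probably walk the directory tree (for example with `os.walk`) and call `title_of_file` on each `.html` file. A regex like this is only as reliable as the markup is regular, so pages with a missing or unclosed title simply come back as `None`.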
On Sun, 18 Nov 2001, David A McInnis wrote:
> I am writing a script to catalog about 30,000 html pages on my site and need
> to pull out the value of <TITLE></TITLE>.
> I guess this is possible with htmllib, but I cannot figure it out.
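On the htmllib route: htmllib was removed in Python 3, so a parser-based version today would most likely use the standard `html.parser` module instead. The following is only a sketch under that assumption; `TitleParser` and `parse_title` are hypothetical names, not part of any library:

```python
from html.parser import HTMLParser


class TitleParser(HTMLParser):
    """Collect the text inside the first <title> element."""

    def __init__(self):
        super().__init__()
        self.in_title = False  # currently between <title> and </title>
        self.done = False      # first title has been fully read
        self.parts = []        # text chunks seen inside the title

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names, so <TITLE> arrives as "title".
        if tag == "title" and not self.done:
            self.in_title = True

    def handle_endtag(self, tag):
        if tag == "title" and self.in_title:
            self.in_title = False
            self.done = True

    def handle_data(self, data):
        if self.in_title:
            self.parts.append(data)


def parse_title(html):
    """Return the first title's text, or None if no closed <title> was found."""
    parser = TitleParser()
    parser.feed(html)
    parser.close()
    if not parser.done:
        return None
    # Collapse runs of whitespace into single spaces.
    return " ".join("".join(parser.parts).split())
```

This is slower than the regex but does not depend on the title sitting in one tidy pattern; for 30,000 pages either approach should be workable.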