Pulling out <TITLE></TITLE>
bokr at accessone.com
Wed Nov 21 10:13:05 CET 2001
On Sun, 18 Nov 2001 20:45:44 -0800, Brett Cannon <bac at OCF.Berkeley.EDU> wrote:
>You could just read each page and use a regex to fetch it:
Hm. What happens with the following page?
<!-- (old title kept for reference, or possible restoring)
<TITLE>This is the old title</TITLE>
<TITLE>Official new title</TITLE>
>On Sun, 18 Nov 2001, David A McInnis wrote:
>> I am writing a script to catalog about 30,000 html pages on my site and need
>> to pull out the value of <TITLE></TITLE>.
>> I guess this is possible with htmllib, but I cannot figure it out.
More information about the Python-list