getting text inside the HTML tag

Bruno Desthuilliers bruno.42.desthuilliers at wtf.websiteburo.oops.com
Mon Jul 16 09:46:36 CEST 2007


kyosohma at gmail.com a écrit :
> On Jul 14, 12:47 pm, Nikola Skoric <nick-n... at net4u.hr> wrote:
>> I'm using sgmllib.SGMLParser to parse HTML. I have successfuly parsed start
>> tags by implementing start_something method. But, now I have to fetch the
>> string inside the start tag and end tag too. I have been reading through
>> SGMLParser documentation, but just can't figure that out... can somebody
>> help? :-)
>>
>> --
>> "Now the storm has passed over me
>> I'm left to drift on a dead calm sea
>> And watch her forever through the cracks in the beams
>> Nailed across the doorways of the bedrooms of my dreams"
> 
> Oi! Try Beautiful Soup instead. That seems to be the defacto HTML
> parser for Python:

Nope. It's the defacto parser for HTML-like tag soup !-)




More information about the Python-list mailing list