html parser?

Laszlo Zsolt Nagy gandalf at designaproduct.biz
Tue Oct 18 12:51:34 CEST 2005


Christoph Söllner wrote:

>Hi *,
>
>is there a html parser available, which could i.e. extract all links from a 
>given text like that:
>"""
><a href="foo.php?param1=test">BAR<img src="none.gif"></a>
><a href="foo2.php?param1=test&param2=test">BAR2</a>
>"""
>
>and return a set of dicts like that:
>"""
>{
>  ['foo.php','BAR','param1','test'],
>  ['foo2.php','BAR2','param1','test','param2','test']
>}
>"""
>
>thanks,
>Chris 
>  
>
I asked the same question a week ago, and the answer I got was a really 
beautiful one. :-)

http://www.crummy.com/software/BeautifulSoup/

  Les




More information about the Python-list mailing list