Using Beautiful Soup to entangle bookmarks.html

Francach uid09012_ti at martin-collins.de
Fri Sep 8 13:14:25 CEST 2006


Hi,

thanks for the helpful reply.
I wanted to do two things - learn to use Beautiful Soup and bring out
all the information
in the bookmarks file to import into another application. So I need to
be able to travel down the tree in the bookmarks file. bookmarks seems
to use header tags which can then contain a tags where the href
attributes are. What I don't understand is how to create objects which
can then be used to return the information in the next level of the
tree.

Thanks again,
Martin.



George Sakkis wrote:
> Francach wrote:
> > Hi,
> >
> > I'm trying to use the Beautiful Soup package to parse through the
> > "bookmarks.html" file which Firefox exports all your bookmarks into.
> > I've been struggling with the documentation trying to figure out how to
> > extract all the urls. Has anybody got a couple of longer examples using
> > Beautiful Soup I could play around with?
> >
> > Thanks,
> > Martin.
>
> from BeautifulSoup import BeautifulSoup
> urls = [tag['href'] for tag in
>         BeautifulSoup(open('bookmarks.html')).findAll('a')]
> 
> Regards,
> George




More information about the Python-list mailing list