HTML Parsing
Victor Noagbodji
noagbodjivictor at gmail.com
Sat Jun 28 23:22:51 EDT 2008
> Hi everyone
Hello
> I am trying to build my own web crawler for an experiement and I don't
> know how to access HTTP protocol with python.
urllib2: http://docs.python.org/lib/module-urllib2.html
> Also, Are there any Opensource Parsing engine for HTML documents
> available in Python too? That would be great.
BeautifulSoup:
http://www.crummy.com/software/BeautifulSoup/
http://www.crummy.com/software/BeautifulSoup/documentation.html
All the best
--
NOAGBODJI Paul Victor
More information about the Python-list
mailing list