Scraping Wikipedia with Python
Dotan Cohen
dotancohen at gmail.com
Tue Aug 11 13:33:39 EDT 2009
I plan on making a geography-learning Anki [1] deck, and Wikipedia has
the information that I need in nicely formatted tables on the side of
each country's page. Has someone already invented a wheel to parse and
store that data (scrape)? It is probably not difficult to code, and
within the Wikipedia license, but if that wheel has already been
invented then I don't want to redo it. I tried googling for a
Wikipedia-specific solution but found none, is there a general purpose
solution that I could use?
Note that I am a regular Wikipedia contributor and plan on staying
within the realm of Wikipedia's rules.
[1] http://ichi2.net/anki/
--
Dotan Cohen
http://what-is-what.com
http://gibberish.co.il
More information about the Python-list
mailing list