[Chicago] scrape power point

Carl Karsten carl at personnelware.com
Thu Sep 23 17:44:37 CEST 2010


> Getting data isn't hard, it's the metadata that's difficult. I have lots of existing (mostly) HTML, Excel Spreadsheets, and Word docs, and Power Point

Do you have Python code to scrape the text from PowerPoint files?

I would like to be able to scrape the text from PowerPoint, Keynote,
and whatever else a presenter might use for PyCon talks.  I am sure
it's a previously solved problem, but it is currently low on my list of
things to even google.

Right now I don't even have a place to store the text, or a firm plan
for a UI to use it, but at least I am thinking about it.  If someone
hands me one of the pieces, that is one less thing for me to think about.

-- 
Carl K

