Iterating over PDF documents
Tomas
1cdq3arii001 at sneakemail.com
Fri Nov 12 09:44:41 EST 2004
"Peter Maas" <peter at somewhere.com> wrote in message
news:cmv9j2$et5$1 at swifty.westend.com...
> I'm trying to edit a PDF document line-wise. This is more difficult
> than I thought, because PDF uses a mixture of all line terminators
> available in *X, Mac and Win so that utilizing "for line in file"
> is difficult.
If you're just going to extract some text or do searching, you can try the
pdftotext utility and convert the document(s) to plain text.
http://www.snapfiles.com/get/pdftotext.html (Windows).
-Tomas
More information about the Python-list
mailing list