Fw: PDF library for reading PDF files
Cameron Laird
claird at lairds.com
Tue Jan 20 10:32:48 EST 2004
In article <400CF2E3.29506EAE at netsurf.de>,
Andreas Lobinger <andreas.lobinger at netsurf.de> wrote:
>Aloha,
>
>Peter Galfi schrieb:
.
.
.
>> having to implement all the decompressions, etc. The "information" I am
>> trying to extract from the PDF file is the text, specifically in a way to
>> keep the original paragraphs of the text. I have seen so far one shareware
.
.
.
>As others wrote here, the simplest solution is to use a external
>pdf-2-text programm and postprocess the data. Read comp.text.pdf
>
>There is no simple and consistent way to extract text from a .pdf
>because there are many ways to set text. The optical impression
.
.
.
I want to emphasize that final sentence. If you insist on pursuing
this, though, refer to <URL:
http://phaseit.net/claird/comp.text.pdf/PDF_converters.html#pdf2txt >.
--
Cameron Laird <claird at phaseit.net>
Business: http://www.Phaseit.net
More information about the Python-list
mailing list