Fw: PDF library for reading PDF files
Robert Kern
rkern at ucsd.edu
Sun Jan 18 18:20:52 EST 2004
Cameron Laird wrote:
> In article <Xns9474CBDE9B2D7cpl19ghumspamgourmet at 62.153.159.134>,
> Harald Massa <cpl.19.ghum at spamgourmet.com> wrote:
>
>>>I am looking for a library in Python that would read PDF files and I
>>>could extract information from the PDF with it. I have searched with
>>>google, but only found libraries that can be used to write PDF files.
>>
>>reportlab has a lib called pagecatcher; it is fully supported with python,
>>it is not free.
>>
>>Harald
>
>
> ReportLab's libraries are great things--but they do not "extract
> information from the PDF" in the sense I believe the original
> questioner intended.
No, but ReportLab (the company) has a product separate from reportlab
(the package) called PageCatcher that does exactly what the OP asked
for. It is not open source, however, and costs a chunk of change.
More information about the Python-list
mailing list