Fw: PDF library for reading PDF files
rkern at ucsd.edu
Mon Jan 19 00:20:52 CET 2004
Cameron Laird wrote:
> In article <Xns9474CBDE9B2D7cpl19ghumspamgourmet at 220.127.116.11>,
> Harald Massa <cpl.19.ghum at spamgourmet.com> wrote:
>>>I am looking for a library in Python that would read PDF files and I
>>>could extract information from the PDF with it. I have searched with
>>>google, but only found libraries that can be used to write PDF files.
>>reportlab has a lib called pagecatcher; it is fully supported with python,
>>it is not free.
> ReportLab's libraries are great things--but they do not "extract
> information from the PDF" in the sense I believe the original
> questioner intended.
No, but ReportLab (the company) has a product separate from reportlab
(the package) called PageCatcher that does exactly what the OP asked
for. It is not open source, however, and costs a chunk of change.