Read the table data from PDF files in Python
pkpearson at nowhere.invalid
Wed Apr 24 11:16:40 EDT 2019
On Wed, 24 Apr 2019 02:36:27 -0700 (PDT), mrawat213 at gmail.com wrote:
> Anyone knows how to fetch the data from PDF file having tables with
> other text in Python. Need to fetch some cell values based on
> condition from that table.
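You might find pdftotext useful. It is a command-line tool that
ships with Xpdf and Poppler.
The command

    pdftotext -layout somefile.pdf

produces a file named somefile.txt in which the text keeps, as far
as possible, the physical layout of the original page, so table
columns stay lined up.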
This will be completely useless if the original PDF is just
a PDF wrapper around an image, because then there is no text in
the file to extract. That's what document scanners tend to produce.
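
For picking cell values out of that output from Python, something
along these lines might be a starting point. It is only a sketch:
the helper names (pdf_to_layout_text, split_cells, select_rows) are
made up for illustration, and it assumes that table columns in the
-layout output are separated by at least two spaces and that no cell
contains two spaces in a row. Real tables may need more careful,
position-based parsing.

```python
import re
import subprocess


def pdf_to_layout_text(pdf_path):
    """Run `pdftotext -layout` and return the extracted text.

    Passing "-" as the output file asks pdftotext to write to stdout
    instead of creating somefile.txt. Requires pdftotext on PATH.
    """
    result = subprocess.run(
        ["pdftotext", "-layout", pdf_path, "-"],
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout


def split_cells(line):
    """Split one line of -layout output into cells.

    Assumption: columns are separated by runs of two or more
    whitespace characters, and no cell contains such a run itself.
    """
    return [cell for cell in re.split(r"\s{2,}", line.strip()) if cell]


def select_rows(text, predicate):
    """Return the cell lists of all lines for which predicate(cells) is true."""
    rows = []
    for line in text.splitlines():
        cells = split_cells(line)
        if cells and predicate(cells):
            rows.append(cells)
    return rows
```

With those helpers, rows of a three-column table whose second column
exceeds 5 could be selected like this:

    text = pdf_to_layout_text("somefile.pdf")
    rows = select_rows(
        text,
        lambda c: len(c) == 3 and c[1].isdigit() and int(c[1]) > 5,
    )

The predicate also filters out the ordinary text around the table,
since those lines rarely split into the expected number of cells.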
To email me, substitute nowhere->runbox, invalid->com.