Is there any library that can extract titles from PDFs?
fredrik at pythonware.com
Wed Jul 16 09:36:13 CEST 2008
> I want to write a script which will rename PDFs according to their
> titles. I want to know if there is any library that can extract
> titles(the first line of the PDF) from PDFs.
Mathieu Fenniak's PyPdf should be able to do this:
(but note that "the first line of the PDF" may not be the first thing
that's rendered on the page, so don't expect such an approach to work
for all files)
More information about the Python-list