Analyse of PDF (or EPS?)

Bengt Richter bokr at
Thu Nov 20 20:01:20 CET 2003

On Thu, 20 Nov 2003 14:48:52 +0100, Johan Holst Nielsen <johan at> wrote:

>Is there any Python packages to analyse or get some information out of 
>an PDF document...
>Like where the text are placed - what text are placed - fonts, embedded 
>PDFs/fonts/images etc.
>Please let me know :)

IIRC you can get the full PDF and EPS specs at the Adobe site.
Some stuff is easy to get at; some may be compressed and/or encrypted,
and so not as easy.
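
To get a feel for which case a given file falls into, a rough first pass is to scan the raw bytes for a few well-known PDF keywords. The sketch below is only a heuristic, not a parser: the function name quick_pdf_survey is made up for illustration, and it assumes the relevant dictionaries are stored uncompressed (anything packed inside compressed object streams will not be seen).

```python
import re

def quick_pdf_survey(data: bytes) -> dict:
    """Heuristic scan of raw PDF bytes (hypothetical helper, not a real parser)."""
    # A PDF normally begins with a header line such as "%PDF-1.4".
    header = re.match(rb"%PDF-(\d+\.\d+)", data)
    return {
        "version": header.group(1).decode("ascii") if header else None,
        # An /Encrypt entry in the trailer signals an encrypted document.
        "encrypted": b"/Encrypt" in data,
        # /FlateDecode marks zlib-compressed streams.
        "flate_streams": len(re.findall(rb"/FlateDecode", data)),
        # Font dictionaries carry /Type /Font; \b avoids /FontDescriptor.
        "fonts": len(re.findall(rb"/Type\s*/Font\b", data)),
    }

if __name__ == "__main__":
    import sys
    with open(sys.argv[1], "rb") as f:
        print(quick_pdf_survey(f.read()))
```

Run it with a PDF path as the only argument. If "encrypted" comes back true or the font count looks too low, that is a hint the interesting parts are among the not-so-easy ones.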

Conforming docs are supposed to be structured so that it is relatively easy
to grab chunks of a document and do the kinds of things printing-business
software does, like rotating, scaling, and reordering pages.
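
As an illustration of that structure: a PDF ends with a "startxref" keyword followed by the byte offset of its cross-reference section and a "%%EOF" marker, and page objects are tagged /Type /Page. The following sketch uses those two facts to locate the cross-reference offset and count pages. The function names are hypothetical, and the page count assumes page objects are stored uncompressed, which need not hold for newer files.

```python
import re

def find_startxref(data: bytes):
    """Return the byte offset given after the last 'startxref', or None."""
    # Incrementally updated files may contain several; the last one wins.
    matches = re.findall(rb"startxref\s+(\d+)\s+%%EOF", data)
    return int(matches[-1]) if matches else None

def count_page_objects(data: bytes) -> int:
    """Count '/Type /Page' entries; \\b keeps '/Pages' tree nodes out."""
    return len(re.findall(rb"/Type\s*/Page\b", data))

if __name__ == "__main__":
    import sys
    with open(sys.argv[1], "rb") as f:
        raw = f.read()
    print("xref offset:", find_startxref(raw))
    print("page objects:", count_page_objects(raw))
```

Tools that rotate or reorder pages work from these same entry points, though a real one would follow the cross-reference table properly rather than pattern-match the bytes.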

There are also whole books on PDF and PostScript, which you could browse at a good
tech bookstore or tech library.

Bengt Richter
