Analyse of PDF (or EPS?)

Thu Nov 20 14:01:20 EST 2003

On Thu, 20 Nov 2003 14:48:52 +0100, Johan Holst Nielsen <johan at weknowthewayout.com> wrote:

>Hi,
>
>Is there any Python packages to analyse or get some information out of 
>an PDF document...
>
>Like where the text are placed - what text are placed - fonts, embedded 
>PDFs/fonts/images etc.
>
>Please let me know :)
>
IIRC you can get the full specs of pdf and eps at the adobe site.
Some stuff is easy to get at, some may be compressed and/or encrypted,
and not so easy.

Conforming docs are supposed to be structured so that it is relatively easy
to grab chunks of document and do the kinds of things printing business s/w does,
like rotating and scaling and reordering pages, etc.

There are whole books on pdf and postscript also, which you could browse at a good
tech book store or tech library.

Regards,
Bengt Richter