Analyse of PDF (or EPS?)
Bengt Richter
bokr at oz.net
Thu Nov 20 14:01:20 EST 2003
On Thu, 20 Nov 2003 14:48:52 +0100, Johan Holst Nielsen <johan at weknowthewayout.com> wrote:
>Hi,
>
>Is there any Python packages to analyse or get some information out of
>an PDF document...
>
>Like where the text are placed - what text are placed - fonts, embedded
>PDFs/fonts/images etc.
>
>Please let me know :)
>
IIRC you can get the full specs for PDF and EPS at the Adobe site.
Some content is easy to get at; some may be compressed and/or encrypted,
and is not so easy to extract.
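
To illustrate the easy-versus-compressed distinction, here is a minimal Python sketch that pulls stream bodies out of raw PDF bytes and inflates the ones marked /FlateDecode. The names extract_streams and make_sample are hypothetical. The sample is a PDF-like fragment, not a complete document. The sketch assumes flat stream dictionaries with a direct integer /Length; encrypted files, other filters, or nested dictionaries would need a real parser.

```python
import re
import zlib

# A flat dictionary (no nested << >>) followed by the "stream" keyword and EOL.
STREAM_START = re.compile(rb"<<([^<>]*)>>\s*stream\r?\n")
# A direct integer /Length; the lookahead rejects indirect refs like "/Length 5 0 R".
LENGTH = re.compile(rb"/Length\s+(\d+)\b(?!\s+\d+\s+R)")


def extract_streams(pdf_bytes):
    """Return a list of decoded stream bodies (None where decoding is not handled)."""
    results = []
    for match in STREAM_START.finditer(pdf_bytes):
        dictionary = match.group(1)
        length = LENGTH.search(dictionary)
        if length is None:
            continue  # indirect or missing /Length: out of scope for this sketch
        start = match.end()
        data = pdf_bytes[start:start + int(length.group(1))]
        if b"/Filter" not in dictionary:
            results.append(data)  # stored as-is, the easy case
        elif b"/FlateDecode" in dictionary:
            try:
                results.append(zlib.decompress(data))
            except zlib.error:
                results.append(None)  # damaged, or possibly encrypted
        else:
            results.append(None)  # some other filter this sketch ignores
    return results


def make_sample():
    """Build a tiny PDF-like fragment with one plain and one deflated stream."""
    plain = b"BT /F1 12 Tf (Hello) Tj ET"
    packed = zlib.compress(b"BT /F1 12 Tf (Compressed) Tj ET")
    return b"".join([
        b"%PDF-1.4\n",
        b"1 0 obj\n<< /Length %d >>\nstream\n" % len(plain),
        plain,
        b"\nendstream\nendobj\n",
        b"2 0 obj\n<< /Length %d /Filter /FlateDecode >>\nstream\n" % len(packed),
        packed,
        b"\nendstream\nendobj\n",
    ])
```

Calling extract_streams(make_sample()) should give back both content streams as readable bytes, the second one after inflation.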
Conforming docs are supposed to be structured so that it is relatively easy
to grab chunks of the document and do the kinds of things printing-business
software does, like rotating, scaling, and reordering pages.
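
As a small sketch of what that structure buys you, the Python below scans raw bytes for page objects and for the cross-reference pointer that sits at the end of a file. The function names and the SAMPLE bytes are hypothetical, and the offset 123 in the sample is a dummy value. A naive scan like this assumes the objects are stored uncompressed; it is not a substitute for a real parser.

```python
import re

# "/Type /Page" but not "/Type /Pages" (the page-tree node).
PAGE_OBJ = re.compile(rb"/Type\s*/Page(?![A-Za-z])")
# The file trailer ends with: startxref, a byte offset, then %%EOF.
STARTXREF = re.compile(rb"startxref\s+(\d+)\s+%%EOF")


def count_pages(pdf_bytes):
    """Count page objects by scanning for their /Type entry."""
    return len(PAGE_OBJ.findall(pdf_bytes))


def last_xref_offset(pdf_bytes):
    """Return the byte offset given by the last startxref, or None if absent."""
    offsets = STARTXREF.findall(pdf_bytes)
    return int(offsets[-1]) if offsets else None


# Hypothetical PDF-like fragment: a catalog, a page tree, and two pages.
SAMPLE = (
    b"%PDF-1.4\n"
    b"1 0 obj\n<< /Type /Catalog /Pages 2 0 R >>\nendobj\n"
    b"2 0 obj\n<< /Type /Pages /Kids [3 0 R 4 0 R] /Count 2 >>\nendobj\n"
    b"3 0 obj\n<< /Type /Page /Parent 2 0 R >>\nendobj\n"
    b"4 0 obj\n<< /Type/Page /Parent 2 0 R >>\nendobj\n"
    b"startxref\n123\n%%EOF\n"
)
```

A tool that reorders pages would go further and rewrite the /Kids array in the page tree, which is beyond this sketch.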
There are also whole books on PDF and PostScript, which you could browse at a good
tech bookstore or technical library.
Regards,
Bengt Richter