Analyse of PDF (or EPS?)
bokr at oz.net
Thu Nov 20 20:01:20 CET 2003
On Thu, 20 Nov 2003 14:48:52 +0100, Johan Holst Nielsen <johan at weknowthewayout.com> wrote:
>Is there any Python packages to analyse or get some information out of
>an PDF document...
>Like where the text are placed - what text are placed - fonts, embedded
>Please let me know :)
IIRC you can get the full specs of pdf and eps at the adobe site.
Some stuff is easy to get at, some may be compressed and/or encrypted,
and not so easy.
Conforming docs are supposed to be structured so that it is relatively easy
to grab chunks of document and do the kinds of things printing business s/w does,
like rotating and scaling and reordering pages, etc.
There are whole books on pdf and postscript also, which you could browse at a good
tech book store or tech library.
More information about the Python-list