Extract images from PDF files
David Lyon
david.lyon at preisshare.net
Mon Jul 27 23:37:39 EDT 2009
pdftohtml on sourceforge may help...
On Mon, 27 Jul 2009 19:52:01 -0700 (PDT), writeson <doug.farrell at gmail.com>
wrote:
> Hi all,
>
> I've looked around with Google quite a bit, but haven't found anything
> like what I'm looking for. Is there a Python library that will extract
> images from PDF files? My ultimate goal is to pull the images out, use
> the PIL library to reduce the size of the images and rebuild another
> PDF file that's an essentially "thumbnail" version of the original PDF
> file, smaller in size.
>
> We've been using imagick to extract the images, but it's difficult to
> script and slow to process the input PDF. Can someone suggest
> something better?
>
> Thanks in advance,
> Doug
More information about the Python-list
mailing list