<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<meta content="text/html;charset=UTF-8" http-equiv="Content-Type">
</head>
<body bgcolor="#ffffff" text="#000000">
<br>
<br>
Paul McNett wrote:
<blockquote cite="mid:494FF2BB.2050402@ulmcnett.com" type="cite">Colin
J. Williams wrote:
<br>
<blockquote type="cite">The ReportLab toolkit appears to be concerned
with building Portable
<br>
Document Files. I would be interested in any utility which will read
<br>
any pdf - for example, to convert pdf -> html
<br>
</blockquote>
<br>
I don't know of any Python utility to do this, but pdftohtml,
pdftotext, pdftoppm, and pdftops exist on my Ubuntu Linux system.
<br>
<br>
Paul
<br>
<br>
</blockquote>
Thanks. pdftohtml is an experimental version, last updated in 2006: <cite><b>pdftohtml</b>.sourceforge.net<br>
<br>
There is another converter, last updated in 2004:
<a class="moz-txt-link-freetext" href="http://www.intrapdf.com/">http://www.intrapdf.com/</a><br>
<br>
The Debian version appears to have been last updated in 2003:
<a class="moz-txt-link-freetext" href="http://freshmeat.net/projects/pdftohtml/">http://freshmeat.net/projects/pdftohtml/</a><br>
<br>
Aside from the Adobe service, the most recently updated converter may be:
<a class="moz-txt-link-freetext" href="http://www.abbyyusa.com/shop/PDFT.htm">http://www.abbyyusa.com/shop/PDFT.htm</a><br>
<br>
I was hoping that there might be something in Python.<br>
<br>
Colin W.<br>
</cite>
</body>
</html>