[Tutor] PDF to text conversion
David
david at abbottdavid.com
Tue Apr 21 21:01:03 CEST 2009
bob gailer wrote:
> Robert Berman wrote:
>> Hi,
>>
>> I must convert a history file in PDF format that goes from May of 1988
>> to current date. Readings are taken twice weekly and consist of the
>> date taken mm/dd/yy and the results appearing as a 10 character
>> numeric + special characters sequence. This is obviously an easy setup
>> for a very small database application with the date as the key, the
>> result string as the data.
>>
>> My problem is converting the PDF file into a text file which I can
>> then read and process. I do not see any free Python libraries having
>> this capacity. I did see a PDFPILOT program for Windows but this
>> application is being developed on Linux and should also run on
>> Windows, so I do not want to incorporate a Windows-only application.
How about pyPdf?
http://pybrary.net/pyPdf/
And an example:
http://code.activestate.com/recipes/511465/
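
A rough sketch of the whole job, in case it helps. This is not the code from the recipe above. It uses pypdf, the maintained successor to pyPdf, where the calls are spelled PdfReader, reader.pages and page.extract_text() (the original pyPdf spelled them PdfFileReader, getPage(i) and extractText()). The function names pdf_to_text and parse_readings are made up for illustration, and the line layout (date, whitespace, then a 10-character result with no spaces in it) is only a guess based on the description; text extracted from a real PDF may come out arranged differently.

```python
import re

# Assumed layout (a guess): "mm/dd/yy", whitespace, then a 10-character
# result containing no spaces.
READING = re.compile(r"(\d{2}/\d{2}/\d{2})\s+(\S{10})(?!\S)")


def pdf_to_text(path):
    """Return the text of every page in the PDF, joined by newlines."""
    # Imported here so parse_readings() can be used without pypdf installed.
    from pypdf import PdfReader

    reader = PdfReader(path)
    # Guard against pages that yield no text (e.g. scanned images).
    return "\n".join(page.extract_text() or "" for page in reader.pages)


def parse_readings(text):
    """Map each date string to its result string."""
    return {date: result for date, result in READING.findall(text)}
```

With that, parse_readings(pdf_to_text("history.pdf")) would give a dict keyed by date, where "history.pdf" is a placeholder file name. It is worth printing the output of pdf_to_text first to see how the lines really come out before trusting the pattern.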
-david
--
Powered by Gentoo GNU/Linux
http://linuxcrazy.com