<br><br><div class="gmail_quote">On Fri, Apr 3, 2009 at 8:20 PM, Sridhar Ratnakumar <span dir="ltr"><<a href="mailto:sridhar.ratna@gmail.com">sridhar.ratna@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
<div class="im">On 3/26/09 3:29 PM, M Kumar wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
I need to read a PDF file and extract data from it. Is there anyone who can guide me?<br>
</blockquote></div>
pyPdf?<br>
<br>
<a href="http://pybrary.net/pyPdf/" target="_blank">http://pybrary.net/pyPdf/</a></blockquote><div><br>To give my $0.02: I have used both pyPdf and PDFMiner<br>in an open source project that measures the accessibility of PDF documents.<br>
I initially wrote the library using PDFMiner, but found that it failed<br>to read documents more often than pyPdf did, especially large ones.<br>So I rewrote the library using pyPdf, and the experience was<br>
better. I also noticed that pyPdf handles encrypted documents better<br>than PDFMiner does.<br><br>pyPdf is not perfect: I ran into a few issues when<br>reading certain encrypted documents. However, if your PDF files<br>
are mostly unencrypted, I would suggest pyPdf as a better<br>choice than PDFMiner.<br><br> </div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
<br>
There is also reportlab toolkit<br>
<br>
<a href="http://www.reportlab.org/rl_toolkit.html" target="_blank">http://www.reportlab.org/rl_toolkit.html</a><div><div></div><div class="h5"><br>
<br>
_______________________________________________<br>
BangPypers mailing list<br>
<a href="mailto:BangPypers@python.org" target="_blank">BangPypers@python.org</a><br>
<a href="http://mail.python.org/mailman/listinfo/bangpypers" target="_blank">http://mail.python.org/mailman/listinfo/bangpypers</a><br>
</div></div></blockquote></div><br><br clear="all"><br>-- <br>-Anand<br><br><br><br>