<div dir="ltr">The only thing i want from the ppt's is text and ignoring all graphical representations. I need the text to perform various nltk operations.<br></div><div class="gmail_extra"><br><br><div class="gmail_quote">
On Fri, May 30, 2014 at 11:54 PM, Alan Gauld <span dir="ltr"><<a href="mailto:alan.gauld@btinternet.com" target="_blank">alan.gauld@btinternet.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div class="HOEnZb"><div class="h5">On 30/05/14 10:41, Aaron Misquith wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Like pypdf is used to convert pdf to text; is there any library that is<br>
used in converting .ppt files to .txt? Even some sample programs will be<br>
helpful.<br>
</blockquote>
<br></div></div>
Bearing in mind that Powerpoint is intended for graphical presentations the text elements are not necessarily going to be useful. Often Powerpoint text is actually part of a graphic anyway.<br>
<br>
If the Powerpoint is just a set of bullet points (shame on the presenter!) you probably don't want the text unless you can<br>
also get the notes. I don't know of any libraries that can do that.<br>
<br>
But one option is that Open/Libre office can import Powerpoint and<br>
apparently has a Python API which you could use to drive an export<br>
from there. Just a thought...<span class="HOEnZb"><font color="#888888"><br>
<br>
-- <br>
Alan G<br>
Author of the Learn to Program web site<br>
<a href="http://www.alan-g.me.uk/" target="_blank">http://www.alan-g.me.uk/</a><br>
<a href="http://www.flickr.com/photos/alangauldphotos" target="_blank">http://www.flickr.com/photos/<u></u>alangauldphotos</a><br>
<br>
______________________________<u></u>_________________<br>
Tutor maillist - <a href="mailto:Tutor@python.org" target="_blank">Tutor@python.org</a><br>
To unsubscribe or change subscription options:<br>
<a href="https://mail.python.org/mailman/listinfo/tutor" target="_blank">https://mail.python.org/<u></u>mailman/listinfo/tutor</a><br>
</font></span></blockquote></div><br></div>