[Chennaipy] To extract text from docx file using python
vishnu prabha b v
vishnuprabhabv97 at gmail.com
Fri Jun 24 08:17:07 EDT 2022
from docx import document
document = Document('sample.docx')
type(document)
document.paragraphs
type(document.paragraphs)
document.paragraphs(0)
document.paragraphs[0].text
document.paragraphs[1].text
index = 0
for para in document.paragraphs:
index+=1
if (len(para.text)>0):
print("\n paragraph",index,"is")
print(para.text)
In this ,I have used pip install python-docx
even after installed , i have found No module found error.
[image: image.png]
help me to fix this problem and is this is the correct way to extract txt
from docs?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.python.org/pipermail/chennaipy/attachments/20220624/f9d1a339/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image.png
Type: image/png
Size: 51359 bytes
Desc: not available
URL: <https://mail.python.org/pipermail/chennaipy/attachments/20220624/f9d1a339/attachment-0001.png>
More information about the Chennaipy
mailing list