[python-win32] a trivial question
cool_go_blue
cool_go_blue at yahoo.com
Thu Mar 8 23:01:38 CET 2012
Thanks for all the responses. I was concentrating only on the "for row in ..." statement. Now I have another question. I would like to read a document that contains various structures, such as titles, subtitles, paragraphs, tables (as I did previously), bullets, etc. How can I get these contents for further analysis? Thanks.
B.
--- On Thu, 3/8/12, Vernon Cole <vernondcole at gmail.com> wrote:
From: Vernon Cole <vernondcole at gmail.com>
Subject: Re: [python-win32] a trivial question
To: "cool_go_blue" <cool_go_blue at yahoo.com>
Cc: python-win32 at python.org
Date: Thursday, March 8, 2012, 2:48 PM
self.doc.Tables(1).Cell(row,2) is not a string, and therefore has no .split() method.
str(self.doc.Tables(1).Cell(row,2)) returns a string, so it does have a split() method and therefore
str(self.doc.Tables(1).Cell(row,2)).split() is correct, but
str(self.doc.Tables(1).Cell(row,2).split()) is not. Note the different position of the double parens ))
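To see the difference without Word running, here is a minimal sketch. FakeCell is a hypothetical stand-in I made up for the COM Cell object (it is not part of win32com); the only thing it shares with the real object is that it has no .split() method but can be passed to str(). The sample text and stopwords are also made up.

```python
class FakeCell(object):
    """Hypothetical stand-in for a Word Cell COM object (assumption, not win32com)."""

    def __init__(self, text):
        self._text = text

    def __str__(self):
        # str(cell) works, but the cell itself has no .split() method.
        return self._text


stopwords = {"the", "a"}
cells = [FakeCell("the quick fox"), FakeCell("a lazy dog")]

# Correct: convert the cell to a string first, then split the string.
content = [[word for word in str(cell).split() if word not in stopwords]
           for cell in cells]

# Incorrect: .split() is looked up on the cell object itself, before str()
# ever runs, so Python raises AttributeError.
try:
    str(cells[0].split())
    failed = False
except AttributeError:
    failed = True
```

In the real code, each cell would be self.doc.Tables(1).Cell(row,2) and the outer loop would run over the row numbers, as in the original comprehension.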
--
Vernon
On Thu, Mar 8, 2012 at 10:14 AM, cool_go_blue <cool_go_blue at yahoo.com> wrote:
I am trying to use comprehensions while learning Python. After opening a Word document, I try to read the 2nd column of a table for each row. I print out the words as follows:
for row in range(1,len(self.doc.Tables(1).Rows)+1):
    for word in str(self.doc.Tables(1).Cell(row,2)).split():
        if word not in stopwords:
            print word
But I get a runtime error when I have the following code:
content = [[word for word in str(self.doc.Tables(1).Cell(row,2).split()) if word not in stopwords]
           for row in range(1,len(self.doc.Tables(1).Rows)+1)]
The error is as follows:
Traceback (most recent call last):
  File "J:\MyProjects\Python\VectorSpaceTry\src\ReadCorpus.py", line 111, in <module>
    array.setup()
  File "J:\MyProjects\Python\VectorSpaceTry\src\ReadCorpus.py", line 79, in setup
    for row in range(1,len(self.doc.Tables(1).Rows)+1)
  File "F:\Softwares\Working\Languages\Python27\lib\site-packages\win32com\client\__init__.py", line 465, in __getattr__
    raise AttributeError("'%s' object has no attribute '%s'" % (repr(self), attr))
AttributeError: '<win32com.gen_py.Microsoft Word 14.0 Object Library.Cell instance at 0x51383312>' object has no attribute 'split'
What is wrong with my code? Thanks.
B.