Thanks for all resonses. I just concentrated on the "for row in ..." statement. Now I have another question. I would like to read a document with various structures such as title, subtitle, paragraph, table (as I did previously) and bullet etc. How can I get these contents for further analysis? Thanks. B.
--- On Thu, 3/8/12, Vernon Cole <vernondc...@gmail.com> wrote: From: Vernon Cole <vernondc...@gmail.com> Subject: Re: [python-win32] a trivial question To: "cool_go_blue" <cool_go_b...@yahoo.com> Cc: python-win32@python.org Date: Thursday, March 8, 2012, 2:48 PM self.doc.Tables(1).Cell(row,2) is not a string, and therefore has no .split() method. str(self.doc.Tables(1).Cell(row,2)) returns a string, so it does have a split() method and therefore str(self.doc.Tables(1).Cell(row,2)).split() is correct, but str(self.doc.Tables(1).Cell(row,2).split()) is not. Note the different position on the double parens )) -- Vernon On Thu, Mar 8, 2012 at 10:14 AM, cool_go_blue <cool_go_b...@yahoo.com> wrote: I try to use comprehensions when I am learning Python. After opening a word document, i try to read the 2nd column of a table for each row. I print out the words as follows: for row in range(1,len(self.doc.Tables(1).Rows)+1): for word in str(self.doc.Tables(1).Cell(row,2)).split(): if word not in stopwords: print word But I get a runtime error when I have the following code: content = [[word for word in str(self.doc.Tables(1).Cell(row,2).split()) if word not in stopwords] for row in range(1,len(self.doc.Tables(1).Rows)+1)] The error is as follows: Traceback (most recent call last): File "J:\MyProjects\Python\VectorSpaceTry\src\ReadCorpus.py", line 111, in <module> array.setup() File "J:\MyProjects\Python\VectorSpaceTry\src\ReadCorpus.py", line 79, in setup for row in range(1,len(self.doc.Tables(1).Rows)+1) File "F:\Softwares\Working\Languages\Python27\lib\site-packages\win32com\client\__init__.py", line 465, in __getattr__ raise AttributeError("'%s' object has no attribute '%s'" % (repr(self), attr)) AttributeError: '<win32com.gen_py.Microsoft Word 14.0 Object Library.Cell instance at 0x51383312>' object has no attribute 'split' what is wrong with my code? Thanks. B. _______________________________________________ python-win32 mailing list python-win32@python.org http://mail.python.org/mailman/listinfo/python-win32
_______________________________________________ python-win32 mailing list python-win32@python.org http://mail.python.org/mailman/listinfo/python-win32