WordExtractor.getText() returns a 0x15 control character on Word docs.

maxSchlein Mon, 11 Jan 2010 06:30:06 -0800

It appears that when I use WordExtractor.getText() on a document that
contains tables, it returns a 0x15 control character for every table column.
Is there a way to have this filtered out other than looping through the
returned text?  Or is there something else I should be doing?  Thanks in
advance for the help.



The reason this is an issue is that I am using Lucene's WhitespaceAnalyzer,
and it does not treat this character as whitespace, so a search for a given
word or phrase that happens to be next to one of these characters finds
nothing.
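
One possible workaround, sketched here without any claim that it is the
recommended POI approach, is to replace the control characters with spaces
before the text reaches the analyzer. The class name ControlCharCleaner and
the exact character range are assumptions made for illustration; the range
covers 0x15 and the other ASCII control characters while leaving tab, line
feed, and carriage return alone. The string passed to clean() would be
whatever WordExtractor.getText() returned.

```java
import java.util.regex.Pattern;

// Hypothetical helper (not part of POI or Lucene): replaces ASCII control
// characters in extracted text with spaces so that a whitespace-based
// tokenizer splits words on them.
final class ControlCharCleaner {

    // Assumed range: every ASCII control character except tab (0x09),
    // line feed (0x0A), and carriage return (0x0D), plus DEL (0x7F).
    private static final Pattern CONTROL_CHARS =
            Pattern.compile("[\\x00-\\x08\\x0B\\x0C\\x0E-\\x1F\\x7F]");

    private ControlCharCleaner() {
    }

    // Returns the text with each matched control character replaced by a
    // single space; all other characters are left unchanged.
    static String clean(String extracted) {
        return CONTROL_CHARS.matcher(extracted).replaceAll(" ");
    }

    public static void main(String[] args) {
        // Stand-in for the string returned by WordExtractor.getText().
        String sample = "cell one\u0015cell two\u0007next row";
        System.out.println(clean(sample));
    }
}
```

Replacing with a space rather than deleting keeps neighbouring words apart,
which matters for a tokenizer that splits only on whitespace.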


-- 
View this message in context: 
http://old.nabble.com/WordExtractor.getText%28%29-returns-%15-on-word-docs.-tp27111308p27111308.html
Sent from the POI - User mailing list archive at Nabble.com.


