Where do I download the org.textmining package?


Dmitry Goldenberg wrote:
There are a few things you can try.

1. Take a look at org.textmining's Word text extractor:


All you have to do is this:
new WordExtractor().extractText(inputStream)

2. There is also the POI extractor:


All you do is:

// "is" is an InputStream opened on the Word file
WordDocument wd = new WordDocument(is);
StringWriter docTextWriter = new StringWriter();
wd.writeAllText(new PrintWriter(docTextWriter));
String text = docTextWriter.toString();

3. I'd also check out the following:


here: http://aperture.sourceforge.net/doc/javadoc/index.html
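
Since the original question asks for the text line by line, here is a rough sketch of that last step. It assumes the text has already been pulled into a String by one of the extractors above, and that paragraph breaks show up as \r, \n, or \r\n (an assumption; check what your extractor actually emits). The class name LineByLine, the method splitLines, and the sample string are made up for illustration and are not part of POI or org.textmining.

```java
import java.util.Arrays;
import java.util.List;

public class LineByLine {

    // Splits already-extracted document text into lines.
    // Accepts \r\n, \r, or \n as a line break (assumed, not verified
    // against any particular extractor's output).
    static List<String> splitLines(String text) {
        return Arrays.asList(text.split("\\r\\n|\\r|\\n"));
    }

    public static void main(String[] args) {
        // Hypothetical sample standing in for real extractor output.
        String text = "First paragraph\rSecond paragraph\r\nThird paragraph";
        for (String line : splitLines(text)) {
            System.out.println(line);
        }
    }
}
```

Note that String.split drops trailing empty strings, so blank lines at the very end of the document are not returned.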

Hope this helps,
- Dmitry


From: Henry Lu [mailto:[EMAIL PROTECTED]
Sent: Thu 5/17/2007 1:19 PM
To: poi-user@jakarta.apache.org
Subject: reading MS word file

Is there example code to read the text of an MS Word file line by line?
All I am interested in is the text, regardless of format, style, font...


To unsubscribe, e-mail: [EMAIL PROTECTED]
Mailing List:     http://jakarta.apache.org/site/mail2.html#poi
The Apache Jakarta Poi Project:  http://jakarta.apache.org/poi/

