reading MS word file

2007-05-17 Thread Henry Lu
Is there an example or sample code for reading the text of an MS Word file line by line? All I am interested in is the text, regardless of format, style, font... -Henry

RE: reading MS word file

2007-05-17 Thread Dmitry Goldenberg
Henry, There are a few things you can try.
1. Take a look at org.textmining's Word text extractor: org.textmining.text.extraction.WordExtractor. All you have to do is this: new WordExtractor().extractText(inputStream)
2. There is also the POI extractor:
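For the first option, here is a rough sketch of getting from extracted text to individual lines, assuming that extractText hands back the whole document as a single String. The TextExtractor interface, the WordTextLines class and the fake extractor in main are hypothetical stand-ins so the example compiles without the org.textmining jar; with the jar on the classpath, the lambda would presumably become in -> new WordExtractor().extractText(in).

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.List;
import java.util.stream.Collectors;

class WordTextLines {

    /** Hypothetical stand-in for a call such as WordExtractor.extractText(InputStream). */
    interface TextExtractor {
        String extractText(InputStream in) throws Exception;
    }

    /** Splits already-extracted text into lines; accepts \n, \r and \r\n terminators. */
    static List<String> splitLines(String text) {
        return text.lines().collect(Collectors.toList());
    }

    /** Runs the extractor on the stream, then returns the extracted text line by line. */
    static List<String> readLines(TextExtractor extractor, InputStream in) throws Exception {
        return splitLines(extractor.extractText(in));
    }

    public static void main(String[] args) throws Exception {
        // Fake extractor for demonstration only: it treats the bytes as UTF-8 text.
        // Assumption: a real Word extractor would be plugged in here instead.
        TextExtractor fake = in -> new String(in.readAllBytes(), StandardCharsets.UTF_8);

        InputStream in = new ByteArrayInputStream(
                "First paragraph\r\nSecond paragraph".getBytes(StandardCharsets.UTF_8));

        for (String line : readLines(fake, in)) {
            System.out.println(line);
        }
    }
}
```

Note that a "line" here is whatever the extractor separates with a line break, which for Word documents is more likely a paragraph than a visual line on the page.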

Visio API? converter to XML?

2007-05-17 Thread Dmitry Goldenberg
Hello, I was wondering if there are any capabilities, either already in POI or being planned for, to have a Java API for the Visio file format. Also, is there an open source Java converter from the native Visio format to XML? Any info would be appreciated. Thanks, - Dmitry