As the author of both Word POI and textmining.org, I recommend using textmining.org. POI is for general purpose manipulation of Word documents. textmining's only purpose is extracting text.
Also, people recommend using POI for text extraction but the only place I've seen an actual how-to on this is in the "Lucene in Action" book. On 3/24/07, jafarim <[EMAIL PROTECTED]> wrote:
Can anyone make a comparison between the two, namely POI API and the one from textmining.org? On 3/24/07, Ryan Ackley <[EMAIL PROTECTED]> wrote: > > The site is down but you can download the word extractor library direct > here: > > http://www.textmining.org/textmining.zip > > Going to fix the site this weekend. > > On 3/24/07, Sami Siren <[EMAIL PROTECTED]> wrote: > > Antony Bowesman wrote: > > > > >> Are there other sollutions? > > > > There's also antiword [1] which can convert your .doc to plain text or > > PS, not sure how good it is. > > > > -- > > Sami Siren > > > > [1] http://www.winfield.demon.nl/ > > > > --------------------------------------------------------------------- > > To unsubscribe, e-mail: [EMAIL PROTECTED] > > For additional commands, e-mail: [EMAIL PROTECTED] > > > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > >
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]