RE: worddoucments search

2004-08-30 Thread Zhang, Lisheng
version. But if I just save the old DOC file by new Word on XP, eveything is fine. Thanks very much for helps, Lisheng -Original Message- From: Otis Gospodnetic [mailto:[EMAIL PROTECTED] Sent: Tuesday, August 24, 2004 10:24 AM To: Lucene Users List Subject: Re: worddoucments search As I

Re: worddoucments search

2004-08-30 Thread Steven Rowe
, eveything is fine. Thanks very much for helps, Lisheng -Original Message- From: Otis Gospodnetic [mailto:[EMAIL PROTECTED] Sent: Tuesday, August 24, 2004 10:24 AM To: Lucene Users List Subject: Re: worddoucments search As I just answered in a separate email to Ryan - we used textmining.org

Re: worddoucments search

2004-08-25 Thread Santosh
: worddoucments search As I just answered in a separate email to Ryan - we used textmining.orglibrary, too, as an example of something that is easier to use thanPOI. It's been a while since I wrote that chapter, so it slipped mymind when I replied. Yes, use textmining.org first, you'll be able

Re: worddoucments search

2004-08-25 Thread Otis Gospodnetic
get it as lucene document format - Original Message - From: Otis Gospodnetic [EMAIL PROTECTED] To: Lucene Users List [EMAIL PROTECTED] Sent: Tuesday, August 24, 2004 11:54 PM Subject: Re: worddoucments search As I just answered in a separate email to Ryan - we used

Re: worddoucments search

2004-08-25 Thread Chandan Tamrakar
/search the previous posting. it should help understanding about lucene more... - Original Message - From: Karthik N S [EMAIL PROTECTED] To: Lucene Users List [EMAIL PROTECTED] Sent: Wednesday, August 25, 2004 4:21 PM Subject: RE: worddoucments search Hi Santosh Please

Introduction to Lucene [was Re: worddoucments search]

2004-08-25 Thread Steven Rowe
- From: Otis Gospodnetic [EMAIL PROTECTED] To: Lucene Users List [EMAIL PROTECTED] Sent: Tuesday, August 24, 2004 11:54 PM Subject: Re: worddoucments search As I just answered in a separate email to Ryan - we used textmining.orglibrary, too, as an example of something that is easier to use

Re: worddoucments search

2004-08-24 Thread Don Vaillancourt
I could ber wrong, but I don't think that there is an indexer for word documents. There's a Python version of Lucene called Lupy with a Python indexer for all sorts of document types (http://www.methods.co.nz/docindexer/). Would anyone be willing to port those over. Although the MSWord

RE: worddoucments search

2004-08-24 Thread David Townsend
Is this a wind-up? -Original Message- From: Santosh [mailto:[EMAIL PROTECTED] Sent: 24 August 2004 13:16 To: Lucene Users List Subject: worddoucments search Can lucene be able to search word documents? if so please give me information about it regards Santosh kumar

Re: worddoucments search

2004-08-24 Thread Chandan Tamrakar
please look at Apache POI project. http://jakarta.apache.org Words documents can be extracted using POI apis and later can be indexed. regards - Original Message - From: Santosh [EMAIL PROTECTED] To: Lucene Users List [EMAIL PROTECTED] Sent: Tuesday, August 24, 2004 6:00 PM Subject:

Re: worddoucments search

2004-08-24 Thread Don Vaillancourt
Lucene isn't a doll made my Hasbro. :-) David Townsend wrote: Is this a wind-up? -Original Message- From: Santosh [mailto:[EMAIL PROTECTED]] Sent: 24 August 2004 13:16 To: Lucene Users List Subject: worddoucments search Can lucene be able to search word documents? if so please

Textmining.org IS NOT POI (was Re: worddoucments search)

2004-08-24 Thread Ryan Ackley
that is still in the document for the purposes of revision marking. POI does not handle this. -Ryan Ackley - Original Message - From: Chandan Tamrakar [EMAIL PROTECTED] To: Lucene Users List [EMAIL PROTECTED] Sent: Tuesday, August 24, 2004 7:31 AM Subject: Re: worddoucments search please

Re: worddoucments search

2004-08-24 Thread Ryan Ackley
, August 24, 2004 7:41 AM Subject: Re: worddoucments search For Lucene in Action Erik and I wrote a little extensible framework for indexing various documents, including MS Word. We used POI, so the solution works on Winblows, UNIX/Linux, OSX I think the code is bit too big for the list

Re: worddoucments search

2004-08-24 Thread Ryan Ackley
: Tuesday, August 24, 2004 8:11 AM Subject: RE: worddoucments search Hi Santhosh, Try out the below attached code.(POI.jar should be in your class path) public String getContent(InputStream reader) throws IOException { ArrayList text = new ArrayList(); POIFSFileSystem fsys

Re: worddoucments search

2004-08-24 Thread Otis Gospodnetic
- From: Otis Gospodnetic [EMAIL PROTECTED] To: Lucene Users List [EMAIL PROTECTED] Sent: Tuesday, August 24, 2004 7:41 AM Subject: Re: worddoucments search For Lucene in Action Erik and I wrote a little extensible framework for indexing various documents, including MS Word. We used POI, so