version. But if I just save the old
DOC file by new Word on XP, eveything is fine.
Thanks very much for helps,
Lisheng
-Original Message-
From: Otis Gospodnetic [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 24, 2004 10:24 AM
To: Lucene Users List
Subject: Re: worddoucments search
As I
, eveything is fine.
Thanks very much for helps,
Lisheng
-Original Message-
From: Otis Gospodnetic [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 24, 2004 10:24 AM
To: Lucene Users List
Subject: Re: worddoucments search
As I just answered in a separate email to Ryan - we used textmining.org
: worddoucments search
As I just answered in a separate email to Ryan - we used textmining.orglibrary, too,
as an example of something that is easier to use thanPOI. It's been a while since I
wrote that chapter, so it slipped mymind when I replied. Yes, use textmining.org
first, you'll be able
get it as
lucene document format
- Original Message -
From: Otis Gospodnetic [EMAIL PROTECTED]
To: Lucene Users List [EMAIL PROTECTED]
Sent: Tuesday, August 24, 2004 11:54 PM
Subject: Re: worddoucments search
As I just answered in a separate email to Ryan - we used
/search the
previous posting. it should help understanding about lucene more...
- Original Message -
From: Karthik N S [EMAIL PROTECTED]
To: Lucene Users List [EMAIL PROTECTED]
Sent: Wednesday, August 25, 2004 4:21 PM
Subject: RE: worddoucments search
Hi
Santosh
Please
-
From: Otis Gospodnetic [EMAIL PROTECTED]
To: Lucene Users List [EMAIL PROTECTED]
Sent: Tuesday, August 24, 2004 11:54 PM
Subject: Re: worddoucments search
As I just answered in a separate email to Ryan - we used
textmining.orglibrary, too, as an example of something that is easier
to use
I could ber wrong, but I don't think that there is an indexer for word
documents.
There's a Python version of Lucene called Lupy with a Python indexer
for all sorts of document types
(http://www.methods.co.nz/docindexer/). Would anyone be willing to
port those over. Although the MSWord
Is this a wind-up?
-Original Message-
From: Santosh [mailto:[EMAIL PROTECTED]
Sent: 24 August 2004 13:16
To: Lucene Users List
Subject: worddoucments search
Can lucene be able to search word documents? if so please give me information about it
regards
Santosh kumar
please look at Apache POI project.
http://jakarta.apache.org
Words documents can be extracted using POI apis and later can be indexed.
regards
- Original Message -
From: Santosh [EMAIL PROTECTED]
To: Lucene Users List [EMAIL PROTECTED]
Sent: Tuesday, August 24, 2004 6:00 PM
Subject:
Lucene isn't a doll made my Hasbro. :-)
David Townsend wrote:
Is this a wind-up?
-Original Message-
From: Santosh [mailto:[EMAIL PROTECTED]]
Sent: 24 August 2004 13:16
To: Lucene Users List
Subject: worddoucments search
Can lucene be able to search word documents? if so please
that is still in
the document for the purposes of revision marking. POI does not handle this.
-Ryan Ackley
- Original Message -
From: Chandan Tamrakar [EMAIL PROTECTED]
To: Lucene Users List [EMAIL PROTECTED]
Sent: Tuesday, August 24, 2004 7:31 AM
Subject: Re: worddoucments search
please
, August 24, 2004 7:41 AM
Subject: Re: worddoucments search
For Lucene in Action Erik and I wrote a little extensible framework for
indexing various documents, including MS Word. We used POI, so the
solution works on Winblows, UNIX/Linux, OSX I think the code is
bit too big for the list
: Tuesday, August 24, 2004 8:11 AM
Subject: RE: worddoucments search
Hi Santhosh,
Try out the below attached code.(POI.jar should be in your class
path)
public String getContent(InputStream reader) throws IOException {
ArrayList text = new ArrayList();
POIFSFileSystem fsys
-
From: Otis Gospodnetic [EMAIL PROTECTED]
To: Lucene Users List [EMAIL PROTECTED]
Sent: Tuesday, August 24, 2004 7:41 AM
Subject: Re: worddoucments search
For Lucene in Action Erik and I wrote a little extensible framework
for
indexing various documents, including MS Word. We used POI, so
14 matches
Mail list logo