For PDF, you need to extract the text from the PDF files using the PDFBox
library, and for Word documents you can use the Apache POI APIs. There are
messages posted on the Lucene list related to your queries. As for the
database, I guess someone must have done it. :)
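A minimal sketch of how the extraction step might be organized: choose an
extractor by file extension, get plain text back, and hand that text to the
indexer. The names TextExtractor, ExtractorRegistry and ExtractDemo are
hypothetical and used only for illustration. The actual PDFBox and POI calls
are deliberately left out, since their exact APIs are not given in this thread
and depend on the library version; only a plain-text extractor is wired in.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

/** Hypothetical interface: turns one file into plain text for the indexer. */
interface TextExtractor {
    String extract(Path file) throws IOException;
}

/** Hypothetical registry that picks an extractor by file extension. */
class ExtractorRegistry {
    private final Map<String, TextExtractor> byExtension = new HashMap<>();

    /** Registers an extractor for an extension such as "pdf" or "doc". */
    void register(String extension, TextExtractor extractor) {
        byExtension.put(extension.toLowerCase(Locale.ROOT), extractor);
    }

    /** Returns the lower-cased extension without the dot, or "" if there is none. */
    static String extensionOf(String fileName) {
        int dot = fileName.lastIndexOf('.');
        return dot < 0 ? "" : fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
    }

    /** Extracts text with the registered extractor, or fails for unknown types. */
    String extract(Path file) throws IOException {
        String ext = extensionOf(file.getFileName().toString());
        TextExtractor extractor = byExtension.get(ext);
        if (extractor == null) {
            throw new IllegalArgumentException("No extractor registered for: " + ext);
        }
        return extractor.extract(file);
    }
}

class ExtractDemo {
    public static void main(String[] args) throws IOException {
        ExtractorRegistry registry = new ExtractorRegistry();
        // Plain text needs no extra library.
        registry.register("txt",
                f -> new String(Files.readAllBytes(f), StandardCharsets.UTF_8));
        // A "pdf" extractor would wrap PDFBox and a "doc" extractor would wrap
        // POI here (assumption: each can return the document text as a String).

        Path sample = Files.createTempFile("sample", ".txt");
        Files.write(sample, "hello lucene".getBytes(StandardCharsets.UTF_8));
        System.out.println(registry.extract(sample)); // prints: hello lucene
        Files.delete(sample);
    }
}
```

The extracted string is what you would then add as a text field to the Lucene
document. Keeping extraction behind one small interface means a new file type
only needs one more register call.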
The PDF and WORD stuff has been done too: have a look at
http://www.zilverline.org.
Michael Franken
I recently joined the list and haven't gone through any previous mails. If
you have any mails or related code, please forward them to me.
- Original Message -
From: Chandan Tamrakar [EMAIL PROTECTED]
To: Lucene Users List [EMAIL PROTECTED]
Sent: Thursday, August 19, 2004 3:47 PM
Subject: Re: searchhelp
For PDF, you need to extract the text from the PDF files using the PDFBox
library, and for Word documents you can use the Apache POI APIs.
For PDF you can refer to www.pdfbox.org, and please check the Apache POI
project on the jakarta.apache.org site for indexing MS documents.
- Original Message -
From: Santosh [EMAIL PROTECTED]
To: Lucene Users List [EMAIL PROTECTED]
Sent: Thursday, August 19, 2004 4:09 PM
Subject: Re: searchhelp
I recently joined the list and haven't gone through any previous mails. If
you have any mails or related code, please forward them to me.
- Original Message -
From: David Townsend [EMAIL PROTECTED]
To: Lucene Users List [EMAIL PROTECTED]
Sent: Thursday, August 19, 2004 4:17 PM
Subject: RE: searchhelp
JGURU FAQ
http://www.jguru.com/faq/Lucene
OFFICIAL FAQ
http://lucene.sourceforge.net/cgi-bin/faq/faqmanager.cgi
MAIL ARCHIVE
http://www.mail-archive.com/[EMAIL PROTECTED]/
Hope this helps.