To: java-user@lucene.apache.org
Subject: Re: indexing pdfs
hi sachin the link wat u gave me only a zip file and an exe file for
downoad. and this zip file also contains no class files.but wouldn't we
be requiring a jar file or class file ???
On 3/8/07, Kainth, Sachin [EMAIL PROTECTED] wrote:
Hi
hi can some one help me by giving any sample programs for indexing pdfs and
.doc files
thanks
regards
ashwin
-
From: ashwin kumar [mailto:[EMAIL PROTECTED]
Sent: 08 March 2007 09:37
To: java-user@lucene.apache.org
Subject: indexing pdfs
hi can some one help me by giving any sample programs for indexing pdfs
and .doc files
thanks
regards
ashwin
This message has been scanned for viruses by MailControl
For DOC files you can use the Jakarta POI library. Text extraction is
outlined here: http://jakarta.apache.org/poi/hwpf/quick-guide.html
Ulf
On 08.03.2007, at 10:37, ashwin kumar wrote:
hi can some one help me by giving any sample programs for indexing
pdfs and .doc files
PDFTextStripper();
// get text from doc using stripper
return stripper.getText(doc);
}
Sachin
-Original Message-
From: ashwin kumar [mailto:[EMAIL PROTECTED]
Sent: 08 March 2007 09:37
To: java-user@lucene.apache.org
Subject: indexing pdfs
hi can some one help me
[mailto:[EMAIL PROTECTED]
Sent: 08 March 2007 11:35
To: java-user@lucene.apache.org
Subject: Re: indexing pdfs
Is the only way index pdfs is to convert it into a text and then only
index it ???
On 3/8/07, Kainth, Sachin [EMAIL PROTECTED] wrote:
Hi Aswin,
You can try pdfbox to convert the pdf
.
The only other way I have heard of is to use Ifilters. I believe
SeekAFile does indexing of pdfs.
Sachin
-Original Message-
From: ashwin kumar [mailto:[EMAIL PROTECTED]
Sent: 08 March 2007 11:35
To: java-user@lucene.apache.org
Subject: Re: indexing pdfs
Is the only way index pdfs
Hi,
Here it is:
http://www.seekafile.org/
-Original Message-
From: ashwin kumar [mailto:[EMAIL PROTECTED]
Sent: 08 March 2007 13:07
To: java-user@lucene.apache.org
Subject: Re: indexing pdfs
hi again
do we have to download any jar files to run this program if so can u
give me
Message-
From: ashwin kumar [mailto:[EMAIL PROTECTED]
Sent: 08 March 2007 13:07
To: java-user@lucene.apache.org
Subject: Re: indexing pdfs
hi again
do we have to download any jar files to run this program if so can u
give me the link pls
ashwin
On 3/8/07, Kainth, Sachin [EMAIL PROTECTED] wrote