Hi Otis May be i am asking for more.. is there some class that accepts this document field terminology, would you be in a position to name out the class name,
and also i did not understand clearly what u mean by creating Field form the text and than create document form these field.... The Q are... 1) How to decide how much or what to be in a field.... 2) and than how much and what to be in document. Considering the PDF file might have 300+ pages.. than how much logical field and hence document can be created.... I think i am acting like an idiot asking the above Q... may be things will be more cleared if i know about this demo class and the understanding of the Fields and document categorization. wait for your positive reply. Cheers Vin. -----Original Message----- From: Otis Gospodnetic [mailto:[EMAIL PROTECTED]] Sent: Thursday, October 17, 2002 3:00 PM To: Lucene Users List Subject: Re: Integrating the PDF Extract With Lucene!!!! Once you extract the content of the PDF and have it in your String variables, you can create Fields with them, then create Documents with Fields, and finally add those Documents to IndexWriter, which indexes them. Please look at the demo code that comes with Lucene first. Otis --- Vinod Bhagat <[EMAIL PROTECTED]> wrote: > Hi Gurus > > I manage to get the content form the PDF file using the JPedal > libraries. > Now i need to use this content to Index inside Lucene, so that PDF > (binaries) files can be searched/indexed by Lucene. > > And i am new with Lucene. Can anyone share there experience of > indexing the > extracted content from PDF into Lucene. How to go about it, i have no > idea > at the moment? > > Wait for the positive and early response. > > Best Regards. > > Vin > > -- > To unsubscribe, e-mail: > <mailto:[EMAIL PROTECTED]> > For additional commands, e-mail: > <mailto:[EMAIL PROTECTED]> > __________________________________________________ Do you Yahoo!? Faith Hill - Exclusive Performances, Videos & More http://faith.yahoo.com -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]> -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>
