Re: query for search through lucene for BLOB

Steven Rowe Wed, 12 Jul 2006 07:21:44 -0700

Hi Sudarshan,

When your question is Java usage related, you will almost certainly getbetter responses by asking just on the Java User list. Oddly enough,hitting all of the mailing lists for the project at once with the samequestion is likely to *reduce* your chances of getting polite/on-topicresponses.

To index PDF files, you need to first extract their content, and Lucenedoes not do this for you. Here is the Lucene FAQ entry on the topic:


<http://wiki.apache.org/jakarta-lucene/LuceneFAQ#head-c45f8b25d786f4e384936fa93ce1137a23b7e422>

Chapter 7 of the excellent book Lucene in Action<http://lucenebook.com/> also covers this topic.

Once you have extracted the text for a document, you'll want to store akey for the document in a separate field in your Lucene index. Thenwhen you have hits from a search, you'll be able to use the DB key toretrieve the PDF blob from Oracle, then maybe save the it to a temp fileand start Adobe Reader to display the doc(s). Or something like that.


Steve

sudarshan angirash wrote:

hi all

i have some PDF files stored in Oracle 9i as BLOB.
now i want to search for a string in those pdf files using Lucene. then i
want to show the selected PDF files which contains The String.

if you can give me any pointers about how to do it, then it will be a gr8
help for me.

regards
sudarshan




---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: query for search through lucene for BLOB

Reply via email to