Hi,
I have a requirement to build an intranet style full text searching system for a relatively small set (less < 500)
of fairly lengthy word and PDF documents.
What they want is all hits for search terms on a particular document to be displayed - together with the context. So if "policy" appears 5 times in a particular document, then 5 hits would be displayed in the search results.

I have been learning nutch and lucene and am now about to look into the details of the source code, if anyone has any information on how this might be implemented I would appreciate it. I guess this question might be more relevant if asked on the lucene lists, however I thought I would start here.

Regards

John Reidy.


-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to