Hi,
I have a requirement to build an intranet style full text searching
system for a relatively small set (less < 500)
of fairly lengthy word and PDF documents.
What they want is all hits for search terms on a particular document to
be displayed - together with the context. So if "policy" appears 5 times
in a particular document, then 5 hits would be displayed in the search
results.
I have been learning nutch and lucene and am now about to look into the
details of the source code, if anyone has any information on how this
might be implemented I would appreciate it. I guess this question might
be more relevant if asked on the lucene lists, however I thought I would
start here.
Regards
John Reidy.
-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems? Stop! Download the new AJAX search engine that makes
searching your log files as easy as surfing the web. DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general