Hi all.
I have a requirement to build an intranet style search engine for a small (<500) set of large Word and PDF documents. What is needed is for all hits (together with the context) of the search phrase in the documents to be returned.

As an example, if the search term is "policy" and the "operations manual" is searched there might be several hits in different sections of the document that would match policy and all would be displayed for the user?

This may be a question better answered on the lucene lists, however at this stage I am looking at the Nutch code and I am hoping there is a fairly high level solution.

Regards

John Reidy.

Reply via email to