Yes. This can be done using Lucene. But, this is subject to a few liberal interpretations of what you asked for. To wit, I am assuming that you want to find interesting documents from a bunch of documents, not just search a single document for matches.
The span queries that another poster mentioned would be good as would sloppy phrase queries. Depending on which European languages you need to handle, there may be some work you need to do to deal with morphological analysis. Lucene has reasonable support for English and somewhat more rudimentary support for a few other European languages. Support for Asian languages is very basic at best. On Sun, May 10, 2009 at 7:43 PM, apgw <[email protected]> wrote: > > I am new to Lucene. Is this the right utility to use for the following use > case: > > 1) Find a search term - eg. 'lithium battery' in some technical rich-text > data (can be in any european language), 4K - 64K size, and return the exact > position in the text so that the occurrence can be turned into a hyperlink > within the text, and the full text returned to the user with the embedded > hyperlinks which he can select if he is interested. > > 2) Also find and hyperlink "lithium batteries", or "lithium hydride > batteries" (with lower ranking) and so on. > >
