reason is performance.  Allowing the above means more complex queries, which
cause more delay in getting results.  If you need these features, you know how
to get them, but it's a tradeoff with performance.  It may not matter if the
number of pages is small, but it will at large scale.
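
To make the cost argument concrete, here is a rough sketch of why a prefix or
wildcard term is more expensive than a plain term.  This is a toy, not Lucene
or Nutch code: the class name, the method name and the tiny in-memory term
dictionary are all made up for illustration.  The idea it shows is that a
pattern such as "luc*" has to be expanded into every matching term in the
index, and each of those terms then needs its own postings lookup, while an
exact term needs only one.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.SortedSet;
import java.util.TreeSet;

// Toy illustration only (hypothetical names, not Lucene or Nutch classes).
class PrefixExpansionSketch {

    // Returns every term in the sorted dictionary that starts with the prefix.
    // A prefix query has to visit one postings list per term returned here;
    // an exact term query visits exactly one.
    static List<String> expandPrefix(SortedSet<String> dictionary, String prefix) {
        List<String> matches = new ArrayList<>();
        // tailSet jumps to the first term >= prefix, so we skip earlier terms.
        for (String term : dictionary.tailSet(prefix)) {
            if (!term.startsWith(prefix)) {
                break; // terms are sorted, so no later term can match either
            }
            matches.add(term);
        }
        return matches;
    }

    public static void main(String[] args) {
        // Assumed sample terms; a real index has millions of them.
        SortedSet<String> dictionary = new TreeSet<>();
        dictionary.add("lucene");
        dictionary.add("lucid");
        dictionary.add("luck");
        dictionary.add("lunar");
        dictionary.add("nutch");
        dictionary.add("query");

        List<String> expanded = expandPrefix(dictionary, "luc");
        System.out.println("exact term lookups for 'nutch': 1");
        System.out.println("term lookups for 'luc*': " + expanded.size() + " " + expanded);
    }
}
```

On a small index the expansion is cheap, but the number of matching terms
grows with the size of the term dictionary, which is where the delay shows up
at large scale.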

-- Ravish.


On 5/2/06, Ravi Chintakunta <[EMAIL PROTECTED]> wrote:

Lucene supports fuzzy, wildcard, range, proximity searches as listed
here: http://lucene.apache.org/java/docs/queryparsersyntax.html

But Nutch does not use all these capabilities. It is limited by query
parsing in org.apache.nutch.analysis.NutchAnalysis and the query
filters hosted in plugins.

We have to modify the analyzer and add more plugins to Nutch to use
Lucene's query syntax. Or we have to directly use Lucene's Query
Parser. I tried the second approach by modifying
org.apache.nutch.searcher.IndexSearcher and that seems to work.

Is there a reason that Nutch does not support the entire Lucene query
syntax by default?

Thanks in advance,
Ravi Chintakunta
