reason is performance. Allowing above means more complex query which causes more dealy in getting results. If you need these features, you know how to get them, but its tradeoff with performance. May be not if number of pages are less, it will on large scale.
-- Ravish. On 5/2/06, Ravi Chintakunta <[EMAIL PROTECTED]> wrote:
Lucene supports fuzzy, wildcard, range, proximity searches as listed here: http://lucene.apache.org/java/docs/queryparsersyntax.html But Nutch does not use all these capabilities. It is limited by query parsing in org.apache.nutch.analysis.NutchAnalysis and the query filters hosted in plugins. We have to modify the analyzer and add more plugins to Nutch to use the Lucene's query syntax. Or we have to directly use Lucene's Query Parser. I tried the second approach by modifying org.apache.nutch.searcher.IndexSearcher and that seems to work. Is there a reason that Nutch does not support the entire Lucene query syntax by default? Thanks in advance, Ravi Chintakunta
