Performance might be a reason, but wouldn't only the queries that include wildcard or fuzzy operators be slowed down, rather than all queries? The performance of regular plain-text searches shouldn't be affected.
Any thoughts?

Thanks,
Ravi Chintakunta

On 5/3/06, Ravish Bhagdev <[EMAIL PROTECTED]> wrote:
The reason is performance. Allowing the above means more complex queries, which cause more delay in getting results. If you need these features, you know how to get them, but it's a tradeoff with performance. It may not matter if the number of pages is small, but it will on a large scale.

-- Ravish.

On 5/2/06, Ravi Chintakunta <[EMAIL PROTECTED]> wrote:
>
> Lucene supports fuzzy, wildcard, range, and proximity searches as listed
> here: http://lucene.apache.org/java/docs/queryparsersyntax.html
>
> But Nutch does not use all these capabilities. It is limited by query
> parsing in org.apache.nutch.analysis.NutchAnalysis and the query
> filters hosted in plugins.
>
> We have to modify the analyzer and add more plugins to Nutch to use
> Lucene's query syntax. Or we have to directly use Lucene's Query
> Parser. I tried the second approach by modifying
> org.apache.nutch.searcher.IndexSearcher and that seems to work.
>
> Is there a reason that Nutch does not support the entire Lucene query
> syntax by default?
>
> Thanks in advance,
> Ravi Chintakunta
>
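The cost question raised at the top of this thread (do only wildcard or fuzzy queries pay extra?) can be illustrated with a toy inverted index. This is a minimal sketch and not Nutch or Lucene code: ToyIndex and all of its methods are hypothetical names, and it assumes a sorted term dictionary. Under that assumption a plain term query is a single dictionary lookup, while a prefix wildcard has to walk every matching dictionary term and merge the document lists.

```java
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

// Toy inverted index (hypothetical, for illustration only):
// a sorted term dictionary mapping each term to the ids of the documents containing it.
class ToyIndex {
    private final TreeMap<String, TreeSet<Integer>> postings = new TreeMap<>();

    // Split the text on non-word characters and record docId under each lowercase term.
    void add(int docId, String text) {
        for (String term : text.toLowerCase().split("\\W+")) {
            if (term.isEmpty()) {
                continue;
            }
            postings.computeIfAbsent(term, t -> new TreeSet<>()).add(docId);
        }
    }

    // Plain term query: one lookup in the dictionary, independent of wildcard support.
    Set<Integer> termQuery(String term) {
        return postings.getOrDefault(term, new TreeSet<>());
    }

    // Prefix wildcard query ("prefix*"): visit every dictionary term that starts
    // with the prefix and union their document ids.
    Set<Integer> prefixQuery(String prefix) {
        Set<Integer> result = new TreeSet<>();
        for (Map.Entry<String, TreeSet<Integer>> e : postings.tailMap(prefix, true).entrySet()) {
            if (!e.getKey().startsWith(prefix)) {
                break; // sorted order: no later term can match
            }
            result.addAll(e.getValue());
        }
        return result;
    }

    // Number of dictionary terms a prefix query has to expand to;
    // this is the extra work a plain term query never does.
    int expandedTerms(String prefix) {
        int count = 0;
        for (String term : postings.tailMap(prefix, true).keySet()) {
            if (!term.startsWith(prefix)) {
                break;
            }
            count++;
        }
        return count;
    }
}
```

In this model the work done by prefixQuery grows with the number of matching terms in the dictionary, while termQuery stays a single lookup whether or not wildcard support exists. Whether the same holds for a given Nutch deployment depends on how the query filters and the index are set up.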
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
