Hello,

we all know that Lucene supports, among others, boolean queries. Even
though Nutch is built on Lucene, boolean clauses are removed by Nutch
filters so boolean queries end up as "flat" queries where terms are
implicitly connected by an OR operator, as far as I can see.

Is there any simple way to turn off the filtering so a boolean query
remains as such after it is submitted to Nutch?

Just in case a simple way doesn't exist, Ravi Chintakunta suggests the
following workaround:

"We have to modify the analyzer and add more plugins to Nutch
to use the Lucene's query syntax. Or we have to directly use
Lucene's Query Parser. I tried the second approach by modifying
org.apache.nutch.searcher.IndexSearcher and that seems to work."

Can anyone please elaborate on what Ravi actually means by "modifying
org.apache.nutch.searcher.IndexSearcher"? Which methods are supposed
to be modified and how?

It would be really nice to know how to do this. I believe many other
Nutch users would also benefit from an answer to this question.

Thanks so much,

Cristina

-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys -- and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to