1. Right now, if you use the distributed search servers, the
QueryFilter plugins are executed on each server machine.  Would it
make sense to have the client execute the plugins once, and then
dispatch the Lucene query object (instead of Nutch query object) to
the server machines?

2. When using distributed search, I believe that docFreq() is handled
incorrectly.  Each server computes docFreq() against its own local index
instead of using the true docFreq() of the entire index across all
machines.  This throws off the scores, since hits from different servers
end up weighted with different idf values.  I'm having trouble thinking
of a clean solution to this problem.  Any ideas?
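
To make the docFreq() issue concrete, here is a rough sketch of one
possible direction, not a worked-out design: the client first asks each
server for its local docFreq and document count for a term, sums them,
and computes a single idf from the totals.  The class and method names
below (GlobalDocFreq, ShardStats, globalIdf) are made up for
illustration and are not existing Nutch or Lucene classes.  The idf
formula is the classic Lucene one, 1 + ln(numDocs / (docFreq + 1)), and
I'm assuming each server can report those two numbers cheaply.

```java
import java.util.Arrays;
import java.util.List;

public class GlobalDocFreq {

    /** Per-server statistics for one term (hypothetical holder, not a Nutch class). */
    public static final class ShardStats {
        final int docFreq; // documents on this server that contain the term
        final int maxDoc;  // total documents on this server

        public ShardStats(int docFreq, int maxDoc) {
            this.docFreq = docFreq;
            this.maxDoc = maxDoc;
        }
    }

    /** Classic Lucene-style idf: 1 + ln(numDocs / (docFreq + 1)). */
    public static float idf(long docFreq, long numDocs) {
        return (float) (Math.log(numDocs / (double) (docFreq + 1)) + 1.0);
    }

    /** Sums the per-server counts, then computes one idf for the whole index. */
    public static float globalIdf(List<ShardStats> shards) {
        long totalDocFreq = 0;
        long totalDocs = 0;
        for (ShardStats s : shards) {
            totalDocFreq += s.docFreq;
            totalDocs += s.maxDoc;
        }
        return idf(totalDocFreq, totalDocs);
    }

    public static void main(String[] args) {
        // Assumed example: the term is common on server A and rare on server B.
        List<ShardStats> shards = Arrays.asList(
                new ShardStats(90, 100),
                new ShardStats(1, 100));

        for (ShardStats s : shards) {
            System.out.println("local idf  = " + idf(s.docFreq, s.maxDoc));
        }
        System.out.println("global idf = " + globalIdf(shards));
    }
}
```

The two local idf values differ a lot, so the same term is weighted
differently depending on which server scored the hit.  The cost of this
approach is an extra round trip per query to collect the counts before
the real search is dispatched, which is part of why it doesn't feel
clean to me.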

Andy


_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers