1. Right now, with the distributed search servers, the QueryFilter plugins are executed on every server machine. Would it make sense for the client to run the plugins once, and then dispatch the resulting Lucene query object (instead of the Nutch query object) to the server machines?
2. When using distributed search, I believe that docFreq() is handled incorrectly. Each server performs its own docFreq() calculation over its local part of the index, instead of using the true docFreq() of the entire index across all machines. This throws off the scores, since hits coming back from different servers are weighted against different term statistics. I'm having trouble thinking of a clean solution to this problem. Any ideas?

Andy

_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers
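
To make the docFreq() concern in point 2 concrete, here is a minimal, self-contained sketch in plain Java. It does not use the Nutch or Lucene APIs. The class name, the shard counts, and the helper methods are all hypothetical, and the idf formula is the classic Lucene-style 1 + ln(numDocs / (docFreq + 1)), assumed here for illustration. It shows how the same term gets a different weight on each server when only local statistics are used, and how summing the per-server counts first (one possible approach, not necessarily the cleanest) gives every server the same weight.

```java
public class GlobalDocFreqSketch {

    // Classic Lucene-style inverse document frequency (assumed formula):
    // idf = 1 + ln(numDocs / (docFreq + 1))
    static double idf(long docFreq, long numDocs) {
        return 1.0 + Math.log((double) numDocs / (double) (docFreq + 1));
    }

    // Sum per-server values, e.g. docFreq or numDocs reported by each server.
    static long sum(long[] perServer) {
        long total = 0;
        for (long v : perServer) {
            total += v;
        }
        return total;
    }

    public static void main(String[] args) {
        // Hypothetical statistics for one term on two search servers.
        long[] docFreq = {5, 500};     // documents containing the term, per server
        long[] numDocs = {1000, 1000}; // total documents, per server

        // Local scoring: each server weights the term using only its own index.
        for (int i = 0; i < docFreq.length; i++) {
            System.out.println("server " + i + " local idf = "
                    + idf(docFreq[i], numDocs[i]));
        }

        // Global scoring: aggregate the counts first, then compute a single
        // idf that every server would use for this term.
        long globalDocFreq = sum(docFreq);
        long globalNumDocs = sum(numDocs);
        System.out.println("global idf = " + idf(globalDocFreq, globalNumDocs));
    }
}
```

In a real deployment this would presumably mean an extra round trip: the client collects docFreq and document counts from every server for the query terms, sums them, and sends the totals along with the query. Whether that cost is acceptable is exactly the open question.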
