Maximum search time limit
-------------------------

         Key: NUTCH-308
         URL: http://issues.apache.org/jira/browse/NUTCH-308
     Project: Nutch
        Type: Improvement

  Components: searcher  
    Versions: 0.8-dev    
    Reporter: Andrzej Bialecki 
     Fix For: 0.8-dev


With large indexes it may often happen that search servers don't respond in 
time. The IPC code time-outs such calls, and the front-end 
DistributedSearch.Client no longer expects any results from the timed-out 
Server. However, the search thread is still running on the Server, until it 
completes (which may be even tens of seconds later), consuming server 
resources. Subsequent requests will run much slower, eventually leading to 100% 
CPU/disk utilization and an avalanche of timeouts.

This patch adds support for a maximum search time limit. After this time is 
exceeded, the search thread is stopped, partial results (if any) are returned 
and the total number of results is estimated.

This patch tries to minimize the overhead related to time-keeping by using a 
version of safe unsynchronized timer.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira

Reply via email to