Semyon Semyonov created NUTCH-2504:

             Summary: Results of maxCountExpr and fetchDelayExpr should be 
stored in memory in Generate
                 Key: NUTCH-2504
             Project: Nutch
          Issue Type: Sub-task
          Components: generator
            Reporter: Semyon Semyonov

With NUTCH-2455 the expressions maxCountExpr and fetchDelayExpr are calculated 
for each value. That slows the process, instead we can store the results for 
each host in hostDomainCounts. 

That will take only 2 x sizeof(long) extra memory per host.

This message was sent by Atlassian JIRA

Reply via email to