Semyon Semyonov created NUTCH-2504: -------------------------------------- Summary: Results of maxCountExpr and fetchDelayExpr should be stored in memory in Generate Key: NUTCH-2504 URL: https://issues.apache.org/jira/browse/NUTCH-2504 Project: Nutch Issue Type: Sub-task Components: generator Reporter: Semyon Semyonov
With NUTCH-2455 the expressions maxCountExpr and fetchDelayExpr are calculated for each value. That slows the process, instead we can store the results for each host in hostDomainCounts. That will take only 2 x sizeof(long) extra memory per host. -- This message was sent by Atlassian JIRA (v7.6.3#76005)