topN is ignored with maxNumSegments
-----------------------------------
Key: NUTCH-1074
URL: https://issues.apache.org/jira/browse/NUTCH-1074
Project: Nutch
Issue Type: Bug
Components: generator
Affects Versions: 1.3
Reporter: Markus Jelsma
Fix For: 1.4
When generating segments with topN and maxNumSegments, topN is not respected.
It looks like the first generated segment contains topN * maxNumSegments of
URLs's, at least the number of map input records roughly matches.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira