I've been reading through the code of the CrawlDbReader class and came
across the method "processTopNJob", which uses the classes
CrawlDbTopNMapper and CrawlDbTopNReducer.
I'm wondering why we have this method. Is it an old implementation that
was used before the Generator to get the TopN links to fetch, or is it
something else?
I would appreciate your thoughts.
I also found a class that is not used: "CrawlDbDumpReducer" is defined
but never used or instantiated.
Don't you think we can remove it from the source code?
_______________________________________________
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers