[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Lewis John McGibbney updated NUTCH-1370: ---------------------------------------- Attachment: NUTCH-1370-2.x-v2.patch 2nd WIP for 2.x I'm having difficulty correctly implementing JobClient#runJob as the currentJob param is not correct... {code} RunningJob mapJob = JobClient.runJob(currentJob); {code} @Seb, Regarding your patch, this looks great, is much cleaner than my proposal, I've tested and I'm +1 for committing. > Expose exact number of urls injected @runtime > ---------------------------------------------- > > Key: NUTCH-1370 > URL: https://issues.apache.org/jira/browse/NUTCH-1370 > Project: Nutch > Issue Type: Improvement > Components: injector > Affects Versions: nutchgora, 1.5 > Reporter: Lewis John McGibbney > Assignee: Lewis John McGibbney > Priority: Minor > Fix For: 1.6, 2.2 > > Attachments: NUTCH-1370-1.x.patch, NUTCH-1370-2.x.patch, > NUTCH-1370-2.x-v2.patch > > > Example: When using trunk, currently we see > {code} > 2012-05-22 09:04:00,239 INFO crawl.Injector - Injector: starting at > 2012-05-22 09:04:00 > 2012-05-22 09:04:00,239 INFO crawl.Injector - Injector: crawlDb: > crawl/crawldb > 2012-05-22 09:04:00,239 INFO crawl.Injector - Injector: urlDir: urls > 2012-05-22 09:04:00,253 INFO crawl.Injector - Injector: Converting injected > urls to crawl db entries. > 2012-05-22 09:04:00,955 INFO plugin.PluginRepository - Plugins: looking in: > {code} > I would like to see > {code} > 2012-05-22 09:04:00,239 INFO crawl.Injector - Injector: starting at > 2012-05-22 09:04:00 > 2012-05-22 09:04:00,239 INFO crawl.Injector - Injector: crawlDb: > crawl/crawldb > 2012-05-22 09:04:00,239 INFO crawl.Injector - Injector: urlDir: urls > 2012-05-22 09:04:00,253 INFO crawl.Injector - Injector: Injected N urls to > crawl/crawldb > 2012-05-22 09:04:00,253 INFO crawl.Injector - Injector: Converting injected > urls to crawl db entries. > 2012-05-22 09:04:00,955 INFO plugin.PluginRepository - Plugins: looking in: > {code} > This would make debugging easier and would help those who end up getting > {code} > 2012-05-22 09:04:04,850 WARN crawl.Generator - Generator: 0 records selected > for fetching, exiting ... > {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira