Lewis John McGibbney created NUTCH-1370:
-------------------------------------------
Summary: Expose exact number of urls injected @runtime
Key: NUTCH-1370
URL: https://issues.apache.org/jira/browse/NUTCH-1370
Project: Nutch
Issue Type: Improvement
Components: injector
Affects Versions: 1.4, nutchgora
Reporter: Lewis John McGibbney
Fix For: 1.5, 2.1
Example: When using trunk, currently we see
{code}
2012-05-22 09:04:00,239 INFO crawl.Injector - Injector: starting at 2012-05-22
09:04:00
2012-05-22 09:04:00,239 INFO crawl.Injector - Injector: crawlDb: crawl/crawldb
2012-05-22 09:04:00,239 INFO crawl.Injector - Injector: urlDir: urls
2012-05-22 09:04:00,253 INFO crawl.Injector - Injector: Converting injected
urls to crawl db entries.
2012-05-22 09:04:00,955 INFO plugin.PluginRepository - Plugins: looking in:
{code}
I would like to see
{code}
2012-05-22 09:04:00,239 INFO crawl.Injector - Injector: starting at 2012-05-22
09:04:00
2012-05-22 09:04:00,239 INFO crawl.Injector - Injector: crawlDb: crawl/crawldb
2012-05-22 09:04:00,239 INFO crawl.Injector - Injector: urlDir: urls
2012-05-22 09:04:00,253 INFO crawl.Injector - Injector: Injected N urls to
crawl/crawldb
2012-05-22 09:04:00,253 INFO crawl.Injector - Injector: Converting injected
urls to crawl db entries.
2012-05-22 09:04:00,955 INFO plugin.PluginRepository - Plugins: looking in:
{code}
This would make debugging easier and would help those who end up getting
{code}
2012-05-22 09:04:04,850 WARN crawl.Generator - Generator: 0 records selected
for fetching, exiting ...
{code}
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira