[ https://issues.apache.org/jira/browse/NUTCH-1349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13268397#comment-13268397 ]
Ferdy Galema commented on NUTCH-1349: ------------------------------------- Good work on improving the CLI. About the displaying mismatching batchId, your patch prints batchId while you should use 'mark' instead. What do you mean with matching TableUtil.unreverseUrl(key)? > Make batchId explcit within debug logging and improve CLI > --------------------------------------------------------- > > Key: NUTCH-1349 > URL: https://issues.apache.org/jira/browse/NUTCH-1349 > Project: Nutch > Issue Type: Improvement > Components: indexer > Affects Versions: nutchgora > Reporter: Lewis John McGibbney > Priority: Minor > Fix For: nutchgora > > Attachments: NUTCH-1349.patch > > > I find this a pain when trying to locate the batchId of some urls which are > skipped when going to the Solr index. My DEBUG log output gives me > {code} > 2012-05-03 20:44:55,268 DEBUG indexer.IndexerJob (IndexerJob.java:map(83)) - > Skipping http://www.glasgowwheelers.com/; different batch id > 2012-05-03 20:44:55,259 DEBUG indexer.IndexerJob (IndexerJob.java:map(83)) - > Skipping http://www.heraldscotland.com/; different batch id > {code} > when I would actually like > {code} > 2012-05-03 20:44:55,268 DEBUG indexer.IndexerJob (IndexerJob.java:map(83)) - > Skipping http://www.glasgowwheelers.com/; different batch id (ACTUAL BATCH ID) > 2012-05-03 20:44:55,259 DEBUG indexer.IndexerJob (IndexerJob.java:map(83)) - > Skipping http://www.heraldscotland.com/; different batch id (ACTUAL BATCH ID) > {code} > patch coming up soon -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira