Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The "DebugTool" page has been changed by ChrisMattmann: http://wiki.apache.org/nutch/DebugTool?action=diff&rev1=2&rev2=3 It should be possible to generate information that would have answered all of the "is it X" questions that came up during a user's crawl. E.g. - - which URLs were put on the fetch list, versus skipped. + 1. which URLs were put on the fetch list, versus skipped. - - which fetched documents were truncated. + 1. which fetched documents were truncated. - - which URLs in a parsed page were skipped, due to the max outlinks per page limit. + 1. which URLs in a parsed page were skipped, due to the max outlinks per page limit. - - which URLs got filtered by regex + 1. which URLs got filtered by regex Please add more requirements and discussion here.

