GitHub user MJJoyce opened a pull request:
https://github.com/apache/nutch/pull/83
NUTCH-2155 - Add crawl completion utility
- Add a simple crawl completion utility that reports the count of fetched and
unfetched pages per domain or host.
- Update "nutch" helper script with new utility command.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/MJJoyce/nutch NUTCH-2155
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/nutch/pull/83.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #83
----
commit 2534b7a32417e044f5c1c39f4409a6d6826eee69
Author: Michael Joyce <[email protected]>
Date: 2015-10-28T21:18:16Z
NUTCH-2155 - Add crawl completion util
- Add a simple crawl completion utility that reports the count of fetched and
unfetched pages per domain or host.
- Update "nutch" helper script with new utility command.
----