GitHub user sebastian-nagel reopened a pull request:

    https://github.com/apache/nutch/pull/86

    NUTCH-1712 Injector to use MultipleInputs (new MR API)

    Tested inject in combination with the other CrawlDb tools (readdb, updatedb, 
mergedb): everything seems to work smoothly, although output files are named 
part-00000 with the old MapReduce API and part-r-00000 with the new one.
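
The naming difference matters for scripts that look for part files by name. The following is a minimal sketch, not part of the patch, of a matcher that accepts both conventions. The function name `partition_index` is hypothetical, and the pattern also accepts part-m-nnnnn, which is an assumption based on how the new API names map-only output.

```python
import re

# Old MapReduce API:  part-00000
# New MapReduce API:  part-r-00000 (reduce output), part-m-00000 (map-only output)
# The numeric suffix is zero-padded to at least five digits.
PART_FILE = re.compile(r"part-(?:[mr]-)?(\d{5,})")


def partition_index(name):
    """Return the partition number encoded in a part file name.

    Returns None when the name is not a part file (for example _SUCCESS).
    This helper is illustrative only and is not taken from Nutch.
    """
    match = PART_FILE.fullmatch(name)
    return int(match.group(1)) if match else None
```

A tool that filters a CrawlDb directory listing with such a matcher would keep working whether the data was written by the old or the new API.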

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/sebastian-nagel/nutch NUTCH-1712

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/nutch/pull/86.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #86
    
----
commit 8900e4fb8b417f1f1e46f4dcb6c02840d2a5b838
Author: Sebastian Nagel <[email protected]>
Date:   2015-10-19T19:48:05Z

    NUTCH-1712 applied to current trunk; run first simple tests (inject + merge)

commit 11942a92bd583eca8253e2b34f259f74c0ae4b81
Author: Sebastian Nagel <[email protected]>
Date:   2016-01-17T20:32:31Z

    add unit tests based on MRUnit

commit 712b0b0ca2883fa399e23f7f22c9ffc236ec3db4
Author: Sebastian Nagel <[email protected]>
Date:   2016-01-17T21:20:32Z

    update tests to reflect change of reduce outputs by new API (part-nnnnn -> 
part-r-nnnnn): all unit tests pass now

----


