Update for CommonCrawlDataDumper saves gzipped body in extracted form NUTCH-2213 contributed by jnioche this closes #88.
Project: http://git-wip-us.apache.org/repos/asf/nutch/repo Commit: http://git-wip-us.apache.org/repos/asf/nutch/commit/a3e74204 Tree: http://git-wip-us.apache.org/repos/asf/nutch/tree/a3e74204 Diff: http://git-wip-us.apache.org/repos/asf/nutch/diff/a3e74204 Branch: refs/heads/master Commit: a3e7420494304bc4de7ee1a0b25a5158108856f5 Parents: 12fc121 Author: Chris Mattmann <[email protected]> Authored: Mon Feb 29 19:35:46 2016 -0800 Committer: Chris Mattmann <[email protected]> Committed: Mon Feb 29 19:35:46 2016 -0800 ---------------------------------------------------------------------- CHANGES.txt | 2 ++ 1 file changed, 2 insertions(+) ---------------------------------------------------------------------- http://git-wip-us.apache.org/repos/asf/nutch/blob/a3e74204/CHANGES.txt ---------------------------------------------------------------------- diff --git a/CHANGES.txt b/CHANGES.txt index 655fe35..e2a9c3d 100644 --- a/CHANGES.txt +++ b/CHANGES.txt @@ -10,6 +10,8 @@ in the release announcement and keep it on top in this CHANGES.txt for the Nutch Nutch Change Log +* NUTCH-2213 CommonCrawlDataDumper saves gzipped body in extracted form (jnioche via mattmann) + * NUTCH-2144 Added an extension point and a plugin to accept external links (Thamme Gowda N. via mattmann) * NUTCH-1712 Use MultipleInputs in Injector to make it a single mapreduce job (tejasp, snagel)
