Merge branch 'my-branch' of https://github.com/jeremie70/nutch into 2.x
Project: http://git-wip-us.apache.org/repos/asf/nutch/repo Commit: http://git-wip-us.apache.org/repos/asf/nutch/commit/d16b5afa Tree: http://git-wip-us.apache.org/repos/asf/nutch/tree/d16b5afa Diff: http://git-wip-us.apache.org/repos/asf/nutch/diff/d16b5afa Branch: refs/heads/2.x Commit: d16b5afa2b216b3a64bcf9cb83b78be6f2833250 Parents: 876aa4f 32dd379 Author: Chris Mattmann <[email protected]> Authored: Sat Mar 19 17:46:42 2016 -0700 Committer: Chris Mattmann <[email protected]> Committed: Sat Mar 19 17:46:42 2016 -0700 ---------------------------------------------------------------------- conf/nutch-default.xml | 18 ++++++ .../nutch/indexer/IndexingFiltersChecker.java | 5 +- .../tika/BoilerpipeExtractorRepository.java | 62 ++++++++++++++++++++ .../org/apache/nutch/parse/tika/DOMBuilder.java | 4 +- .../org/apache/nutch/parse/tika/TikaParser.java | 38 ++++++++++-- 5 files changed, 118 insertions(+), 9 deletions(-) ----------------------------------------------------------------------
