This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git
commit a997f102878dc979f2ce542e63e860edeaf65f68 Merge: 5fb3140 4418a0d Author: Sebastian Nagel <[email protected]> AuthorDate: Sat Oct 13 11:52:25 2018 +0200 Merge pull request #389 from sebastian-nagel/NUTCH-2192-remove-oro NUTCH-2192 Migrate from Apache ORO to java.util.regex LICENSE.txt | 56 ---- conf/parse-plugins.xml | 4 + conf/regex-normalize.xml.template | 3 +- ivy/ivy.xml | 1 - .../org/apache/nutch/parse/OutlinkExtractor.java | 43 ++- src/plugin/parse-js/build.xml | 14 + src/plugin/parse-js/plugin.xml | 4 +- .../parse-js/sample/parse_embedded_js_test.html | 316 +++++++++++++++++++++ src/plugin/parse-js/sample/parse_pure_js_test.js | 24 ++ .../org/apache/nutch/parse/js/JSParseFilter.java | 89 +++--- .../apache/nutch/parse/js/TestJSParseFilter.java | 116 ++++++++ .../sample/regex-normalize-default.xml | 19 +- .../sample/regex-normalize-scope1.xml | 19 +- 13 files changed, 582 insertions(+), 126 deletions(-)
