This is an automated email from the ASF dual-hosted git repository. snagel pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git.

    from 5fb3140  Merge pull request #388 from sebastian-nagel/NUTCH-2648-configurable-tls-cert-check
     add c532c4e  NUTCH-2192 NUTCH-1678 NUTCH-1014 NUTCH-1021 Migrate from Apache ORO to java.util.regex
                  - apply Markus' patch of NUTCH-2192
                  - finish migration of parse-js
                  - remove oro dependency
                  - correct pointer to Java regex syntax (instead of "Perl5")
                  NUTCH-1063 OutlinkExtractor test generates an exception but does not fail
                  - fixed by adding null-check (required by java.util.regex classes)
     add cf1f2dd  NUTCH-1121 JUnit test for parse-js
                  - port tests from 2.x
                  - add test file for "pure" JavaScript (parser extension)
     add 4418a0d  NUTCH-2192 Migrate from Apache ORO to java.util.regex
                  - improve javadoc in plugin parse-js (from 2.x)
     new a997f10  Merge pull request #389 from sebastian-nagel/NUTCH-2192-remove-oro

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.

Summary of changes:
 LICENSE.txt                                        |  56 ----
 conf/parse-plugins.xml                             |   4 +
 conf/regex-normalize.xml.template                  |   3 +-
 ivy/ivy.xml                                        |   1 -
 .../org/apache/nutch/parse/OutlinkExtractor.java   |  43 ++-
 src/plugin/parse-js/build.xml                      |  14 +
 src/plugin/parse-js/plugin.xml                     |   4 +-
 .../parse-js/sample/parse_embedded_js_test.html    | 316 +++++++++++++++++++++
 src/plugin/parse-js/sample/parse_pure_js_test.js   |  24 ++
 .../org/apache/nutch/parse/js/JSParseFilter.java   |  89 +++---
 .../apache/nutch/parse/js/TestJSParseFilter.java   | 116 ++++++++
 .../sample/regex-normalize-default.xml             |  19 +-
 .../sample/regex-normalize-scope1.xml              |  19 +-
 13 files changed, 582 insertions(+), 126 deletions(-)
 create mode 100644 src/plugin/parse-js/sample/parse_embedded_js_test.html
 create mode 100644 src/plugin/parse-js/sample/parse_pure_js_test.js
 create mode 100644 src/plugin/parse-js/src/test/org/apache/nutch/parse/js/TestJSParseFilter.java