This is an automated email from the ASF dual-hosted git repository.
snagel pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git.
from 5fb3140 Merge pull request #388 from
sebastian-nagel/NUTCH-2648-configurable-tls-cert-check
add c532c4e NUTCH-2192 NUTCH-1678 NUTCH-1014 NUTCH-1021 Migrate from
Apache ORO to java.util.regex - apply Markus' patch of NUTCH-2192 - finish
migration of parse-js - remove oro dependency - correct pointer to Java regex
syntax (instead of "Perl5") NUTCH-1063 OutlinkExtractor test generates an
exception but does not fail - fixed by adding null-check (required by
java.util.regex classes)
add cf1f2dd NUTCH-1121 JUnit test for parse-js - port tests from 2.x -
add test file for "pure" JavaScript (parser extension)
add 4418a0d NUTCH-2192 Migrate from Apache ORO to java.util.regex -
improve javadoc in plugin parse-js (from 2.x)
new a997f10 Merge pull request #389 from
sebastian-nagel/NUTCH-2192-remove-oro
The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
LICENSE.txt | 56 ----
conf/parse-plugins.xml | 4 +
conf/regex-normalize.xml.template | 3 +-
ivy/ivy.xml | 1 -
.../org/apache/nutch/parse/OutlinkExtractor.java | 43 ++-
src/plugin/parse-js/build.xml | 14 +
src/plugin/parse-js/plugin.xml | 4 +-
.../parse-js/sample/parse_embedded_js_test.html | 316 +++++++++++++++++++++
src/plugin/parse-js/sample/parse_pure_js_test.js | 24 ++
.../org/apache/nutch/parse/js/JSParseFilter.java | 89 +++---
.../apache/nutch/parse/js/TestJSParseFilter.java | 116 ++++++++
.../sample/regex-normalize-default.xml | 19 +-
.../sample/regex-normalize-scope1.xml | 19 +-
13 files changed, 582 insertions(+), 126 deletions(-)
create mode 100644 src/plugin/parse-js/sample/parse_embedded_js_test.html
create mode 100644 src/plugin/parse-js/sample/parse_pure_js_test.js
create mode 100644
src/plugin/parse-js/src/test/org/apache/nutch/parse/js/TestJSParseFilter.java