This is an automated email from the ASF dual-hosted git repository.
snagel pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git.
from 111a3cc Merge pull request #340 from r0ann3l/NUTCH-2412
add a484fc0 Merge pull request #1 from apache/master
add 3712fe8 NUTCH-2611: Add line-breaks when parsing HTML block-level
elements
new 3ada351 Merge pull request #354 from YossiTamari/NUTCH-2611
The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
conf/nutch-default.xml | 10 ++++++++++
.../src/java/org/apache/nutch/parse/html/DOMContentUtils.java | 11 +++++++++++
.../src/java/org/apache/nutch/parse/tika/DOMContentUtils.java | 10 ++++++++++
3 files changed, 31 insertions(+)