This is an automated email from the ASF dual-hosted git repository.
snagel pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git.
from 2f9110c Merge pull request #355 from sebastian-nagel/NUTCH-2152
add 56ee081 NUTCH-2618 protocol-okhttp not to use http.timeout for max duration to fetch document
  - add property http.time.limit to configure the max. time allowed to fetch a single document
  - add reason of truncation (content or time) to response metadata
  - rename "trimmed" -> "truncated" to follow common Nutch terminology
add a4569f1 NUTCH-2619 protocol-okhttp: allow to keep partially fetched docs as truncated
  - return content as successful response (marked as truncated) if http.partial.truncated is true and there is already content fetched
new 9777aea Merge pull request #361 from
sebastian-nagel/NUTCH-2619-protocol-okhttp-partial-as-truncated
The 1 revision listed above as "new" is entirely new to this
repository and will be described in a separate email. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
conf/nutch-default.xml | 26 +++++++++
.../org/apache/nutch/net/protocols/Response.java | 26 ++++++++-
.../apache/nutch/protocol/http/api/HttpBase.java | 27 ++++++++-
.../nutch/protocol/okhttp/OkHttpResponse.java | 65 +++++++++++++++++-----
4 files changed, 125 insertions(+), 19 deletions(-)
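
For readers unfamiliar with the change, the idea of stopping a fetch on
either a content limit or a time limit, and recording which one applied,
can be sketched roughly as below. This is only an illustration and not the
code from these commits: the class TruncatingReader, the Reason enum and
the parameters maxContent and maxMillis are hypothetical names, and the
time unit (milliseconds) is an assumption made for the example.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;

/** Hypothetical sketch, not Nutch code: read a body with two limits. */
final class TruncatingReader {

  /** Why reading stopped early; NONE means the whole body was read. */
  enum Reason { NONE, CONTENT, TIME }

  /** The bytes that were kept plus the reason for truncation, if any. */
  static final class Result {
    final byte[] content;
    final Reason reason;

    Result(byte[] content, Reason reason) {
      this.content = content;
      this.reason = reason;
    }
  }

  /**
   * Reads from {@code in} until end of stream, until more than
   * {@code maxContent} bytes would be kept, or until {@code maxMillis}
   * has elapsed. A negative limit disables that limit (an assumption of
   * this sketch). The time check runs between reads, so a single read
   * that blocks for a long time is not interrupted.
   */
  static Result read(InputStream in, int maxContent, long maxMillis)
      throws IOException {
    long start = System.nanoTime();
    ByteArrayOutputStream out = new ByteArrayOutputStream();
    byte[] buf = new byte[8192];
    Reason reason = Reason.NONE;
    while (true) {
      long elapsedMillis = (System.nanoTime() - start) / 1_000_000L;
      if (maxMillis >= 0 && elapsedMillis > maxMillis) {
        reason = Reason.TIME; // keep what was fetched so far
        break;
      }
      int n = in.read(buf);
      if (n < 0) {
        break; // end of stream: nothing was truncated
      }
      if (maxContent >= 0 && out.size() + n > maxContent) {
        // Keep only the bytes that still fit under the content limit.
        out.write(buf, 0, maxContent - out.size());
        reason = Reason.CONTENT;
        break;
      }
      out.write(buf, 0, n);
    }
    return new Result(out.toByteArray(), reason);
  }
}
```

A caller could then decide, based on the returned reason and on whether
any content was kept, to store the partial document marked as truncated
or to treat the fetch as failed.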