[nutch] branch master updated: NUTCH-2666 Increase default value for http.content.limit / ftp.content.limit / file.content.limit - increase the default content limit from 64 kB to 1024 kB

2019-04-10 Thread snagel
This is an automated email from the ASF dual-hosted git repository. snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. The following commit(s) were added to refs/heads/master by this push: new 13a9a6d NUTCH-2666 Increase default value for

[nutch] branch master updated: NUTCH-2683 DeduplicationJob: add option to prefer https:// over http:// - add optional value "httpsOverHttp" to -compareOrder argument to prefer https:// over http:// if

2019-04-10 Thread snagel
snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. The following commit(s) were added to refs/heads/master by this push: new 3958d0c NUTCH-2683 DeduplicationJob: add option

[nutch] branch master updated: NUTCH-2701 Fetcher: log dates and times also in human-readable form - add human-readable date to log message about time limit - move date formatter to TimingUtil - use n

2019-04-10 Thread snagel
snagel pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/nutch.git. The following commit(s) were added to refs/heads/master by this push: new 0624d25 NUTCH-2701 Fetcher: log dates and times