This is an automated email from the ASF dual-hosted git repository.
markus pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git
The following commit(s) were added to refs/heads/master by this push:
new 4642c30c2 NUTCH-3029 Host specific max. and min. intervals in adaptive
scheduler
4642c30c2 is described below
commit 4642c30c2aeb2a1fa2436541bd4af877d0aad86a
Author: Markus Jelsma <[email protected]>
AuthorDate: Wed Mar 13 12:58:05 2024 +0000
NUTCH-3029 Host specific max. and min. intervals in adaptive scheduler
---
conf/adaptive-host-specific-intervals.txt.template | 14 ++++++++++++++
1 file changed, 14 insertions(+)
diff --git a/conf/adaptive-host-specific-intervals.txt.template
b/conf/adaptive-host-specific-intervals.txt.template
new file mode 100644
index 000000000..4aa7920d3
--- /dev/null
+++ b/conf/adaptive-host-specific-intervals.txt.template
@@ -0,0 +1,14 @@
+# This file defines a mapping that associates specific min. and max.
refetching time intervals
+# to a host, that deviate from the default settings of the
AdaptiveFetchSchedule class.
+#
+# Format: <hostname> <min_interval> <max_interval>.
+#
+# The two values will be parsed as float and should be STRICTLY between
+# db.fetch.schedule.adaptive.min_interval and
db.fetch.schedule.adaptive.max_interval.
+#
+# To use default values, write "default" or "0".
+# The default min. is 60 (1 min) and default max. is 31536000 (1 year).
+#
+www.apache.org default 1728000
+www.example.org 1296000 0
+nutch.apache.org 864000 2160000