Site search powered by Lucene/Solr
--
Key: NUTCH-743
URL: https://issues.apache.org/jira/browse/NUTCH-743
Project: Nutch
Issue Type: New Feature
Components: documentation
Reporter: Sami
[ https://issues.apache.org/jira/browse/NUTCH-743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sami Siren updated NUTCH-743:
-
Attachment: NUTCH-743.patch
If there are no objections I will commit this within a week or so.
Site
[ https://issues.apache.org/jira/browse/NUTCH-743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12723176#action_12723176 ]
Andrzej Bialecki commented on NUTCH-743:
-
+1, based on the outcome of a thorough
Hi,
I was wondering what would be the best way to configure per-host
re-crawl intervals. The default db.fetch.interval applies to all URLs,
but I'd like for some hosts to be recrawled more frequently. Is there
a JIRA ticket open on this? I haven't been able to find one.
Sandeep
[ https://issues.apache.org/jira/browse/NUTCH-729?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12723286#action_12723286 ]
Tadesse Sefer commented on NUTCH-729:
-
Where do you change the logging to use a url key?
See http://hudson.zones.apache.org/hudson/job/Nutch-trunk/854/
--
[...truncated 4676 lines...]
deploy:
[mkdir] Created dir:
http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build/plugins/urlfilter-regex
[copy] Copying 1 file to