Build failed in Jenkins: Nutch-trunk #2468

2013-12-29 Thread Apache Jenkins Server
See https://builds.apache.org/job/Nutch-trunk/2468/ -- [...truncated 3380 lines...] init: [mkdir] Created dir: https://builds.apache.org/job/Nutch-trunk/ws/trunk/build/urlnormalizer-host/classes [mkdir] Created dir:

Build failed in Jenkins: Nutch-nutchgora #865

2013-12-29 Thread Apache Jenkins Server
See https://builds.apache.org/job/Nutch-nutchgora/865/ -- [...truncated 203 lines...] at org.tmatesoft.svn.core.internal.io.dav.http.HTTPConnection._request(HTTPConnection.java:775) ... 33 more Caused by: svn: E175002: timed out waiting for

[jira] [Updated] (NUTCH-1687) Pick queue in Round Robin

2013-12-29 Thread Tien Nguyen Manh (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tien Nguyen Manh updated NUTCH-1687: Attachment: NUTCH-1687.patch add Apache Header fixed lost tail pointer when deleting

[jira] [Updated] (NUTCH-1687) Pick queue in Round Robin

2013-12-29 Thread Tien Nguyen Manh (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tien Nguyen Manh updated NUTCH-1687: Attachment: (was: NUTCH-1687.patch) Pick queue in Round Robin

Re: Nutch Crawl a Specific List Of URLs (150K)

2013-12-29 Thread Tejas Patil
Hi Bin Wang, nohup bin/nutch crawl urls -dir result -depth 1 -topN 20 You were creating a new crawldb or reusing some old one ? Were you running this on a cluster or in local mode ? Was there any failure due to which the fetch round got aborted ? (see logs for this). I would like to