Sent: Monday, April 30, 2018 4:53:20 PM
To: user@nutch.apache.org
Subject: Re: Nutch fetching times out at 3 hours, not sure why.
Hi Chip,
got it, you probably run bin/crawl which has the option:
  --time-limit-fetch    Number of minutes allocated to the fetching [default: 180]
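As an illustration only: the 180-minute default is exactly the 3 hours in the subject line, so raising the limit means passing a larger number of minutes. The sketch below assumes your copy of bin/crawl lists --time-limit-fetch in its usage text (older scripts may not). The 12-hour budget is an arbitrary example value, and the remaining crawl arguments are left as a placeholder.

```shell
# Example budget in hours (an assumption; pick what suits your crawl)
HOURS=12

# bin/crawl expects the limit in minutes, so convert
LIMIT_MINUTES=$((HOURS * 60))

# Print the command rather than run it; replace the placeholder with
# the arguments you normally pass to bin/crawl
echo "bin/crawl --time-limit-fetch $LIMIT_MINUTES <your usual crawl arguments>"
```

Run bin/crawl without arguments first to confirm the option is present in your version.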
It's good to have
ignore it unless it causes a problem for my other cores.
>
> Chip
>
> -----Original Message-----
> From: Sebastian Nagel [mailto:wastl.na...@googlemail.com]
> Sent: Monday, April 30, 2018 12:21 PM
> To: user@nutch.apache.org
> Subject: Re: Nutch fetching times out at 3 hou
a problem for my other cores.
Chip
-----Original Message-----
From: Sebastian Nagel [mailto:wastl.na...@googlemail.com]
Sent: Monday, April 30, 2018 12:21 PM
To: user@nutch.apache.org
Subject: Re: Nutch fetching times out at 3 hours, not sure why.
Hi,
if you still see the log message
> Are these 3 hour loops standard for large crawls?
>
> -----Original Message-----
> From: Chip Calhoun [mailto:ccalh...@aip.org]
> Sent: Tuesday, April 17, 2018 3:27 PM
> To: user@nutch.apache.org
> Subject: RE: Nutch fetching times out at 3 hours, not sure why.
>
, April 17, 2018 1:43 PM
To: user@nutch.apache.org
Subject: RE: Nutch fetching times out at 3 hours, not sure why.
Which version are you running? That value is defaulted to -1 in my current
version (1.14) so shouldn't be something you should have needed to change. My
crawls, by default, go for as much
Hi Lewis,
I'm using Nutch 1.2.
Chip
-----Original Message-----
From: lewis john mcgibbney [mailto:lewi...@apache.org]
Sent: Wednesday, April 18, 2018 1:55 PM
To: user@nutch.apache.org
Subject: Re: Nutch fetching times out at 3 hours, not sure why.
Hi Chip,
Which version of Nutch are you using
...@openindex.io]
Sent: Tuesday, April 17, 2018 3:58 PM
To: user@nutch.apache.org
Subject: RE: Nutch fetching times out at 3 hours, not sure why.
Hello Chip,
I have no clue where the three hour limit could come from. Please take a
further look in the last few minutes of the logs.
The only thing I can
Hi Chip,
Which version of Nutch are you using?
On Tue, Apr 17, 2018 at 7:45 AM, wrote:
> From: Chip Calhoun
> To: "user@nutch.apache.org"
> Cc:
> Bcc:
> Date: Tue, 17 Apr 2018 14:45:01 +
> Subject: Nutch fetching
Tuesday 17th April 2018 21:27
> To: user@nutch.apache.org
> Subject: RE: Nutch fetching times out at 3 hours, not sure why.
>
> I'm on 1.12, and mine also defaulted at -1. It does not fail at the same URL,
> or even at the same point in a URL's fetcher loop; it reall
@nutch.apache.org
Subject: RE: Nutch fetching times out at 3 hours, not sure why.
Which version are you running? That value is defaulted to -1 in my current
version (1.14) so shouldn't be something you should have needed to change. My
crawls, by default, go for as much as even 12 hours with little to no tweaking
necessary from the nutch-default. Something else is causing