Hi,

No I didn't really find a good solution, but if I remember correctly I
deleted the crawl database. I have noticed that those jobs seem to
take longer and longer times, expected of course since the crawl
database grows every time.

I also setup a hadoop cluster and that helped a lot in increasing the
performance.

But I haven't been following my crawl process thoroughly lately so
maybe the problem is still hanging around.

best regards,
Magnus

On Thu, May 31, 2012 at 10:13 PM, Lewis John Mcgibbney
<[email protected]> wrote:
> Can someone provide and URL please?
>
> On Thu, May 31, 2012 at 9:23 PM, sidbatra <[email protected]> wrote:
>> Hi Magnus,
>>
>> I'm facing the exactly the same issue with Nutch 1.4
>>
>> Did you manage to find a solution?
>>
>> thanks,
>> Sid
>>
>> --
>> View this message in context: 
>> http://lucene.472066.n3.nabble.com/ParseSegment-taking-a-long-time-to-finish-tp3758053p3987122.html
>> Sent from the Nutch - User mailing list archive at Nabble.com.
>
>
>
> --
> Lewis

Reply via email to