Hi, No I didn't really find a good solution, but if I remember correctly I deleted the crawl database. I have noticed that those jobs seem to take longer and longer times, expected of course since the crawl database grows every time.
I also setup a hadoop cluster and that helped a lot in increasing the performance. But I haven't been following my crawl process thoroughly lately so maybe the problem is still hanging around. best regards, Magnus On Thu, May 31, 2012 at 10:13 PM, Lewis John Mcgibbney <[email protected]> wrote: > Can someone provide and URL please? > > On Thu, May 31, 2012 at 9:23 PM, sidbatra <[email protected]> wrote: >> Hi Magnus, >> >> I'm facing the exactly the same issue with Nutch 1.4 >> >> Did you manage to find a solution? >> >> thanks, >> Sid >> >> -- >> View this message in context: >> http://lucene.472066.n3.nabble.com/ParseSegment-taking-a-long-time-to-finish-tp3758053p3987122.html >> Sent from the Nutch - User mailing list archive at Nabble.com. > > > > -- > Lewis

