Well, set up to crawl nutch.apache.org only and fetch some cycles and see what 
happens. If merging goes bad then i can reproduce and perhaps fix it.

If not, you may want to start debugging the thing step by step.

On Tuesday 10 January 2012 18:06:34 Dean Pullen wrote:
> Yes, this is about the parse_data directory dissapearing after a merge.
> 
> I've used a clean Nutch 1.4 multiple times, I've not yet use an example
> crawl though.
> 
> Anything specific you recommend?
> 
> Dean.
> 
> On 10/01/2012 16:59, Markus Jelsma wrote:
> > I haven't followed the entire thread but this is about the parse_data
> > directory disappears after a merge? We have no issues with merges on
> > small crawls.
> > 
> > Do you still store content despite the parsing fetcher? Can you reproduce
> > this on a clean Nutch 1.4  build with an example crawl?

-- 
Markus Jelsma - CTO - Openindex

Reply via email to