Well, set it up to crawl nutch.apache.org only, run a few fetch cycles, and see what happens. If the merge goes bad, then I can reproduce it and perhaps fix it. If not, you may want to start debugging the thing step by step.

On Tuesday 10 January 2012 18:06:34 Dean Pullen wrote:
> Yes, this is about the parse_data directory disappearing after a merge.
>
> I've used a clean Nutch 1.4 multiple times, I've not yet used an example
> crawl though.
>
> Anything specific you recommend?
>
> Dean.
>
> On 10/01/2012 16:59, Markus Jelsma wrote:
> > I haven't followed the entire thread but this is about the parse_data
> > directory disappearing after a merge? We have no issues with merges on
> > small crawls.
> >
> > Do you still store content despite the parsing fetcher? Can you reproduce
> > this on a clean Nutch 1.4 build with an example crawl?

-- 
Markus Jelsma - CTO - Openindex
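
For the step-by-step debugging, a small check run before and after the merge may help pin down when parse_data goes missing. The following is only a rough sketch, not part of Nutch: the function name `missing_segment_parts` is made up for illustration, and it assumes the segments live on the local filesystem (not HDFS) and use the usual Nutch 1.x segment subdirectory names.

```python
import os

# Subdirectories a fully fetched and parsed Nutch 1.x segment normally has.
# Assumption: "content" may be legitimately absent if content is not stored.
EXPECTED_PARTS = ("content", "crawl_fetch", "crawl_generate",
                  "crawl_parse", "parse_data", "parse_text")


def missing_segment_parts(segments_dir, expected=EXPECTED_PARTS):
    """Map each segment name to the expected subdirectories it lacks.

    Hypothetical helper for illustration; segments with nothing missing
    are left out of the result.
    """
    report = {}
    for name in sorted(os.listdir(segments_dir)):
        segment = os.path.join(segments_dir, name)
        if not os.path.isdir(segment):
            continue  # skip stray files next to the segments
        missing = [part for part in expected
                   if not os.path.isdir(os.path.join(segment, part))]
        if missing:
            report[name] = missing
    return report
```

Calling `missing_segment_parts("crawl/segments")` after each cycle, and again on the merged output directory, should show whether parse_data is already absent before the merge or only afterwards. The `crawl/segments` path here is just a placeholder for wherever the crawl was written.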

