Re: org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find taskTracker/jobcache/job_local..

2012-05-10 Thread Markus Jelsma
I that case i'm not sure 9GB is enough for 400.000 documents. This is most certainly not enough if you store the content in the segment (default). On Thu, 10 May 2012 10:43:14 +0200, Igor Salma wrote: Thanks Markus, Yes, we've already changed hadoop.tmp.dir and there is plenty free space.

Re: org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find taskTracker/jobcache/job_local..

2012-05-10 Thread Igor Salma
Thanks Markus, Yes, we've already changed hadoop.tmp.dir and there is plenty free space. All the best, Igor On Thu, May 10, 2012 at 10:35 AM, Markus Jelsma wrote: > Plenty of disk space does not mean you have enough room in your > hadoop.tmp.dir which is /tmp by default. > > > On Thu, 10 May 20

Re: org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find taskTracker/jobcache/job_local..

2012-05-10 Thread Markus Jelsma
Plenty of disk space does not mean you have enough room in your hadoop.tmp.dir which is /tmp by default. On Thu, 10 May 2012 10:26:00 +0200, Igor Salma wrote: Hi, Adriana, Sebastian, We are struggling with this for a days - the problem is cause it crawls for few days and then it breaks with

Re: org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find taskTracker/jobcache/job_local..

2012-05-10 Thread Igor Salma
Hi, Adriana, Sebastian, We are struggling with this for a days - the problem is cause it crawls for few days and then it breaks with same exception. At first, it seemed that Adriana was right - that we're having problem with disc space but last two breaks occurred with 9GB still left on disc. Also

Re: org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find taskTracker/jobcache/job_local..

2012-05-01 Thread Sebastian Nagel
Hi Igor, no disk space on /tmp is one possible reason. The other is: > (working in local mode). Are you running multiple instances of Nutch in parallel? If yes, these instances must use disjoint temp directories (hadoop.tmp.dir). There are multiple posts on this list about this topic. Sebastian

Re: org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find taskTracker/jobcache/job_local..

2012-04-30 Thread Igor Salma
Hi, Thanks Adriana, for such a quick reply. We'll give it another try with your suggestions. Regarding, missing library - I assumed I'm on wrong track if I need additional library, but, yes, I might be very wrong :) I'll keep you posted. All the best, Igor On Mon, Apr 30, 2012 at 3:33 PM, Adri

Re: org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find taskTracker/jobcache/job_local..

2012-04-30 Thread Adriana Farina
Hello! I had the same kind of problem. In my case this was caused by one of the node of my cluster with full memory, so to solve the priblem I simply freed up memory on that node. Check if all of the nodes of your cluster have free memory. As for the second error, it seems you're missing some l