It looks like you are running with an older version of 2.0, even though it does not really make much of a difference in this case, The issue shows up when getLocalPathForWrite thinks there is no space on to write to on any of the disks it has configured. This could be because you do not have any directories configured. I really don't know for sure exactly what is happening. It might be disk fail in place removing disks for you because of other issues. Either way we should file a JIRA against Hadoop to make it so we never get the / by zero error and provide a better way to handle the possible causes.
--Bobby Evans On 10/24/12 11:54 PM, "Ted Yu" <[email protected]> wrote: >Hi, >HBase has Jenkins build against hadoop 2.0 >I was checking why TestRowCounter sometimes failed: >https://builds.apache.org/job/HBase-TRUNK-on-Hadoop-2.0.0/231/testReport/o >rg.apache.hadoop.hbase.mapreduce/TestRowCounter/testRowCounterExclusiveCol >umn/ > >I think the following could be the cause: > >2012-10-22 23:46:32,571 WARN [AsyncDispatcher event handler] >resourcemanager.RMAuditLogger(255): USER=jenkins OPERATION=Application >Finished - Failed TARGET=RMAppManager RESULT=FAILURE DESCRIPTION=App >failed with state: FAILED PERMISSIONS=Application >application_1350949562159_0002 failed 1 times due to AM Container for >appattempt_1350949562159_0002_000001 exited with exitCode: -1000 due >to: java.lang.ArithmeticException: / by zero > at >org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathFor >Write(LocalDirAllocator.java:355) > at >org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAlloca >tor.java:150) > at >org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAlloca >tor.java:131) > at >org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAlloca >tor.java:115) > at >org.apache.hadoop.yarn.server.nodemanager.LocalDirsHandlerService.getLocal >PathForWrite(LocalDirsHandlerService.java:257) > at >org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.Resou >rceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.jav >a:849) > >However, I don't seem to find where in getLocalPathForWrite() division by >zero could have arisen. > >Comment / hint is welcome. > >Thanks
