[ 
https://issues.apache.org/jira/browse/HBASE-6330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13552354#comment-13552354
 ] 

Ted Yu commented on HBASE-6330:
-------------------------------

I found division by zero error again, see 
https://builds.apache.org/job/HBase-TRUNK-on-Hadoop-2.0.0/345/testReport/org.apache.hadoop.hbase.mapreduce/TestImportExport/testSimpleCase/
{code}
2013-01-12 11:53:52,809 WARN  [AsyncDispatcher event handler] 
resourcemanager.RMAuditLogger(255): USER=jenkins  OPERATION=Application 
Finished - Failed TARGET=RMAppManager     RESULT=FAILURE  DESCRIPTION=App 
failed with state: FAILED       PERMISSIONS=Application 
application_1357991604658_0002 failed 1 times due to AM Container for 
appattempt_1357991604658_0002_000001 exited with  exitCode: -1000 due to: 
java.lang.ArithmeticException: / by zero
        at 
org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:368)
        at 
org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:150)
        at 
org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:131)
        at 
org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:115)
        at 
org.apache.hadoop.yarn.server.nodemanager.LocalDirsHandlerService.getLocalPathForWrite(LocalDirsHandlerService.java:279)
        at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:851)

.Failing this attempt.. Failing the application.        
APPID=application_1357991604658_0002
{code}
Here is related code:
{code}
        // Keep rolling the wheel till we get a valid path
        Random r = new java.util.Random();
        while (numDirsSearched < numDirs && returnPath == null) {

          long randomPosition = Math.abs(r.nextLong()) % totalAvailable;
{code}
My guess is that totalAvailable was 0, meaning dirDF was empty.

Locally, I saw the following:
{code}
 <testcase time="12.008" 
classname="org.apache.hadoop.hbase.mapreduce.TestImportExport" 
name="testExportScannerBatching">
    <error 
type="java.lang.reflect.UndeclaredThrowableException">java.lang.reflect.UndeclaredThrowableException
  at 
org.apache.hadoop.yarn.exceptions.impl.pb.YarnRemoteExceptionPBImpl.unwrapAndThrowException(YarnRemoteExceptionPBImpl.java:135)
  at 
org.apache.hadoop.yarn.api.impl.pb.client.ClientRMProtocolPBClientImpl.getNewApplication(ClientRMProtocolPBClientImpl.java:154)
  at 
org.apache.hadoop.yarn.client.YarnClientImpl.getNewApplication(YarnClientImpl.java:111)
  at 
org.apache.hadoop.mapred.ResourceMgrDelegate.getNewJobID(ResourceMgrDelegate.java:108)
  at org.apache.hadoop.mapred.YARNRunner.getNewJobID(YARNRunner.java:214)
  at 
org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:345)
  at org.apache.hadoop.mapreduce.Job$11.run(Job.java:1218)
  at org.apache.hadoop.mapreduce.Job$11.run(Job.java:1215)
...
Caused by: com.google.protobuf.ServiceException: java.net.ConnectException: 
Call From TYus-MacBook-Pro.local/192.168.0.23 to 0.0.0.0:8032 failed on 
connection exception: java.net.ConnectException: Connection refused; For more 
details see:  http://wiki.apache.org/hadoop/ConnectionRefused
  at 
org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:212)
  at $Proxy92.getNewApplication(Unknown Source)
  at 
org.apache.hadoop.yarn.api.impl.pb.client.ClientRMProtocolPBClientImpl.getNewApplication(ClientRMProtocolPBClientImpl.java:151)
  ... 45 more
Caused by: java.net.ConnectException: Call From 
TYus-MacBook-Pro.local/192.168.0.23 to 0.0.0.0:8032 failed on connection 
exception: java.net.ConnectException: Connection refused; For more details see: 
 http://wiki.apache.org/hadoop/ConnectionRefused
  at org.apache.hadoop.net.NetUtils.wrapException(NetUtils.java:722)
  at org.apache.hadoop.ipc.Client.call(Client.java:1168)
  at 
org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:202)
  ... 47 more
Caused by: java.net.ConnectException: Connection refused
  at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
  at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:692)
  at 
org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206)
  at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:524)
  at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:489)
  at org.apache.hadoop.ipc.Client$Connection.setupConnection(Client.java:478)
  at org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:573)
  at org.apache.hadoop.ipc.Client$Connection.access$1700(Client.java:220)
  at org.apache.hadoop.ipc.Client.getConnection(Client.java:1217)
  at org.apache.hadoop.ipc.Client.call(Client.java:1144)
{code}
                
> TestImportExport has been failing against hadoop 0.23/2.0 profile [Part2]
> -------------------------------------------------------------------------
>
>                 Key: HBASE-6330
>                 URL: https://issues.apache.org/jira/browse/HBASE-6330
>             Project: HBase
>          Issue Type: Sub-task
>          Components: test
>    Affects Versions: 0.94.1, 0.96.0
>            Reporter: Jonathan Hsieh
>              Labels: hadoop-2.0
>             Fix For: 0.96.0
>
>         Attachments: hbase-6330-94.patch, hbase-6330-trunk.patch, 
> hbase-6330-v2.patch
>
>
> See HBASE-5876.  I'm going to commit the v3 patches under this name since 
> there has been two months (my bad) since the first half was committed and 
> found to be incomplte.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to