Just saw this error again. I filed IMPALA-5765.
On Mon, Jul 31, 2017 at 8:05 PM, Tim Armstrong <tarmstr...@cloudera.com> wrote: > It looks like the same error: > > Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: > org.apache.hadoop.ipc.RemoteException(java.io.IOException): File > /test-warehouse/tpcds.store_sales/.hive-staging_hive_2017-07-31_23-55-05_306_8385818677737494274-760/_task_tmp.-ext-10000/ss_sold_date_sk=2450988/_tmp.000000_0 > could only be replicated to 0 nodes instead of minReplication (=1). There > are 3 datanode(s) running and no node(s) are excluded in this operation. > at > org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.chooseTarget4NewBlock(BlockManager.java:1724) > at > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getAdditionalBlock(FSNamesystem.java:3385) > at > org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.addBlock(NameNodeRpcServer.java:683) > at > org.apache.hadoop.hdfs.server.namenode.AuthorizationProviderProxyClientProtocol.addBlock(AuthorizationProviderProxyClientProtocol.java:214) > at > org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.addBlock(ClientNamenodeProtocolServerSideTranslatorPB.java:495) > at > org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java) > at > org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:617) > at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:1073) > at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2217) > at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2213) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:415) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1917) > at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2211) > > at > org.apache.hadoop.hive.ql.exec.FileSinkOperator.processOp(FileSinkOperator.java:751) > at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:815) > at > org.apache.hadoop.hive.ql.exec.SelectOperator.processOp(SelectOperator.java:84) > at > org.apache.hadoop.hive.ql.exec.mr.ExecReducer.reduce(ExecReducer.java:244) > ... 8 more > Caused by: org.apache.hadoop.ipc.RemoteException(java.io.IOException): File > /test-warehouse/tpcds.store_sales/.hive-staging_hive_2017-07-31_23-55-05_306_8385818677737494274-760/_task_tmp.-ext-10000/ss_sold_date_sk=2450988/_tmp.000000_0 > could only be replicated to 0 nodes instead of minReplication (=1). There > are 3 datanode(s) running and no node(s) are excluded in this operation. > at > org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.chooseTarget4NewBlock(BlockManager.java:1724) > at > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getAdditionalBlock(FSNamesystem.java:3385) > at > org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.addBlock(NameNodeRpcServer.java:683) > at > org.apache.hadoop.hdfs.server.namenode.AuthorizationProviderProxyClientProtocol.addBlock(AuthorizationProviderProxyClientProtocol.java:214) > at > org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.addBlock(ClientNamenodeProtocolServerSideTranslatorPB.java:495) > at > org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java) > at > org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:617) > at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:1073) > at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2217) > at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2213) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:415) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1917) > at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2211) > > at org.apache.hadoop.ipc.Client.call(Client.java:1502) > at org.apache.hadoop.ipc.Client.call(Client.java:1439) > at > org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:230) > at com.sun.proxy.$Proxy12.addBlock(Unknown Source) > at > org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolTranslatorPB.addBlock(ClientNamenodeProtocolTranslatorPB.java:413) > at sun.reflect.GeneratedMethodAccessor68.invoke(Unknown Source) > at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > at java.lang.reflect.Method.invoke(Method.java:606) > at > org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:260) > at > org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:104) > at com.sun.proxy.$Proxy13.addBlock(Unknown Source) > at > org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.locateFollowingBlock(DFSOutputStream.java:1814) > at > org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.nextBlockOutputStream(DFSOutputStream.java:1610) > at > org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.run(DFSOutputStream.java:773) > 2017-07-31 23:55:38,630 ERROR exec.Task > (SessionState.java:printError(1103)) - Ended Job = job_local1252085428_0826 > with errors > 2017-07-31 23:55:38,631 ERROR exec.Task > (SessionState.java:printError(1103)) - Error during job, obtaining > debugging information... > 2017-07-31 23:55:38,641 ERROR ql.Driver > (SessionState.java:printError(1103)) - FAILED: Execution Error, return code > 2 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask > 2017-07-31 23:55:38,641 INFO log.PerfLogger > (PerfLogger.java:PerfLogEnd(168)) - </PERFLOG method=Driver.execute > start=1501545305365 end=1501545338641 duration=33276 > from=org.apache.hadoop.hive.ql.Driver> > > > On Mon, Jul 31, 2017 at 8:03 PM, Tim Armstrong <tarmstr...@cloudera.com> > wrote: > >> I saw this on GVO: https://jenkins.impala.io/job/ubuntu-14.04-from- >> scratch/1807/ >> >> I haven't pulled out the error from hive.log yet - for some reason that >> log is almost 500mb. >> >> On Thu, Jul 13, 2017 at 3:52 PM, Tim Armstrong <tarmstr...@cloudera.com> >> wrote: >> >>> I'm not sure exactly what is going on, but I can confirm that I was able >>> to load data on Ubuntu 16.04 with OpenJDK 8 a while back. >>> >>> On Thu, Jul 13, 2017 at 2:58 PM, Jim Apple <jbap...@cloudera.com> wrote: >>> >>>> I also see this with the Oracle JDK. I have also now checked I am not >>>> running out of memory. >>>> >>>> Oracle JDK7 is harder to get one's hands on, and OpenJDK7 isn't packaged >>>> by >>>> canonical for Ubuntu 16.04. >>>> >>>> On Wed, Jul 12, 2017 at 11:20 PM, Jim Apple <jbap...@cloudera.com> >>>> wrote: >>>> >>>> > I'm getting data loading errors on Ubuntu 16.04 in TPC-DS. The terminal >>>> > shows: >>>> > >>>> > ERROR : FAILED: Execution Error, return code 2 from >>>> > org.apache.hadoop.hive.ql.exec.mr.MapRedTask >>>> > >>>> > logs/cluster/hive/hive.log shows the error below, which previous bugs >>>> have >>>> > called an issue with the disk being out of space, but my disk has at >>>> least >>>> > 45GB left on it >>>> > >>>> > IMPALA-3246, IMPALA-2856, IMPALA-2617 >>>> > >>>> > I see this with openJDK8. I haven't tried Oracle's JDK yet. >>>> > >>>> > Has anyone else seen this and been able to diagnose it as something >>>> that >>>> > doesn't mean a full disk? >>>> > >>>> > >>>> > FATAL ExecReducer (ExecReducer.java:reduce(264)) - >>>> > org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error >>>> > while processing row (tag=0) {"key":{},"value":{"_col0": >>>> > 48147,"_col1":17805,"_col2":27944,"_col3":606992,"_col4": >>>> > 3193,"_col5":16641,"_col6":10,"_col7":209,"_col8":44757,"_ >>>> > col9":20,"_col10":5.51,"_col11":9.36,"_col12":9.17,"_ >>>> > col13":0,"_col14":183.4,"_col15":110.2,"_col16":187.2,"_ >>>> > col17":3.66,"_col18":0,"_col19":183.4,"_col20":187.06," >>>> > _col21":73.2,"_col22":2452013}} >>>> > at org.apache.hadoop.hive.ql.exec.mr.ExecReducer.reduce( >>>> > ExecReducer.java:253) >>>> > at org.apache.hadoop.mapred.ReduceTask.runOldReducer( >>>> > ReduceTask.java:444) >>>> > at org.apache.hadoop.mapred.Reduc >>>> eTask.run(ReduceTask.java:392) >>>> > at org.apache.hadoop.mapred.LocalJobRunner$Job$ >>>> > ReduceTaskRunnable.run(LocalJobRunner.java:346) >>>> > at java.util.concurrent.Executors$RunnableAdapter. >>>> > call(Executors.java:511) >>>> > at java.util.concurrent.FutureTask.run(FutureTask.java:266) >>>> > at java.util.concurrent.ThreadPoolExecutor.runWorker( >>>> > ThreadPoolExecutor.java:1142) >>>> > at java.util.concurrent.ThreadPoolExecutor$Worker.run( >>>> > ThreadPoolExecutor.java:617) >>>> > at java.lang.Thread.run(Thread.java:748) >>>> > Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: >>>> > org.apache.hadoop.ipc.RemoteException(java.io.IOException): File >>>> > /test-warehouse/tpcds.store_sales/.hive-staging_hive_2017- >>>> > 07-12_22-51-18_139_3687815919405186455-760/_task_ >>>> > tmp.-ext-10000/ss_sold_date_sk=2452013/_tmp.000001_0 could only be >>>> > replicated to 0 nodes instead of minReplication (=1). There are 3 >>>> > datanode(s) running and no node(s) are excluded in this operation. >>>> > at org.apache.hadoop.hdfs.server.blockmanagement.BlockManager. >>>> > chooseTarget4NewBlock(BlockManager.java:1724) >>>> > at org.apache.hadoop.hdfs.server.namenode.FSNamesystem. >>>> > getAdditionalBlock(FSNamesystem.java:3385) >>>> > at org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer. >>>> > addBlock(NameNodeRpcServer.java:683) >>>> > at org.apache.hadoop.hdfs.server.namenode. >>>> > AuthorizationProviderProxyClientProtocol.addBlock( >>>> > AuthorizationProviderProxyClientProtocol.java:214) >>>> > at org.apache.hadoop.hdfs.protocolPB. >>>> > ClientNamenodeProtocolServerSideTranslatorPB.addBlock( >>>> > ClientNamenodeProtocolServerSideTranslatorPB.java:495) >>>> > at org.apache.hadoop.hdfs.protocol.proto. >>>> > ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBl >>>> ockingMethod( >>>> > ClientNamenodeProtocolProtos.java) >>>> > at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ >>>> > ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:617) >>>> > at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:1073) >>>> > at org.apache.hadoop.ipc.Server$H >>>> andler$1.run(Server.java:2217) >>>> > at org.apache.hadoop.ipc.Server$H >>>> andler$1.run(Server.java:2213) >>>> > at java.security.AccessController.doPrivileged(Native Method) >>>> > at javax.security.auth.Subject.doAs(Subject.java:422) >>>> > at org.apache.hadoop.security.UserGroupInformation.doAs( >>>> > UserGroupInformation.java:1917) >>>> > at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2211) >>>> > >>>> > at org.apache.hadoop.hive.ql.exec.FileSinkOperator. >>>> > processOp(FileSinkOperator.java:751) >>>> > at org.apache.hadoop.hive.ql.exec.Operator.forward( >>>> > Operator.java:815) >>>> > at org.apache.hadoop.hive.ql.exec.SelectOperator.processOp( >>>> > SelectOperator.java:84) >>>> > at org.apache.hadoop.hive.ql.exec.mr.ExecReducer.reduce( >>>> > ExecReducer.java:244) >>>> > >>>> >>> >>> >>