[ https://issues.apache.org/jira/browse/HIVE-20253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
gehaijiang updated HIVE-20253: ------------------------------ Description: hadoop 3.0.3, Support nativetask. mapred-site.xml: <property> <name>mapreduce.job.map.output.collector.class</name> <value>org.apache.hadoop.mapred.nativetask.NativeMapOutputCollectorDelegator</value> </property> hive sql: set mapreduce.job.map.output.collector.class=org.apache.hadoop.mapred.nativetask.NativeMapOutputCollectorDelegator; select count(*) from test_cold; --test_cold (orcfile table) URL: [http://0.0.0.0:8088/taskdetails.jsp?jobid=job_1532646043398_0019&tipid=task_1532646043398_0019_m_000000] ---- Diagnostic Messages for this Task: Error: java.io.IOException: Initialization of all the collectors failed. Error in last collector was:java.io.IOException: Cannot find serializer for org.apache.hadoop.hive.ql.io.HiveKey at org.apache.hadoop.mapred.MapTask.createSortingCollector(MapTask.java:423) at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:454) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:349) at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:174) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:422) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1686) at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:168) Caused by: java.io.IOException: Cannot find serializer for org.apache.hadoop.hive.ql.io.HiveKey at org.apache.hadoop.mapred.nativetask.NativeMapOutputCollectorDelegator.init(NativeMapOutputCollectorDelegator.java:127) at org.apache.hadoop.mapred.MapTask.createSortingCollector(MapTask.java:408) ... 7 more 2018-07-27 10:08:25,391 ERROR operation.Operation (SQLOperation.java:run(209)) - Error running hive query: org.apache.hive.service.cli.HiveSQLException: Error while processing statement: FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask at org.apache.hive.service.cli.operation.Operation.toSQLException(Operation.java:316) at org.apache.hive.service.cli.operation.SQLOperation.runQuery(SQLOperation.java:156) at org.apache.hive.service.cli.operation.SQLOperation.access$100(SQLOperation.java:71) at org.apache.hive.service.cli.operation.SQLOperation$1$1.run(SQLOperation.java:206) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:422) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657) at org.apache.hive.service.cli.operation.SQLOperation$1.run(SQLOperation.java:218) at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) at java.util.concurrent.FutureTask.run(FutureTask.java:266) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) at java.lang.Thread.run(Thread.java:745) was: hadoop 3.0.3, Support nativetask. mapred-site.xml: <property> <name>mapreduce.job.map.output.collector.class</name> <value>org.apache.hadoop.mapred.nativetask.NativeMapOutputCollectorDelegator</value> </property> hive sql: set mapreduce.job.map.output.collector.class=org.apache.hadoop.mapred.nativetask.NativeMapOutputCollectorDelegator; select count(*) from test_cold; URL: http://0.0.0.0:8088/taskdetails.jsp?jobid=job_1532646043398_0019&tipid=task_1532646043398_0019_m_000000 ----- Diagnostic Messages for this Task: Error: java.io.IOException: Initialization of all the collectors failed. Error in last collector was:java.io.IOException: Cannot find serializer for org.apache.hadoop.hive.ql.io.HiveKey at org.apache.hadoop.mapred.MapTask.createSortingCollector(MapTask.java:423) at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:454) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:349) at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:174) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:422) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1686) at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:168) Caused by: java.io.IOException: Cannot find serializer for org.apache.hadoop.hive.ql.io.HiveKey at org.apache.hadoop.mapred.nativetask.NativeMapOutputCollectorDelegator.init(NativeMapOutputCollectorDelegator.java:127) at org.apache.hadoop.mapred.MapTask.createSortingCollector(MapTask.java:408) ... 7 more 2018-07-27 10:08:25,391 ERROR operation.Operation (SQLOperation.java:run(209)) - Error running hive query: org.apache.hive.service.cli.HiveSQLException: Error while processing statement: FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask at org.apache.hive.service.cli.operation.Operation.toSQLException(Operation.java:316) at org.apache.hive.service.cli.operation.SQLOperation.runQuery(SQLOperation.java:156) at org.apache.hive.service.cli.operation.SQLOperation.access$100(SQLOperation.java:71) at org.apache.hive.service.cli.operation.SQLOperation$1$1.run(SQLOperation.java:206) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:422) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657) at org.apache.hive.service.cli.operation.SQLOperation$1.run(SQLOperation.java:218) at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) at java.util.concurrent.FutureTask.run(FutureTask.java:266) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) at java.lang.Thread.run(Thread.java:745) > nativetask cann't working in hive > ---------------------------------- > > Key: HIVE-20253 > URL: https://issues.apache.org/jira/browse/HIVE-20253 > Project: Hive > Issue Type: Wish > Affects Versions: 1.2.1 > Environment: hadoop 3.0.3 > hive sql: > set > mapreduce.job.map.output.collector.class=org.apache.hadoop.mapred.nativetask.NativeMapOutputCollectorDelegator; > select count(*) from test_cold; > Reporter: gehaijiang > Priority: Major > > hadoop 3.0.3, Support nativetask. > mapred-site.xml: > <property> > <name>mapreduce.job.map.output.collector.class</name> > > <value>org.apache.hadoop.mapred.nativetask.NativeMapOutputCollectorDelegator</value> > </property> > > hive sql: > set > mapreduce.job.map.output.collector.class=org.apache.hadoop.mapred.nativetask.NativeMapOutputCollectorDelegator; > select count(*) from test_cold; --test_cold (orcfile table) > > URL: > > [http://0.0.0.0:8088/taskdetails.jsp?jobid=job_1532646043398_0019&tipid=task_1532646043398_0019_m_000000] > ---- > Diagnostic Messages for this Task: > Error: java.io.IOException: Initialization of all the collectors failed. > Error in last collector was:java.io.IOException: Cannot find serializer for > org.apache.hadoop.hive.ql.io.HiveKey > at org.apache.hadoop.mapred.MapTask.createSortingCollector(MapTask.java:423) > at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:454) > at org.apache.hadoop.mapred.MapTask.run(MapTask.java:349) > at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:174) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:422) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1686) > at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:168) > Caused by: java.io.IOException: Cannot find serializer for > org.apache.hadoop.hive.ql.io.HiveKey > at > org.apache.hadoop.mapred.nativetask.NativeMapOutputCollectorDelegator.init(NativeMapOutputCollectorDelegator.java:127) > at org.apache.hadoop.mapred.MapTask.createSortingCollector(MapTask.java:408) > ... 7 more > > 2018-07-27 10:08:25,391 ERROR operation.Operation > (SQLOperation.java:run(209)) - Error running hive query: > org.apache.hive.service.cli.HiveSQLException: Error while processing > statement: FAILED: Execution Error, return code 2 from > org.apache.hadoop.hive.ql.exec.mr.MapRedTask > at > org.apache.hive.service.cli.operation.Operation.toSQLException(Operation.java:316) > at > org.apache.hive.service.cli.operation.SQLOperation.runQuery(SQLOperation.java:156) > at > org.apache.hive.service.cli.operation.SQLOperation.access$100(SQLOperation.java:71) > at > org.apache.hive.service.cli.operation.SQLOperation$1$1.run(SQLOperation.java:206) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:422) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657) > at > org.apache.hive.service.cli.operation.SQLOperation$1.run(SQLOperation.java:218) > at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) > at java.util.concurrent.FutureTask.run(FutureTask.java:266) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) -- This message was sent by Atlassian JIRA (v7.6.3#76005)