Mykhailo Kysliuk created HIVE-20587: ---------------------------------------
Summary: Reduce phase could not be completed on MapReduce engine Key: HIVE-20587 URL: https://issues.apache.org/jira/browse/HIVE-20587 Project: Hive Issue Type: Bug Reporter: Mykhailo Kysliuk Steps to reproduce: {code} CREATE TABLE IF NOT EXISTS `test`( `account_id` bigint, `subscription_id` bigint, `previous_subscription_id` bigint, `subscription_effective_from_dt` date, `subscription_effective_to_dt` date, `order_completed_dtm` timestamp, `subscription_status` string) STORED AS ORC; insert into test values (1, 1, 1, NULL, NULL, NULL, 'ss'); SELECT account_id, subscription_id, previous_subscription_id, subscription_status, subscription_effective_from_dt, subscription_effective_to_dt FROM ( SELECT DISTINCT account_id, subscription_id, previous_subscription_id, subscription_status, subscription_effective_from_dt, subscription_effective_to_dt, order_completed_dtm, ROW_NUMBER()OVER(PARTITION BY ACCOUNT_ID) as r1 FROM test WHERE account_id=1 ) AS RTE where r1=1; {code} Expected result: {code} OK 1 1 1 ss NULL NULL {code} Actual result: {code} Examining task ID: task_1537276416358_0006_m_000000 (and more) from job job_1537276416358_0006 Task with the most failures(4): ----- Task ID: task_1537276416358_0006_r_000000 ----- Diagnostic Messages for this Task: AttemptID:attempt_1537276416358_0006_r_000000_3 Timed out after 600 secs FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask MapReduce Jobs Launched: Stage-Stage-1: Map: 1 Reduce: 1 Cumulative CPU: 1.3 sec HDFS Read: 0 HDFS Write: 0 FAIL Total MapReduce CPU Time Spent: 1 seconds 300 msec {code} On tez engine this query runs fine. At yarn logs: {code} 2018-09-18 14:53:18,012 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Got allocated containers 1 2018-09-18 14:53:18,012 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Assigned to reduce 2018-09-18 14:53:18,013 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Assigned container container_e1322_1537276416358_0014_01_000003 to attempt_1537276416358_0014_r_000000_0 2018-09-18 14:53:18,013 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: After Scheduling: PendingReds:0 ScheduledMaps:0 ScheduledReds:0 AssignedMaps:0 AssignedReds:1 CompletedMaps:1 CompletedReds:0 ContAlloc:2 ContRel:0 HostLocal:1 RackLocal:0 2018-09-18 14:53:18,020 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapred.Task: mapOutputFile class: org.apache.hadoop.mapred.MapRFsOutputFile 2018-09-18 14:53:18,021 INFO [AsyncDispatcher event handler] org.apache.hadoop.yarn.util.RackResolver: Resolved node10.cluster.com to /default-rack 2018-09-18 14:53:18,021 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: attempt_1537276416358_0014_r_000000_0 TaskAttempt Transitioned from UNASSIGNED to ASSIGNED 2018-09-18 14:53:18,021 INFO [ContainerLauncher #2] org.apache.hadoop.mapreduce.v2.app.launcher.ContainerLauncherImpl: Processing the event EventType: CONTAINER_REMOTE_LAUNCH for container container_e1322_1537276416358_0014_01_000003 taskAttempt attempt_1537276416358_0014_r_000000_0 2018-09-18 14:53:18,021 INFO [ContainerLauncher #2] org.apache.hadoop.mapreduce.v2.app.launcher.ContainerLauncherImpl: Launching attempt_1537276416358_0014_r_000000_0 2018-09-18 14:53:18,021 INFO [ContainerLauncher #2] org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy: Opening proxy : node10.cluster.com:8099 2018-09-18 14:53:18,030 INFO [ContainerLauncher #2] org.apache.hadoop.mapreduce.v2.app.job.event.TaskAttemptContainerLaunchedEvent: Shuffle port returned by ContainerManager for attempt_1537276416358_0014_r_000000_0 : 13562 2018-09-18 14:53:18,030 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: TaskAttempt: [attempt_1537276416358_0014_r_000000_0] using containerId: [container_e1322_1537276416358_0014_01_000003 on NM: [node10.cluster.com:8099] 2018-09-18 14:53:18,030 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: attempt_1537276416358_0014_r_000000_0 TaskAttempt Transitioned from ASSIGNED to RUNNING 2018-09-18 14:53:18,030 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.speculate.DefaultSpeculator: ATTEMPT_START task_1537276416358_0014_r_000000 2018-09-18 14:53:18,031 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskImpl: task_1537276416358_0014_r_000000 Task Transitioned from SCHEDULED to RUNNING 2018-09-18 14:53:19,018 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerRequestor: getResources() for application_1537276416358_0014: ask=1 release= 0 newContainers=0 finishedContainers=0 resourcelimit=<memory:8784, vCores:1, disks:0.0> knownNMs=1 2018-09-18 15:03:41,844 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: Diagnostics report from attempt_1537276416358_0014_r_000000_0: AttemptID:attempt_1537276416358_0014_r_000000_0 Timed out after 600 secs 2018-09-18 15:03:41,845 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: attempt_1537276416358_0014_r_000000_0 TaskAttempt Transitioned from RUNNING to FAIL_CONTAINER_CLEANUP 2018-09-18 15:03:41,846 INFO [ContainerLauncher #3] org.apache.hadoop.mapreduce.v2.app.launcher.ContainerLauncherImpl: Processing the event EventType: CONTAINER_REMOTE_CLEANUP for container container_e1322_1537276416358_0014_01_000003 taskAttempt attempt_1537276416358_0014_r_000000_0 2018-09-18 15:03:41,847 INFO [ContainerLauncher #3] org.apache.hadoop.mapreduce.v2.app.launcher.ContainerLauncherImpl: KILLING attempt_1537276416358_0014_r_000000_0 2018-09-18 15:03:41,848 INFO [ContainerLauncher #3] org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy: Opening proxy : node10.cluster.com:8099 2018-09-18 15:03:41,881 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: attempt_1537276416358_0014_r_000000_0 TaskAttempt Transitioned from FAIL_CONTAINER_CLEANUP to FAIL_TASK_CLEANUP 2018-09-18 15:03:41,882 INFO [CommitterEvent Processor #1] org.apache.hadoop.mapreduce.v2.app.commit.CommitterEventHandler: Processing the event EventType: TASK_ABORT 2018-09-18 15:03:41,885 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: attempt_1537276416358_0014_r_000000_0 TaskAttempt Transitioned from FAIL_TASK_CLEANUP to FAILED 2018-09-18 15:03:41,888 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: attempt_1537276416358_0014_r_000000_1 TaskAttempt Transitioned from NEW to UNASSIGNED 2018-09-18 15:03:41,888 INFO [Thread-49] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerRequestor: 1 failures on node node10.cluster.com 2018-09-18 15:03:42,510 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Before Scheduling: PendingReds:1 ScheduledMaps:0 ScheduledReds:0 AssignedMaps:0 AssignedReds:1 CompletedMaps:1 CompletedReds:0 ContAlloc:2 ContRel:0 HostLocal:1 RackLocal:0 2018-09-18 15:03:42,513 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Recalculating schedule, headroom=<memory:8784, vCores:1, disks:0.0> 2018-09-18 15:03:42,513 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: All maps assigned. Ramping up all remaining reduces:1 2018-09-18 15:03:42,513 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: After Scheduling: PendingReds:0 ScheduledMaps:0 ScheduledReds:1 AssignedMaps:0 AssignedReds:1 CompletedMaps:1 CompletedReds:0 ContAlloc:2 ContRel:0 HostLocal:1 RackLocal:0 2018-09-18 15:03:43,515 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerRequestor: getResources() for application_1537276416358_0014: ask=1 release= 0 newContainers=0 finishedContainers=1 resourcelimit=<memory:12880, vCores:2, disks:1.33> knownNMs=1 2018-09-18 15:03:43,515 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Received completed container container_e1322_1537276416358_0014_01_000003 2018-09-18 15:03:43,515 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: After Scheduling: PendingReds:0 ScheduledMaps:0 ScheduledReds:1 AssignedMaps:0 AssignedReds:0 CompletedMaps:1 CompletedReds:0 ContAlloc:2 ContRel:0 HostLocal:1 RackLocal:0 2018-09-18 15:03:43,515 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: Diagnostics report from attempt_1537276416358_0014_r_000000_0: Container killed by the ApplicationMaster. Container killed on request. Exit code is 143 Container exited with a non-zero exit code 143. {code} Maybe this is not a bug, but MR has not enough resources to run this query. -- This message was sent by Atlassian JIRA (v7.6.3#76005)