[jira] [Created] (HIVE-20587) Reduce phase could not be completed on MapReduce engine

Mykhailo Kysliuk (JIRA) Tue, 18 Sep 2018 08:08:11 -0700

Mykhailo Kysliuk created HIVE-20587:
---------------------------------------


             Summary: Reduce phase could not be completed on MapReduce engine
                 Key: HIVE-20587
                 URL: https://issues.apache.org/jira/browse/HIVE-20587
             Project: Hive
          Issue Type: Bug
            Reporter: Mykhailo Kysliuk


Steps to reproduce:
{code}
CREATE TABLE IF NOT EXISTS `test`(
  `account_id` bigint, 
  `subscription_id` bigint, 
  `previous_subscription_id` bigint, 
  `subscription_effective_from_dt` date, 
  `subscription_effective_to_dt` date, 
  `order_completed_dtm` timestamp, 
  `subscription_status` string) STORED AS ORC;

insert into test values (1, 1, 1, NULL, NULL, NULL, 'ss');

SELECT
  account_id,
  subscription_id,
  previous_subscription_id,
  subscription_status,
  subscription_effective_from_dt,
  subscription_effective_to_dt 
FROM 
( 
  SELECT 
    DISTINCT account_id, 
    subscription_id, 
    previous_subscription_id, 
    subscription_status, 
    subscription_effective_from_dt, 
    subscription_effective_to_dt, 
    order_completed_dtm, 
    ROW_NUMBER()OVER(PARTITION BY ACCOUNT_ID) as r1
  FROM test
  WHERE account_id=1 
) AS RTE where r1=1;
{code}

Expected result:
{code}
OK
1       1       1       ss      NULL    NULL
{code}

Actual result:
{code}
Examining task ID: task_1537276416358_0006_m_000000 (and more) from job 
job_1537276416358_0006

Task with the most failures(4): 
-----
Task ID:
  task_1537276416358_0006_r_000000

-----
Diagnostic Messages for this Task:
AttemptID:attempt_1537276416358_0006_r_000000_3 Timed out after 600 secs

FAILED: Execution Error, return code 2 from 
org.apache.hadoop.hive.ql.exec.mr.MapRedTask
MapReduce Jobs Launched: 
Stage-Stage-1: Map: 1  Reduce: 1   Cumulative CPU: 1.3 sec   HDFS Read: 0 HDFS 
Write: 0 FAIL
Total MapReduce CPU Time Spent: 1 seconds 300 msec
{code}

On tez engine this query runs fine.

At yarn logs:
{code}
2018-09-18 14:53:18,012 INFO [RMCommunicator Allocator] 
org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Got allocated 
containers 1
2018-09-18 14:53:18,012 INFO [RMCommunicator Allocator] 
org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Assigned to reduce
2018-09-18 14:53:18,013 INFO [RMCommunicator Allocator] 
org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Assigned container 
container_e1322_1537276416358_0014_01_000003 to 
attempt_1537276416358_0014_r_000000_0
2018-09-18 14:53:18,013 INFO [RMCommunicator Allocator] 
org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: After Scheduling: 
PendingReds:0 ScheduledMaps:0 ScheduledReds:0 AssignedMaps:0 AssignedReds:1 
CompletedMaps:1 CompletedReds:0 ContAlloc:2 ContRel:0 HostLocal:1 RackLocal:0
2018-09-18 14:53:18,020 INFO [AsyncDispatcher event handler] 
org.apache.hadoop.mapred.Task: mapOutputFile class: 
org.apache.hadoop.mapred.MapRFsOutputFile
2018-09-18 14:53:18,021 INFO [AsyncDispatcher event handler] 
org.apache.hadoop.yarn.util.RackResolver: Resolved node10.cluster.com to 
/default-rack
2018-09-18 14:53:18,021 INFO [AsyncDispatcher event handler] 
org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: 
attempt_1537276416358_0014_r_000000_0 TaskAttempt Transitioned from UNASSIGNED 
to ASSIGNED
2018-09-18 14:53:18,021 INFO [ContainerLauncher #2] 
org.apache.hadoop.mapreduce.v2.app.launcher.ContainerLauncherImpl: Processing 
the event EventType: CONTAINER_REMOTE_LAUNCH for container 
container_e1322_1537276416358_0014_01_000003 taskAttempt 
attempt_1537276416358_0014_r_000000_0
2018-09-18 14:53:18,021 INFO [ContainerLauncher #2] 
org.apache.hadoop.mapreduce.v2.app.launcher.ContainerLauncherImpl: Launching 
attempt_1537276416358_0014_r_000000_0
2018-09-18 14:53:18,021 INFO [ContainerLauncher #2] 
org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy: 
Opening proxy : node10.cluster.com:8099
2018-09-18 14:53:18,030 INFO [ContainerLauncher #2] 
org.apache.hadoop.mapreduce.v2.app.job.event.TaskAttemptContainerLaunchedEvent: 
Shuffle port returned by ContainerManager for 
attempt_1537276416358_0014_r_000000_0 : 13562
2018-09-18 14:53:18,030 INFO [AsyncDispatcher event handler] 
org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: TaskAttempt: 
[attempt_1537276416358_0014_r_000000_0] using containerId: 
[container_e1322_1537276416358_0014_01_000003 on NM: [node10.cluster.com:8099]
2018-09-18 14:53:18,030 INFO [AsyncDispatcher event handler] 
org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: 
attempt_1537276416358_0014_r_000000_0 TaskAttempt Transitioned from ASSIGNED to 
RUNNING
2018-09-18 14:53:18,030 INFO [AsyncDispatcher event handler] 
org.apache.hadoop.mapreduce.v2.app.speculate.DefaultSpeculator: ATTEMPT_START 
task_1537276416358_0014_r_000000
2018-09-18 14:53:18,031 INFO [AsyncDispatcher event handler] 
org.apache.hadoop.mapreduce.v2.app.job.impl.TaskImpl: 
task_1537276416358_0014_r_000000 Task Transitioned from SCHEDULED to RUNNING
2018-09-18 14:53:19,018 INFO [RMCommunicator Allocator] 
org.apache.hadoop.mapreduce.v2.app.rm.RMContainerRequestor: getResources() for 
application_1537276416358_0014: ask=1 release= 0 newContainers=0 
finishedContainers=0 resourcelimit=<memory:8784, vCores:1, disks:0.0> knownNMs=1
2018-09-18 15:03:41,844 INFO [AsyncDispatcher event handler] 
org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: Diagnostics report 
from attempt_1537276416358_0014_r_000000_0: 
AttemptID:attempt_1537276416358_0014_r_000000_0 Timed out after 600 secs
2018-09-18 15:03:41,845 INFO [AsyncDispatcher event handler] 
org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: 
attempt_1537276416358_0014_r_000000_0 TaskAttempt Transitioned from RUNNING to 
FAIL_CONTAINER_CLEANUP
2018-09-18 15:03:41,846 INFO [ContainerLauncher #3] 
org.apache.hadoop.mapreduce.v2.app.launcher.ContainerLauncherImpl: Processing 
the event EventType: CONTAINER_REMOTE_CLEANUP for container 
container_e1322_1537276416358_0014_01_000003 taskAttempt 
attempt_1537276416358_0014_r_000000_0
2018-09-18 15:03:41,847 INFO [ContainerLauncher #3] 
org.apache.hadoop.mapreduce.v2.app.launcher.ContainerLauncherImpl: KILLING 
attempt_1537276416358_0014_r_000000_0
2018-09-18 15:03:41,848 INFO [ContainerLauncher #3] 
org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy: 
Opening proxy : node10.cluster.com:8099
2018-09-18 15:03:41,881 INFO [AsyncDispatcher event handler] 
org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: 
attempt_1537276416358_0014_r_000000_0 TaskAttempt Transitioned from 
FAIL_CONTAINER_CLEANUP to FAIL_TASK_CLEANUP
2018-09-18 15:03:41,882 INFO [CommitterEvent Processor #1] 
org.apache.hadoop.mapreduce.v2.app.commit.CommitterEventHandler: Processing the 
event EventType: TASK_ABORT
2018-09-18 15:03:41,885 INFO [AsyncDispatcher event handler] 
org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: 
attempt_1537276416358_0014_r_000000_0 TaskAttempt Transitioned from 
FAIL_TASK_CLEANUP to FAILED
2018-09-18 15:03:41,888 INFO [AsyncDispatcher event handler] 
org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: 
attempt_1537276416358_0014_r_000000_1 TaskAttempt Transitioned from NEW to 
UNASSIGNED
2018-09-18 15:03:41,888 INFO [Thread-49] 
org.apache.hadoop.mapreduce.v2.app.rm.RMContainerRequestor: 1 failures on node 
node10.cluster.com
2018-09-18 15:03:42,510 INFO [RMCommunicator Allocator] 
org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Before Scheduling: 
PendingReds:1 ScheduledMaps:0 ScheduledReds:0 AssignedMaps:0 AssignedReds:1 
CompletedMaps:1 CompletedReds:0 ContAlloc:2 ContRel:0 HostLocal:1 RackLocal:0
2018-09-18 15:03:42,513 INFO [RMCommunicator Allocator] 
org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Recalculating 
schedule, headroom=<memory:8784, vCores:1, disks:0.0>
2018-09-18 15:03:42,513 INFO [RMCommunicator Allocator] 
org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: All maps assigned. 
Ramping up all remaining reduces:1
2018-09-18 15:03:42,513 INFO [RMCommunicator Allocator] 
org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: After Scheduling: 
PendingReds:0 ScheduledMaps:0 ScheduledReds:1 AssignedMaps:0 AssignedReds:1 
CompletedMaps:1 CompletedReds:0 ContAlloc:2 ContRel:0 HostLocal:1 RackLocal:0
2018-09-18 15:03:43,515 INFO [RMCommunicator Allocator] 
org.apache.hadoop.mapreduce.v2.app.rm.RMContainerRequestor: getResources() for 
application_1537276416358_0014: ask=1 release= 0 newContainers=0 
finishedContainers=1 resourcelimit=<memory:12880, vCores:2, disks:1.33> 
knownNMs=1
2018-09-18 15:03:43,515 INFO [RMCommunicator Allocator] 
org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Received completed 
container container_e1322_1537276416358_0014_01_000003
2018-09-18 15:03:43,515 INFO [RMCommunicator Allocator] 
org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: After Scheduling: 
PendingReds:0 ScheduledMaps:0 ScheduledReds:1 AssignedMaps:0 AssignedReds:0 
CompletedMaps:1 CompletedReds:0 ContAlloc:2 ContRel:0 HostLocal:1 RackLocal:0
2018-09-18 15:03:43,515 INFO [AsyncDispatcher event handler] 
org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: Diagnostics report 
from attempt_1537276416358_0014_r_000000_0: Container killed by the 
ApplicationMaster.
Container killed on request. Exit code is 143
Container exited with a non-zero exit code 143. 
{code}

Maybe this is not a bug, but MR has not enough resources to run this query.




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Created] (HIVE-20587) Reduce phase could not be completed on MapReduce engine

Reply via email to