[
https://issues.apache.org/jira/browse/TEZ-4566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ayush Saxena resolved TEZ-4566.
-------------------------------
Fix Version/s: 0.10.4
Resolution: Fixed
> NPE in TezChild while fetching attemptId when container is asked to shut down
> -----------------------------------------------------------------------------
>
> Key: TEZ-4566
> URL: https://issues.apache.org/jira/browse/TEZ-4566
> Project: Apache Tez
> Issue Type: Bug
> Reporter: Ayush Saxena
> Assignee: Ayush Saxena
> Priority: Major
> Fix For: 0.10.4
>
> Time Spent: 1h
> Remaining Estimate: 0h
>
> {noformat}
> 2024-05-21T08:50:28,800 WARN [LocalTaskExecutionThread #0]
> common.TezUtilsInternal: Not configured with appender named: CLA. Cannot
> reconfigure logger output
> 2024-05-21T08:50:28,800 INFO [Dispatcher thread {Central}] impl.VertexImpl:
> Task Completion: vertex_1716306608007_0001_13_00 [Map 1], tasks=4, failed=0,
> killed=0, success=2, completed=2, commits=0, err=null
> 2024-05-21T08:50:28,800 INFO [TezChild] task.ContainerReporter: Attempting
> to fetch new task for container container_1716306608007_0001_00_000024
> 2024-05-21T08:50:28,800 INFO [Dispatcher thread {Central}]
> HistoryEventHandler.criticalEvents:
> [HISTORY][DAG:dag_1716306608007_0001_13][Event:CONTAINER_STOPPED]:
> containerId=container_1716306608007_0001_00_000024,
> stoppedTime=1716306628800, exitStatus=0
> 2024-05-21T08:50:28,800 INFO [TezChild] app.TezTaskCommunicatorImpl:
> Container with id: container_1716306608007_0001_00_000024 is valid, but no
> longer registered, and will be killed
> 2024-05-21T08:50:28,800 INFO [TezChild] task.ContainerReporter: Got
> TaskUpdate for containerId= container_1716306608007_0001_00_000024: 0 ms
> after starting to poll. TaskInfo: shouldDie: true
> 2024-05-21T08:50:28,800 INFO [Dispatcher thread {Central}] impl.VertexImpl:
> Source task attempt completed for vertex: vertex_1716306608007_0001_13_01
> [Reducer 2] attempt: attempt_1716306608007_0001_13_00_000001_0 with state:
> SUCCEEDED vertexState: RUNNING
> 2024-05-21T08:50:28,801 INFO [LocalContainerLauncher-SubTaskRunner]
> launcher.LocalContainerLauncher: Ignoring stop request for containerId:
> container_1716306608007_0001_00_000024
> 2024-05-21T08:50:28,800 INFO [CallbackExecutor]
> launcher.LocalContainerLauncher: Container:
> container_1716306608007_0001_00_000024: Execution Failed:
> java.lang.NullPointerException: null
> at org.apache.tez.runtime.task.TezChild.run(TezChild.java:252)
> ~[tez-runtime-internals-0.10.3.jar:0.10.3]
> at
> org.apache.tez.dag.app.launcher.LocalContainerLauncher$1.call(LocalContainerLauncher.java:409)
> ~[tez-dag-0.10.3.jar:0.10.3]
> at
> org.apache.tez.dag.app.launcher.LocalContainerLauncher$1.call(LocalContainerLauncher.java:400)
> ~[tez-dag-0.10.3.jar:0.10.3]
> at
> com.google.common.util.concurrent.TrustedListenableFutureTask$TrustedFutureInterruptibleTask.runInterruptibly(TrustedListenableFutureTask.java:111)
> ~[guava-22.0.jar:?]
> at
> com.google.common.util.concurrent.InterruptibleTask.run(InterruptibleTask.java:58)
> ~[guava-22.0.jar:?]
> at
> com.google.common.util.concurrent.TrustedListenableFutureTask.run(TrustedListenableFutureTask.java:75)
> ~[guava-22.0.jar:?]
> at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> ~[?:1.8.0_342]
> at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> ~[?:1.8.0_342]
> at java.lang.Thread.run(Thread.java:750) ~[?:1.8.0_342]
> {noformat}
> Can be reproduced by Running {{TestCrudCompactorOnTez}} in Hive code
--
This message was sent by Atlassian Jira
(v8.20.10#820010)