[
https://issues.apache.org/jira/browse/HIVE-29419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18064488#comment-18064488
]
László Bodor edited comment on HIVE-29419 at 3/10/26 2:06 PM:
--------------------------------------------------------------
this is blocked on an 1.0.0 Tez release, as the currently available 0.10.5
doesn't contain Tez code that makes DAGAppMaster able to start in a standalone
mode, so in the meantime, let's try to make TEZ-4682 and use it in docker
compose files
with my current Dockerfile and build script I get:
{code}
2026-03-10 14:02:44,294 INFO app.DAGAppMaster: Creating DAGAppMaster for
applicationId=application_1700000000000_0001, attemptNum=1,
AMContainerId=container_1700000000000_0001_01_000001, jvmPid=null,
userFromEnv=tez, cliSessionOption=false, pwd=/opt/hadoop, localDirs=/tmp,
logDirs=/opt/tez/logs
2026-03-10 14:02:44,295 ERROR app.DAGAppMaster: Error starting DAGAppMaster
java.io.FileNotFoundException: /opt/hadoop/tez-conf.pb (No such file or
directory)
at java.base/java.io.FileInputStream.open0(Native Method)
at java.base/java.io.FileInputStream.open(Unknown Source)
at java.base/java.io.FileInputStream.<init>(Unknown Source)
at
org.apache.tez.common.TezUtilsInternal.readUserSpecifiedTezConfiguration(TezUtilsInternal.java:79)
at org.apache.tez.dag.app.DAGAppMaster.main(DAGAppMaster.java:2396)
{code}
code here: https://github.com/abstractdog/hive/commits/HIVE-29419/
was (Author: abstractdog):
this is blocked on an 1.0.0 Tez release, as the currently available 0.10.5
doesn't contain Tez code that makes DAGAppMaster able to start in a standalone
mode, so in the meantime, let's try to make TEZ-4682 and use it in docker
compose files
> Provide a Hive-specific docker image for Tez AM
> -----------------------------------------------
>
> Key: HIVE-29419
> URL: https://issues.apache.org/jira/browse/HIVE-29419
> Project: Hive
> Issue Type: Sub-task
> Reporter: László Bodor
> Assignee: László Bodor
> Priority: Major
> Attachments: Screenshot 2026-01-27 at 14.51.39.png
>
>
> This ticket is related to the Dockerized Hive and Tez initiative.
> While Hive Docker was implemented in HIVE-26400, and a Tez AM image is
> currently under development in TEZ-4682, there is an open question about how
> to seamlessly integrate Hive and Tez docker containers (build and runtime
> also)
> TEZ-4682 aims to build a generic Tez AM image, which is crucial for making
> Tez a modern execution engine, while Hive has a lot of dependencies on Tez.
> This makes the independent development of Hive (HiveServer2) and a Tez AM
> docker images quite hard.
> Consider the different classes used in TezAM:
> !Screenshot 2026-01-27 at 14.51.39.png|width=570,height=280!
> Every yellow class induces a separate question about "how Hive jars make
> their way to an independent Tez image", and here is how this Jira could be a
> game-changer. Consider *HiveSplitGenerator* (in hive-exec module) and
> *LlapTaskCommunicator* (in llap-tez module) classes. In the Yarn world, their
> localization was taken care of by Yarn, but from the point we deploy loosely
> coupled Docker containers, we cannot rely on such a mechanism anymore.
> Hence, the *proposal* is to include Tez jars into the Hive image (if they are
> not yet included since HIVE-26400), and make a Tez AM specific entrypoint
> (and separate Dockerfile if needed), that starts {*}DAGAppMaster{*}.
> *Motivation:* a dockerized, real distributed testing of Apache Hive upstream
> as described in
> [https://docs.google.com/document/d/1k92cIzRUXy9IIdivklXR877l8C-0wnmZUrszP_KJgs4]
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)