Jing-Long Wu created HIVE-27318:
-----------------------------------

             Summary: Hive CLI creates 2 Tez sessions
                 Key: HIVE-27318
                 URL: https://issues.apache.org/jira/browse/HIVE-27318
             Project: Hive
          Issue Type: Bug
          Components: Hive, Tez
    Affects Versions: 3.2.0
         Environment: Bigtop 3.2.0 on Debian 10
h4.
            Reporter: Jing-Long Wu


Running Hive CLI I noticed in the new version of Hive 3.1.3 and Tez 0.10.1 in 
Bigtop 3.2.0 (as opposed to Bigtop 1.5.0), there are now 2 Tez sessions created 
with "hive.execution.engine=tez" enabled as default:
{code:java}
2023-05-04T13:46:19,189  INFO [main] SessionState: Hive Session ID = 
dbd5b26f-9904-44a1-b70b-9477287e0c03

2023-05-04T13:46:22,275  INFO [pool-6-thread-1] SessionState: Hive Session ID = 
58d15cbd-f815-4c55-b323-02559dcacd61{code}
When compared to previous behavior, there was only 1 session created with the 
main thread. Worst thing is that the session created by the pool thread is not 
even utilized at all, and will just linger and take up resources for the 
ApplicationMaster container, until it dies from timeout defined in 
"tez.session.am.dag.submit.timeout.secs".

Running queries also only utilizes the session created by the main thread ie.

 
{code:java}
Status: Running (Executing on YARN cluster with App id 
application_1682588680048_0135)
2023-05-04T13:50:18,804  INFO [dbd5b26f-9904-44a1-b70b-9477287e0c03 main] 
SessionState: Status: Running (Executing on YARN cluster with App id 
application_1682588680048_0135)
{code}
This is using identical configs in Hive for both Hive 3.1.3 and Hive 2.3.6.

This behavior is not present in Hiveserver2 though, and could only be observed 
when using the Hive CLI.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to