[ 
https://issues.apache.org/jira/browse/HIVE-27023?focusedWorklogId=843592&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-843592
 ]

ASF GitHub Bot logged work on HIVE-27023:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 03/Feb/23 22:15
            Start Date: 03/Feb/23 22:15
    Worklog Time Spent: 10m 
      Work Description: maswin opened a new pull request, #4015:
URL: https://github.com/apache/hive/pull/4015

   ### What changes were proposed in this pull request?
   A setting to disable tez session being preemptively opened during startup of 
hive cli.
   
   
   ### Why are the changes needed?
   DML only operations won't need a tez session to be opened. Oozie action that 
launches hive cli for DML only operations  need this setting. Sometimes before 
the tez session thread could completed, the oozie hive action completes its DML 
operations and clears the scratch folders causing the tez session to fail with 
following exception:
   
   Added a setting disable Tez session being opened during the startup of Hive 
cli. This is particularly needed when Hive action is being launched by Oozie to 
perform DML only operations. In that cases before the tez session thread could 
complete the launcher gets completed and clears all the scratch directories. 
This leads to a failure with following stack trace:
   
   ```
   Added a setting disable Tez session being opened during the startup of Hive 
cli. This is particularly needed when Hive action is being launched by Oozie to 
perform DML only operations. In that cases before the tez session thread could 
complete the launcher gets completed and clears all the scratch directories. 
This leads to a failure with following stack trace:
   
   Application application_1667416881396_24229473 failed 3 times due to AM 
Container for appattempt_1667416881396_24229473_000003 exited with exitCode: 
-1000
   Failing this attempt.Diagnostics: [2023-02-02 19:02:12.139]File does not 
exist: 
hdfs://<name_node>/tmp/<db>/_tez_session_dir/4050c4b0-b7af-4eda-832b-399c954eb576/.tez/application_1667416881396_24229473/tez.session.local-resources.pbjava.io.FileNotFoundException:
 File does not exist: 
hdfs://<name_node>/tmp/<db>/_tez_session_dir/4050c4b0-b7af-4eda-832b-399c954eb576/.tez/application_1667416881396_24229473/tez.session.local-resources.pbat
 
org.apache.hadoop.hdfs.DistributedFileSystem$29.doCall(DistributedFileSystem.java:1529)at
 
org.apache.hadoop.hdfs.DistributedFileSystem$29.doCall(DistributedFileSystem.java:1522)at
 
org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)at
 
org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:1537)at
   ```
   
   
   ### Does this PR introduce _any_ user-facing change?
   New config value, otherwise no.
   
   
   ### How was this patch tested?
   Manually tested
   




Issue Time Tracking
-------------------

            Worklog Id:     (was: 843592)
    Remaining Estimate: 0h
            Time Spent: 10m

> Add setting to disable hive session being started during Hive Cli startup
> -------------------------------------------------------------------------
>
>                 Key: HIVE-27023
>                 URL: https://issues.apache.org/jira/browse/HIVE-27023
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Alagappan Maruthappan
>            Assignee: Alagappan Maruthappan
>            Priority: Minor
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> Added a setting disable Tez session being opened during the startup of Hive 
> cli. This is particularly needed when Hive action is being launched by Oozie 
> to perform DML only operations. In that cases before the tez session thread 
> could complete the launcher gets completed and clears all the scratch 
> directories. This leads to a failure with following stack trace:
> {code:java}
> Application application_1667416881396_24229473 failed 3 times due to AM 
> Container for appattempt_1667416881396_24229473_000003 exited with exitCode: 
> -1000
> Failing this attempt.Diagnostics: [2023-02-02 19:02:12.139]File does not 
> exist: 
> hdfs://<name_node>/tmp/<db>/_tez_session_dir/4050c4b0-b7af-4eda-832b-399c954eb576/.tez/application_1667416881396_24229473/tez.session.local-resources.pbjava.io.FileNotFoundException:
>  File does not exist: 
> hdfs://<name_node>/tmp/<db>/_tez_session_dir/4050c4b0-b7af-4eda-832b-399c954eb576/.tez/application_1667416881396_24229473/tez.session.local-resources.pbat
>  
> org.apache.hadoop.hdfs.DistributedFileSystem$29.doCall(DistributedFileSystem.java:1529)at
>  
> org.apache.hadoop.hdfs.DistributedFileSystem$29.doCall(DistributedFileSystem.java:1522)at
>  
> org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)at
>  
> org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:1537)at
>  org.apache.hadoop.yarn.util.FSDownload.copy(FSDownload.java:253)at 
> org.apache.hadoop.yarn.util.FSDownload.access$000(FSDownload.java:63)at 
> org.apache.hadoop.yarn.util.FSDownload$2.run(FSDownload.java:366)at 
> org.apache.hadoop.yarn.util.FSDownload$2.run(FSDownload.java:364)at 
> java.security.AccessController.doPrivileged(Native Method)at 
> javax.security.auth.Subject.doAs(Subject.java:422)at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:2136)at
>  
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:2116)at
>  org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:364)at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ContainerLocalizer$FSDownloadWrapper.doDownloadCall(ContainerLocalizer.java:249)at
>  
> org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ContainerLocalizer$FSDownloadWrapper.call(ContainerLocalizer.java:242)at
>  
> org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ContainerLocalizer$FSDownloadWrapper.call(ContainerLocalizer.java:230)at
>  java.util.concurrent.FutureTask.run(FutureTask.java:266)at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)at 
> java.util.concurrent.FutureTask.run(FutureTask.java:266)at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)at
>  
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)at
>  java.lang.Thread.run(Thread.java:748) {code}
> Also opening up of tez session in such cases is not needed.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to