[ 
https://issues.apache.org/jira/browse/TEZ-4104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16989572#comment-16989572
 ] 

Nikita Jaisawal commented on TEZ-4104:
--------------------------------------

To rectify this issue I tried refering the tar file directly from S3 without 
changing fs.defaultFS to s3://bucketName.

I did the following changes in tez-site.xml:
 * set tez.lib.uris property to s3://bucketName

In hive-site.xml:
 *  set hive.execution.engine=tez
 * fs.defaultFS=hdfs:// 

But  still it is searching for tar file in hdfs.

S3 location where tez.tar.gz has been stored: 
s3://srev-emr-dev/F129/tez/tez.tar.gz where s3 bucket name is 'srev-emr-dev'

 

Facing below error:

java.io.FileNotFoundException: File does not exist: /F129/tez/tez.tar.gz

java.io.FileNotFoundException: File does not exist: /F129/tez/tez.tar.gz at 
org.apache.hadoop.hdfs.DistributedFileSystem$27.doCall(DistributedFileSystem.java:1441)
 ~[hadoop-hdfs-client-2.8.3-amzn-1.jar:?] at 
org.apache.hadoop.hdfs.DistributedFileSystem$27.doCall(DistributedFileSystem.java:1434)
 ~[hadoop-hdfs-client-2.8.3-amzn-1.jar:?] at 
org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
 ~[hadoop-common-2.8.3-amzn-1.jar:?] at 
org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:1449)
 ~[hadoop-hdfs-client-2.8.3-amzn-1.jar:?] at 
org.apache.tez.client.TezClientUtils.checkAncestorPermissionsForAllUsers(TezClientUtils.java:1031)
 ~[tez-api-0.8.4.jar:0.8.4] at 
org.apache.tez.client.TezClientUtils.addLocalResources(TezClientUtils.java:275) 
~[tez-api-0.8.4.jar:0.8.4] at 
org.apache.tez.client.TezClientUtils.setupTezJarsLocalResources(TezClientUtils.java:183)
 ~[tez-api-0.8.4.jar:0.8.4] at 
org.apache.tez.client.TezClient.getTezJarResources(TezClient.java:1057) 
~[tez-api-0.8.4.jar:0.8.4] at 
org.apache.tez.client.TezClient.start(TezClient.java:447) 
~[tez-api-0.8.4.jar:0.8.4] at 
org.apache.hadoop.hive.ql.exec.tez.TezSessionState.startSessionAndContainers(TezSessionState.java:376)
 ~[hive-exec-2.3.2-amzn-0.jar:2.3.2-amzn-0] at 
org.apache.hadoop.hive.ql.exec.tez.TezSessionState.access$000(TezSessionState.java:87)
 ~[hive-exec-2.3.2-amzn-0.jar:2.3.2-amzn-0] at 
org.apache.hadoop.hive.ql.exec.tez.TezSessionState$1.call(TezSessionState.java:330)
 [hive-exec-2.3.2-amzn-0.jar:2.3.2-amzn-0] at 
org.apache.hadoop.hive.ql.exec.tez.TezSessionState$1.call(TezSessionState.java:326)
 [hive-exec-2.3.2-amzn-0.jar:2.3.2-amzn-0] at 
java.util.concurrent.FutureTask.run(FutureTask.java:266) [?:1.8.0_222] at 
java.lang.Thread.run(Thread.java:748) [?:1.8.0_222] 

> Upscale/Downscale of the emr cluster corrupts hdfs jars /apps/tez/tez.tar.gz
> ----------------------------------------------------------------------------
>
>                 Key: TEZ-4104
>                 URL: https://issues.apache.org/jira/browse/TEZ-4104
>             Project: Apache Tez
>          Issue Type: Bug
>    Affects Versions: 0.8.4
>            Reporter: Nikita Jaisawal
>            Priority: Major
>
> I am facing an issue in which the JARs kept in *hdfs:///apps/tez/tez/tar.gz* 
> gets corrupted whenever I do emr cluster resizing after setting the property 
> in hive-site.xml:
> *set hive.execution.engine=tez* 
> (with default file system as *hdfs://)*
> How do I resolve this issue? Can anyone suggest?
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to