[ 
https://issues.apache.org/jira/browse/YARN-8079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16417816#comment-16417816
 ] 

Eric Yang edited comment on YARN-8079 at 3/29/18 7:57 PM:
----------------------------------------------------------

[~leftnoteasy] For accessing remote HDFS, it requires username + password of 
the remote cluster, and the cluster has a way to contact to remote cluster KDC 
server to verify the user.  I don't think Hadoop supports 
hdfs://user:pass@cluster:port/path.  I think remoteFile throw me off in 
thinking to access another HDFS other than current cluster.  Sorry for the 
confusion.  For S3, s3://ID:SECRET@BUCKET/ maybe this works.


was (Author: eyang):
[~leftnoteasy] For accessing remote HDFS, it requires username + password of 
the remote cluster, and the cluster has a way to contact to remote cluster KDC 
server to verify the user.  I don't think Hadoop supports 
hdfs://user:pass@cluster:port/path.  I think remoteFile throw me off in 
thinking to access another HDFS other than current cluster.  Sorry for the 
confusion.  For S3, s3://ID:SECRET@BUCKET/ maybe this works.  +1 for patch 3.

> YARN native service should respect source file of ConfigFile inside 
> Service/Component spec
> ------------------------------------------------------------------------------------------
>
>                 Key: YARN-8079
>                 URL: https://issues.apache.org/jira/browse/YARN-8079
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Wangda Tan
>            Assignee: Wangda Tan
>            Priority: Blocker
>         Attachments: YARN-8079.001.patch, YARN-8079.002.patch, 
> YARN-8079.003.patch, YARN-8079.004.patch, YARN-8079.005.patch
>
>
> Currently, {{srcFile}} is not respected. {{ProviderUtils}} doesn't properly 
> read srcFile, instead it always construct {{remoteFile}} by using 
> componentDir and fileName of {{destFile}}:
> {code}
> Path remoteFile = new Path(compInstanceDir, fileName);
> {code} 
> To me it is a common use case which services have some files existed in HDFS 
> and need to be localized when components get launched. (For example, if we 
> want to serve a Tensorflow model, we need to localize Tensorflow model 
> (typically not huge, less than GB) to local disk. Otherwise launched docker 
> container has to access HDFS.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to