Marcelo Vanzin created SPARK-3788:
-------------------------------------

             Summary: Yarn dist cache code is not friendly to HDFS HA, Federation
                 Key: SPARK-3788
                 URL: https://issues.apache.org/jira/browse/SPARK-3788
             Project: Spark
          Issue Type: Bug
          Components: YARN
            Reporter: Marcelo Vanzin


There are two bugs here.

1. The {{compareFs()}} method in ClientBase treats the 'host' part of the URI 
as an actual host name. With HA and Federation, that part is a namespace name, 
which doesn't resolve to anything, so in those cases {{compareFs()}} always 
reports the file systems as different, even when they are the same.

2. {{prepareLocalResources()}} adds every file to the distributed cache 
through the single FileSystem object instantiated at the start of the method. 
With Federation that doesn't work: the qualified URL's scheme may differ from 
the non-qualified one, so that shared FileSystem instance cannot serve the file.
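
For the first bug, one possible approach is to skip host resolution whenever 
the two 'host' strings already match, and to treat an unresolvable name as 
"different" only when the strings differ. The sketch below is a minimal 
illustration using plain {{java.net.URI}} values in place of Hadoop FileSystem 
objects; the class and method names are hypothetical, and this is not 
necessarily the change that will be applied to {{compareFs()}}.

```java
import java.net.InetAddress;
import java.net.URI;
import java.net.UnknownHostException;
import java.util.Objects;

/** Hypothetical helper; works on plain URIs rather than Hadoop FileSystem objects. */
final class FsCompareSketch {
    private FsCompareSketch() {}

    /** Returns true when both URIs appear to point at the same file system. */
    static boolean sameFileSystem(URI src, URI dst) {
        String srcScheme = src.getScheme();
        if (srcScheme == null || !srcScheme.equals(dst.getScheme())) {
            return false;
        }

        String srcHost = src.getHost();
        String dstHost = dst.getHost();

        // With HA or Federation the 'host' may be a namespace name that never
        // resolves, so only consult DNS when the two strings actually differ.
        if (srcHost != null && dstHost != null && !srcHost.equals(dstHost)) {
            try {
                srcHost = InetAddress.getByName(srcHost).getCanonicalHostName();
                dstHost = InetAddress.getByName(dstHost).getCanonicalHostName();
            } catch (UnknownHostException e) {
                // Different names and at least one does not resolve:
                // assume the file systems are different.
                return false;
            }
        }

        return Objects.equals(srcHost, dstHost) && src.getPort() == dst.getPort();
    }
}
```

With this ordering, two URIs that share a namespace name compare as equal 
without any DNS lookup, while real host names that differ only by alias are 
still canonicalized before comparison.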

Fixes are pretty trivial.
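
For the second bug, the idea is to pick the file system from each file's 
qualified URI instead of reusing one shared instance. In Hadoop, 
{{Path.getFileSystem(Configuration)}} does that lookup; the sketch below only 
models the idea with the standard library so it runs without Hadoop on the 
classpath. {{FsHandle}}, {{PerPathFsSketch}} and the example URIs are 
hypothetical stand-ins, not Spark or Hadoop classes.

```java
import java.net.URI;
import java.util.HashMap;
import java.util.Map;
import java.util.Objects;

/** Hypothetical stand-in for a file system instance bound to one scheme and authority. */
final class FsHandle {
    final String scheme;
    final String authority;

    FsHandle(String scheme, String authority) {
        this.scheme = scheme;
        this.authority = authority;
    }

    /** True when this handle is the right one for the given qualified URI. */
    boolean owns(URI qualified) {
        return Objects.equals(scheme, qualified.getScheme())
            && Objects.equals(authority, qualified.getAuthority());
    }
}

/** Hypothetical registry that hands out one handle per scheme and authority. */
final class PerPathFsSketch {
    private final Map<String, FsHandle> handles = new HashMap<>();

    /** Looks up (or creates) the handle matching the file's own qualified URI. */
    FsHandle handleFor(URI qualified) {
        String key = qualified.getScheme() + "://" + qualified.getAuthority();
        return handles.computeIfAbsent(
            key, k -> new FsHandle(qualified.getScheme(), qualified.getAuthority()));
    }
}
```

In this model, a handle obtained up front for {{viewfs://cluster/}} does not 
own a file whose qualified URI is {{hdfs://ns1/user/app.jar}}, whereas 
{{handleFor()}} called with that qualified URI returns a handle that does.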



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

