zheju_he created SPARK-44845:
--------------------------------

             Summary: spark job copies jars repeatedly if fs.defaultFS and 
application jar are same url
                 Key: SPARK-44845
                 URL: https://issues.apache.org/jira/browse/SPARK-44845
             Project: Spark
          Issue Type: Bug
          Components: YARN
    Affects Versions: 3.4.1
            Reporter: zheju_he


In the org.apache.spark.deploy.yarn.Client#compareUri method, 
hdfs://hadoop81:8020 and hdfs://192.168.0.81:8020 are regarded as different 
file systems, even though hadoop81 resolves to 192.168.0.81. The cause is the 
previous PR, which made URIs with different user information count as 
different file systems. That PR uses URI.getAuthority to compare the user 
information, but the authority also contains the host, so the authorities of 
the two URIs above always differ. To determine whether the user information 
differs, it is enough to compare URI.getUserInfo.
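
The difference between the two accessors can be illustrated with a minimal 
sketch using plain java.net.URI. It does not reproduce Spark's Client code; 
the class UriAuthorityDemo, its helper methods, and the user name "alice" are 
hypothetical and exist only for this illustration.

```java
import java.net.URI;
import java.util.Objects;

// Hypothetical demo class, not part of Spark.
class UriAuthorityDemo {

    // Compares the whole authority: [user-info@]host[:port].
    // A host name and its IP address are different strings, so this
    // returns false for them even when the user info is identical.
    static boolean sameAuthority(URI a, URI b) {
        return Objects.equals(a.getAuthority(), b.getAuthority());
    }

    // Compares only the user-info component, as the report proposes.
    // Objects.equals treats two absent (null) user infos as equal.
    static boolean sameUserInfo(URI a, URI b) {
        return Objects.equals(a.getUserInfo(), b.getUserInfo());
    }

    public static void main(String[] args) {
        URI byName = URI.create("hdfs://hadoop81:8020");
        URI byIp = URI.create("hdfs://192.168.0.81:8020");
        URI otherUser = URI.create("hdfs://alice@hadoop81:8020");

        // Same cluster addressed by name and by IP: authorities differ,
        // user info does not.
        System.out.println(sameAuthority(byName, byIp));
        System.out.println(sameUserInfo(byName, byIp));

        // Same host with different user info: the user-info check
        // still tells the two URIs apart.
        System.out.println(sameUserInfo(byName, otherUser));
    }
}
```

Under these assumptions, comparing user info alone leaves the decision about 
whether two hosts are the same to a separate host comparison, instead of 
folding the host into the user-information check.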

 

Previous issue and PR:
https://issues.apache.org/jira/browse/SPARK-22587
https://github.com/apache/spark/pull/19885



