[ 
https://issues.apache.org/jira/browse/SPARK-3979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Marcelo Vanzin updated SPARK-3979:
----------------------------------
    Description: 
This code in ClientBase.scala sets the replication used for files uploaded to 
HDFS:

{code}
    val replication = sparkConf.getInt("spark.yarn.submit.file.replication", 3).toShort
{code}

Instead of a hardcoded "3" (which is the default value for HDFS), it should be 
using the default value from the HDFS conf ("dfs.replication").

  was:
This code in ClientBase.scala sets the replication used for files uploaded to 
HDFS:

{{noformat}}
    val replication = sparkConf.getInt("spark.yarn.submit.file.replication", 3).toShort
{{noformat}}

Instead of a hardcoded "3" (which is the default value for HDFS), it should be 
using the default value from the HDFS conf ("dfs.replication").


> Yarn backend's default file replication should match HDFS's default one
> -----------------------------------------------------------------------
>
>                 Key: SPARK-3979
>                 URL: https://issues.apache.org/jira/browse/SPARK-3979
>             Project: Spark
>          Issue Type: Bug
>          Components: YARN
>            Reporter: Marcelo Vanzin
>            Assignee: Marcelo Vanzin
>            Priority: Minor
>
> This code in ClientBase.scala sets the replication used for files uploaded to 
> HDFS:
> {code}
>     val replication = sparkConf.getInt("spark.yarn.submit.file.replication", 3).toShort
> {code}
> Instead of a hardcoded "3" (which is the default value for HDFS), it should 
> be using the default value from the HDFS conf ("dfs.replication").
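
The fallback order proposed in the description could look roughly like the sketch below. It is illustrative only and is not the actual patch: plain Maps stand in for SparkConf and the Hadoop Configuration so the snippet runs without either library, and the names ReplicationDefault and resolveReplication are hypothetical.

```scala
// Hypothetical sketch: Maps stand in for SparkConf and the Hadoop Configuration.
object ReplicationDefault {
  val SparkKey = "spark.yarn.submit.file.replication"
  val HdfsKey = "dfs.replication"
  // HDFS's own default, used only when dfs.replication is not set at all.
  val HdfsBuiltInDefault = 3

  // Resolution order: explicit Spark setting, then the HDFS conf value, then 3.
  def resolveReplication(
      sparkSettings: Map[String, String],
      hdfsSettings: Map[String, String]): Short = {
    val hdfsDefault =
      hdfsSettings.get(HdfsKey).map(_.trim.toInt).getOrElse(HdfsBuiltInDefault)
    sparkSettings.get(SparkKey).map(_.trim.toInt).getOrElse(hdfsDefault).toShort
  }

  def main(args: Array[String]): Unit = {
    val hdfs = Map(HdfsKey -> "2")
    println(resolveReplication(Map.empty, hdfs))            // HDFS conf wins over 3
    println(resolveReplication(Map(SparkKey -> "5"), hdfs)) // Spark setting wins
    println(resolveReplication(Map.empty, Map.empty))       // falls back to 3
  }
}
```

With this ordering, a cluster whose HDFS conf sets dfs.replication to 2 would get 2 for uploaded files unless spark.yarn.submit.file.replication is set explicitly.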



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
