[ https://issues.apache.org/jira/browse/SPARK-3979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Marcelo Vanzin updated SPARK-3979: ---------------------------------- Description: This code in ClientBase.scala sets the replication used for files uploaded to HDFS: {code} val replication = sparkConf.getInt("spark.yarn.submit.file.replication", 3).toShort {code} Instead of a hardcoded "3" (which is the default value for HDFS), it should be using the default value from the HDFS conf ("dfs.replication"). was: This code in ClientBase.scala sets the replication used for files uploaded to HDFS: {{noformat}} val replication = sparkConf.getInt("spark.yarn.submit.file.replication", 3).toShort {{noformat}} Instead of a hardcoded "3" (which is the default value for HDFS), it should be using the default value from the HDFS conf ("dfs.replication"). > Yarn backend's default file replication should match HDFS's default one > ----------------------------------------------------------------------- > > Key: SPARK-3979 > URL: https://issues.apache.org/jira/browse/SPARK-3979 > Project: Spark > Issue Type: Bug > Components: YARN > Reporter: Marcelo Vanzin > Assignee: Marcelo Vanzin > Priority: Minor > > This code in ClientBase.scala sets the replication used for files uploaded to > HDFS: > {code} > val replication = sparkConf.getInt("spark.yarn.submit.file.replication", > 3).toShort > {code} > Instead of a hardcoded "3" (which is the default value for HDFS), it should > be using the default value from the HDFS conf ("dfs.replication"). -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org