Marcelo Vanzin created SPARK-3979:
-------------------------------------

             Summary: Yarn backend's default file replication should match HDFS's default one
                 Key: SPARK-3979
                 URL: https://issues.apache.org/jira/browse/SPARK-3979
             Project: Spark
          Issue Type: Bug
          Components: YARN
            Reporter: Marcelo Vanzin
            Priority: Minor

This code in ClientBase.scala sets the replication used for files uploaded to HDFS:

{{noformat}}
val replication = sparkConf.getInt("spark.yarn.submit.file.replication", 3).toShort
{{noformat}}

Instead of a hardcoded "3" (which is the default value for HDFS), it should be using the default value from the HDFS conf ("dfs.replication").
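
One possible shape for the change described above, as a minimal sketch rather than the actual patch: read "dfs.replication" from the Hadoop configuration and use it as the fallback when "spark.yarn.submit.file.replication" is not set. The object name `ReplicationDefault` and the method `submitFileReplication` are hypothetical and do not exist in Spark; the sketch also assumes a Hadoop `Configuration` is available at the point where ClientBase.scala computes the value.

```scala
import org.apache.hadoop.conf.Configuration
import org.apache.spark.SparkConf

// Hypothetical helper for illustration only; not part of Spark.
object ReplicationDefault {
  def submitFileReplication(sparkConf: SparkConf, hadoopConf: Configuration): Short = {
    // Use the cluster's configured HDFS default. The literal 3 is kept only
    // as a last resort for the case where dfs.replication is not defined.
    val hdfsDefault = hadoopConf.getInt("dfs.replication", 3)

    // An explicit Spark setting still takes precedence over the HDFS default.
    sparkConf.getInt("spark.yarn.submit.file.replication", hdfsDefault).toShort
  }
}
```

With this arrangement a cluster configured with, say, dfs.replication=2 would upload files with replication 2 unless the user overrides it through the Spark setting.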