Github user srowen commented on a diff in the pull request:
https://github.com/apache/spark/pull/13738#discussion_r67813873
--- Diff: core/src/main/scala/org/apache/spark/deploy/SparkHadoopUtil.scala
---
@@ -421,6 +421,13 @@ object SparkHadoopUtil {
val SPARK_YARN_CREDS_COUNTER_DELIM = "-"
+ // Just load HdfsConfiguration into the class loader to add
+ // hdfs-site.xml as a default configuration file otherwise
+ // some HDFS related configurations doesn't ship to Executors and
+ // it can cause UnknownHostException when NameNode HA is enabled.
+ // See SPARK-11227 for more details.
+ Utils.classForName("org.apache.hadoop.hdfs.HdfsConfiguration")
--- End diff --
Just reference it in any way, but, I guess we should ask, what does
classloading do that we need, and is there any way to do that directly? this is
fairly indirect. Is it that `
Configuration.addDefaultResource("hdfs-site.xml");` must be called?
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]