[ https://issues.apache.org/jira/browse/SPARK-6469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14376017#comment-14376017 ]
Sean Owen commented on SPARK-6469:
----------------------------------

Ah I get you, I think you have a point. So, you are running in YARN on Hadoop 2 in cluster mode and neither {{YARN_LOCAL_DIRS}} nor {{CONTAINER_ID}} is set. Paging [~sandyr] [~tgraves] [~vanzin] for thoughts on whether that's expected, not expected, or means a check here has to be adjusted.

> Local directories configured for YARN are not used in yarn-client mode
> ----------------------------------------------------------------------
>
>                 Key: SPARK-6469
>                 URL: https://issues.apache.org/jira/browse/SPARK-6469
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Core
>            Reporter: Christophe PRÉAUD
>            Priority: Minor
>         Attachments: TestYarnVars.scala
>
>
> According to the [Spark YARN doc page|http://spark.apache.org/docs/latest/running-on-yarn.html#important-notes], Spark executors will use the local directories configured for YARN, not spark.local.dir, which should be ignored.
> While this works correctly in yarn-cluster mode, I've found that it is not the case in yarn-client mode.
> The problem seems to originate in the method [isRunningInYarnContainer|https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/util/Utils.scala#L686].
> Indeed, I've checked with a simple application that the {{CONTAINER_ID}} environment variable is correctly set in yarn-cluster mode (to something like {{container_1426666761810_0151_01_000001}}), but not in yarn-client mode.
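
For anyone following along, here is a minimal sketch of the kind of environment check being discussed. It is not the Spark source and not the attached TestYarnVars.scala. The object name {{YarnEnvCheck}} and the helper {{looksLikeYarnContainer}} are made up for illustration, and the only assumption is that the check depends on the two variables named in this thread, {{CONTAINER_ID}} and {{YARN_LOCAL_DIRS}}.

```scala
// Illustrative sketch only: YarnEnvCheck and looksLikeYarnContainer are
// hypothetical names, not part of Spark.
object YarnEnvCheck {
  // The two environment variables mentioned in this thread.
  val yarnVars: Seq[String] = Seq("CONTAINER_ID", "YARN_LOCAL_DIRS")

  // True when at least one of the variables is present in the given environment.
  // Taking the environment as a Map keeps the check easy to exercise by hand.
  def looksLikeYarnContainer(env: Map[String, String]): Boolean =
    yarnVars.exists(env.contains)

  def main(args: Array[String]): Unit = {
    val env = sys.env
    // Print what the current JVM actually sees for each variable.
    yarnVars.foreach { name =>
      println(s"$name = ${env.getOrElse(name, "<not set>")}")
    }
    println(s"looksLikeYarnContainer = ${looksLikeYarnContainer(env)}")
  }
}
```

Note that {{main}} only reports the environment of the JVM it runs in. To compare yarn-client and yarn-cluster mode, the same lookup would have to run inside the process of interest (driver or executor) in each mode.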