[
https://issues.apache.org/jira/browse/SPARK-8642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Juliet Hougland updated SPARK-8642:
-----------------------------------
Attachment: yarnretries.log
Log file from failed bc of misconfiguration spakr job.
counting lines with 9 retires in it gives:
cat yarnretries.log | grep 'Already tried 9 time(s);' | wc -l
31
> Ungraceful failure when yarn client is not configured.
> ------------------------------------------------------
>
> Key: SPARK-8642
> URL: https://issues.apache.org/jira/browse/SPARK-8642
> Project: Spark
> Issue Type: Bug
> Components: YARN
> Affects Versions: 1.3.0, 1.3.1
> Reporter: Juliet Hougland
> Priority: Minor
> Attachments: yarnretries.log
>
>
> When HADOOP_CONF_DIR is not configured (ie yarn-site.xml is not available)
> the yarn client will try to submit an application. No connection to the
> resource manager will be able to be established. The client will try to
> connect 10 times (with a max retry of ten), and then do that 30 more time.
> This takes about 5 minutes before an Error is recorded for spark context
> initialization, which is caused by a connect exception. I would expect that
> after the first 1- tries fail, the initialization of the spark context should
> fail too. At least that is what I would think given the logs. An earlier
> failure would be ideal/preferred.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]