StephenZou created SPARK-22243:
----------------------------------

             Summary: job failed to restart from checkpoint
                 Key: SPARK-22243
                 URL: https://issues.apache.org/jira/browse/SPARK-22243
             Project: Spark
          Issue Type: Bug
          Components: Spark Core
    Affects Versions: 2.2.0, 2.1.0
            Reporter: StephenZou


My spark-defaults.conf has an item related to the issue, I upload all jars in 
spark's jars folder to the hdfs path:
spark.yarn.jars  hdfs:///spark/cache/spark2.2/* 

Streaming job failed to restart from checkpoint, ApplicationMaster throws  
"Error: Could not find or load main class 
org.apache.spark.deploy.yarn.ExecutorLauncher".  The problem is always 
reproducible.

I examine the sparkconf object recovered from checkpoint, and find 
spark.yarn.jars are set empty, which let all jars not exist in AM side. The 
solution is spark.yarn.jars should be reload from properties files when 
recovering from checkpoint. 

attach is a demo to reproduce the issue.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to