Parag Chaudhari created SPARK-13914:
---------------------------------------

             Summary: Add functionality to back up spark event logs
                 Key: SPARK-13914
                 URL: https://issues.apache.org/jira/browse/SPARK-13914
             Project: Spark
          Issue Type: Improvement
          Components: Scheduler
    Affects Versions: 1.6.0, 1.6.2, 2.0.0
            Reporter: Parag Chaudhari


Spark event logs are usually stored in HDFS when running Spark on YARN. In a 
cloud environment, these HDFS files are often stored on the disks of ephemeral 
instances that could go away once the instances are terminated. Users may want 
to persist the event logs as the event happens for issue investigation and 
performance analysis before and after the cluster is terminated. The backup 
path can be managed by the spark users based on their needs. For example, some 
users may copy the event logs to a cloud storage service directly and keep them 
there forever. While some other users may want to store the event logs on local 
disks and back them up to a cloud storage service from time to time. Other 
users will not want to use the feature, so this feature should be off by 
default; users enable the feature when and only when they need it.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to