Charles Reiss created SPARK-4351:
------------------------------------

             Summary: Record cacheable RDD reads and display RDD miss rates
                 Key: SPARK-4351
                 URL: https://issues.apache.org/jira/browse/SPARK-4351
             Project: Spark
          Issue Type: Improvement
            Reporter: Charles Reiss
            Priority: Minor


Currently, when Spark fails to keep an RDD cached, the user has little
visibility into it (beyond the performance effects), especially if they are not
reading executor logs. We could expose this information in the Web UI and the
event log, as we do for RDD storage information, by reporting RDD reads and
their results in task metrics.

From this, live computation of RDD miss rates is straightforward, and
information in the event log would enable more complicated post-hoc analyses.
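
As a rough illustration of the miss-rate computation, here is a minimal
sketch in Python. It assumes each task reports one record per read of a
cacheable RDD partition; the RddRead record, its fields, and the miss_rates
helper are hypothetical names made up for this example and are not part of
Spark's task metrics.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class RddRead:
    """Hypothetical per-task record of one read of a cacheable RDD partition."""
    rdd_id: int
    partition: int
    hit: bool  # True if served from the cache, False if it had to be recomputed


def miss_rates(reads):
    """Return {rdd_id: misses / total reads} over the given read records."""
    totals = defaultdict(int)
    misses = defaultdict(int)
    for read in reads:
        totals[read.rdd_id] += 1
        if not read.hit:
            misses[read.rdd_id] += 1
    return {rdd_id: misses[rdd_id] / totals[rdd_id] for rdd_id in totals}


# Example records, as they might be collected from finished tasks.
reads = [
    RddRead(rdd_id=1, partition=0, hit=True),
    RddRead(rdd_id=1, partition=1, hit=False),
    RddRead(rdd_id=2, partition=0, hit=True),
]
print(miss_rates(reads))  # {1: 0.5, 2: 0.0}
```

The same aggregation could run incrementally as tasks finish for a live
display, or over records replayed from the event log for later analysis.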



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
