[
https://issues.apache.org/jira/browse/SPARK-4351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen updated SPARK-4351:
-----------------------------
Component/s: Spark Core
> Record cacheable RDD reads and display RDD miss rates
> -----------------------------------------------------
>
> Key: SPARK-4351
> URL: https://issues.apache.org/jira/browse/SPARK-4351
> Project: Spark
> Issue Type: Improvement
> Components: Spark Core
> Reporter: Charles Reiss
> Priority: Minor
>
> Currently, when Spark fails to keep an RDD cached, the user has little
> visibility into it (beyond performance effects), especially if the user is not
> reading executor logs. We could expose this information in the Web UI and the
> event log, as we do for RDD storage information, by reporting RDD reads and
> their results with task metrics.
> From this, live computation of RDD miss rates is straightforward, and
> information in the event log would enable more complicated post-hoc analyses.
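
To illustrate the miss-rate computation described in the issue, here is a minimal Python sketch. It assumes each task reports one record per cacheable RDD read, saying whether the read was served from the cache. The `RddRead` record and the `miss_rates` helper are hypothetical names invented for this example; they are not part of Spark's task metrics or event log format.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class RddRead:
    """Hypothetical record of one cacheable RDD partition read."""
    rdd_id: int
    partition: int
    hit: bool  # True if the partition was served from the cache


def miss_rates(reads):
    """Return {rdd_id: fraction of reads that missed the cache}."""
    totals = defaultdict(int)
    misses = defaultdict(int)
    for read in reads:
        totals[read.rdd_id] += 1
        if not read.hit:
            misses[read.rdd_id] += 1
    return {rdd_id: misses[rdd_id] / totals[rdd_id] for rdd_id in totals}


# Made-up sample data: RDD 1 misses on two of four reads, RDD 2 never misses.
reads = [
    RddRead(rdd_id=1, partition=0, hit=True),
    RddRead(rdd_id=1, partition=1, hit=False),
    RddRead(rdd_id=1, partition=2, hit=False),
    RddRead(rdd_id=1, partition=3, hit=True),
    RddRead(rdd_id=2, partition=0, hit=True),
]
rates = miss_rates(reads)
```

The same aggregation could run incrementally for a live view or over records replayed from an event log for post-hoc analysis.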