[ 
https://issues.apache.org/jira/browse/SPARK-21598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16110028#comment-16110028
 ] 

Marcelo Vanzin commented on SPARK-21598:
----------------------------------------

[~steve_l] has been working for some time on SPARK-11373, which has some 
overlapping goals. I'm fine with adding metrics, I just want the metrics 
exposed to make sense.

This seems to be a different way to expose those metrics, so I'd rather pick 
one way of doing it. The approach in SPARK-11373 integrates with existing tools 
that collect metrics, while this approach would require people to write custom 
code, which makes me by default prefer the former.

> Collect usability/events information from Spark History Server
> --------------------------------------------------------------
>
>                 Key: SPARK-21598
>                 URL: https://issues.apache.org/jira/browse/SPARK-21598
>             Project: Spark
>          Issue Type: Improvement
>          Components: Scheduler
>    Affects Versions: 2.0.2
>            Reporter: Eric Vandenberg
>            Priority: Minor
>
> The Spark History Server doesn't currently have a way to collect 
> usability/performance on its main activity, loading/replay of history files.  
> We'd like to collect this information to monitor, target and measure 
> improvements in the spark debugging experience (via history server usage.)  
> Once available these usability events could be analyzed using other analytics 
> tools.
> The event info to collect:
>     SparkHistoryReplayEvent(
>         logPath: String,
>         logCompressionType: String,
>         logReplayException: String // if an error
>         logReplayAction: String // user replay, vs checkForLogs replay
>         logCompleteFlag: Boolean,
>         logFileSize: Long,
>         logFileSizeUncompressed: Long,
>         logLastModifiedTimestamp: Long,
>         logCreationTimestamp: Long,
>         logJobId: Long,
>         logNumEvents: Int,
>         logNumStages: Int,
>         logNumTasks: Int
>         logReplayDurationMillis: Long
> )
> The main spark engine has a SparkListenerInterface through which all compute 
> engine events are broadcast.  It probably doesn't make sense to reuse this 
> abstraction for broadcasting spark history server events since the "events" 
> are not related or compatible with one another.  Also note the metrics 
> registry collects history caching metrics but doesn't provide the type of 
> above information.
> Proposal here would be to add some basic event listener infrastructure to 
> capture history server activity events.  This would work similar to how the 
> SparkListener infrastructure works.  It could be configured in a similar 
> manner, eg. spark.history.listeners=MyHistoryListenerClass.
> Open to feedback / suggestions / comments on the approach or alternatives.
> cc: [~vanzin] [~cloud_fan] [~ajbozarth] [~jiangxb1987]



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to