[
https://issues.apache.org/jira/browse/HIVE-18034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16356393#comment-16356393
]
Sahil Takiar commented on HIVE-18034:
-------------------------------------
Looks like the Spark Web UI uses the following logic to detect if excessive GC
time has been spent:
{code}
// totalDuration range from 0 to 50% GC time, alpha max = 1
function totalDurationAlpha(totalGCTime, totalDuration) {
return totalDuration > 0 ?
(Math.min(totalGCTime / totalDuration + 0.5, 1)) : 1;
}
// When GCTimePercent is edited change ToolTips.TASK_TIME to match
var GCTimePercent = 0.1;
function totalDurationStyle(totalGCTime, totalDuration) {
// Red if GC time over GCTimePercent of total time
return (totalGCTime > GCTimePercent * totalDuration) ?
("hsla(0, 100%, 50%, " + totalDurationAlpha(totalGCTime, totalDuration)
+ ")") : "";
{code}
We should be able to use the info in {{o.a.hive.spark.client.metrics}} to do
the same calculation. {{executorRunTime}} + {{ executorDeserializeTime}} should
give us the total executor duration, and {{jvmGCTime}} should give the GC time.
I think a {{LOG#warn}} and a warning on the console should be printed.
> Improving logging with HoS executors spend lots of time in GC
> -------------------------------------------------------------
>
> Key: HIVE-18034
> URL: https://issues.apache.org/jira/browse/HIVE-18034
> Project: Hive
> Issue Type: Sub-task
> Components: Spark
> Reporter: Sahil Takiar
> Priority: Major
>
> There are times when Spark will spend lots of time doing GC. The Spark
> History UI shows a bunch of red flags when too much time is spent in GC. It
> would be nice if those warnings are propagated to Hive.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)