Hi,

I'm running Spark 2.4.4. When I execute a simple query "select * from table 
group by col", I found the SparkListenerTaskEnd event in event log reports all 
negative time duration for aggregate time total:

{"ID":6,"Name":"aggregate time total (min, med, 
max)","Update":"2","Value":"-46","Internal":true,"Count Failed 
Values":true,"Metadata":"sql"}

The same thing happens in SparkListenerStageCompleted event:

{"ID":6,"Name":"aggregate time total (min, med, 
max)","Value":"-133","Internal":true,"Count Failed 
Values":true,"Metadata":"sql"}

Then I checked the history server web UI, but the SQL tab displays positive 
numbers for the HashAggregate operator:

aggregate time total (min, med, max): 35 ms (0 ms, 2 ms, 6 ms)

I'm wondering is this a bug in Spark 2.4? If not, how does Spark compute the 
"aggregate time total" from those negative numbers?

Thanks

Reply via email to