[ https://issues.apache.org/jira/browse/SPARK-42227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17687889#comment-17687889 ]
xuanzhiang commented on SPARK-42227: ------------------------------------ spark version : 3.2.1 hadoop version : 3.0.0 job info: !percentile_approx objectHashAggregateExec.png! shuffle read task info : !percentile_approx objectHashAggregateExec_shuffle_task.png! > Use approx_percentile function running slower in spark3 than spark2 > ------------------------------------------------------------------- > > Key: SPARK-42227 > URL: https://issues.apache.org/jira/browse/SPARK-42227 > Project: Spark > Issue Type: Bug > Components: SQL > Affects Versions: 3.2.1 > Reporter: xuanzhiang > Priority: Major > Attachments: percentile+objectHashAggregateExec.png, > percentile+objectHashAggregateExec_shuffle_task.png, > percentile_approx+objectHashAggregateExec.png, > percentile_approx+objectHashAggregateExec_shuffle_task.png > > > approx_percentile(end_ts-start_ts,0.9) cost_p90 > in spark3 , it use objectHashAggregate method , but it shuffle very slow. > when i use percentile , it become fast. i dont know the reson, i think > approx_percentile should fast. -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org