[ https://issues.apache.org/jira/browse/HIVE-9561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14325120#comment-14325120 ]
Rui Li commented on HIVE-9561: ------------------------------ Hi [~xuefuz], thanks very much for taking care of this. I can't really work on it due to limited network access. Sorry for the inconvenience. > SHUFFLE_SORT should only be used for order by query [Spark Branch] > ------------------------------------------------------------------ > > Key: HIVE-9561 > URL: https://issues.apache.org/jira/browse/HIVE-9561 > Project: Hive > Issue Type: Sub-task > Components: Spark > Reporter: Rui Li > Assignee: Rui Li > Attachments: HIVE-9561.1-spark.patch, HIVE-9561.2-spark.patch, > HIVE-9561.3-spark.patch, HIVE-9561.4-spark.patch, HIVE-9561.5-spark.patch, > HIVE-9561.6-spark.patch > > > The {{sortByKey}} shuffle launches probe jobs. Such jobs can hurt performance > and are difficult to control. So we should limit the use of {{sortByKey}} to > order by query only. -- This message was sent by Atlassian JIRA (v6.3.4#6332)