ulysses-you commented on a change in pull request #32960:
URL: https://github.com/apache/spark/pull/32960#discussion_r654154456
##########
File path: docs/sql-performance-tuning.md
##########
@@ -273,7 +273,32 @@ This feature coalesces the post shuffle partitions based
on the map output stati
</table>
### Converting sort-merge join to broadcast join
-AQE converts sort-merge join to broadcast hash join when the runtime
statistics of any join side is smaller than the broadcast hash join threshold.
This is not as efficient as planning a broadcast hash join in the first place,
but it's better than keep doing the sort-merge join, as we can save the sorting
of both the join sides, and read shuffle files locally to save network
traffic(if `spark.sql.adaptive.localShuffleReader.enabled` is true)
+AQE converts sort-merge join to broadcast hash join when the runtime
statistics of any join side is smaller than the adaptive broadcast hash join
threshold. This is not as efficient as planning a broadcast hash join in the
first place, but it's better than keep doing the sort-merge join, as we can
save the sorting of both the join sides, and read shuffle files locally to save
network traffic(if `spark.sql.adaptive.localShuffleReader.enabled` is true)
Review comment:
`smaller than the broadcast hash join` -> `smaller than the adaptive
broadcast hash join`
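
For context, the rule the reworded sentence describes can be sketched as a toy model. This is not Spark's planner code: the helper name `should_convert_to_broadcast` and its parameters are hypothetical, and picking the smaller side to broadcast is an assumption made for illustration. I believe the "adaptive broadcast hash join threshold" refers to `spark.sql.adaptive.autoBroadcastJoinThreshold`, but the diff above does not name the config, so treat that as an assumption too.

```python
from typing import Optional


def should_convert_to_broadcast(
    left_bytes: int, right_bytes: int, threshold_bytes: int
) -> Optional[str]:
    """Toy model of the documented rule (hypothetical helper, not a Spark API).

    Returns which side would be broadcast ("left" or "right"), or None if
    the sort-merge join is kept. Sizes stand in for runtime statistics.
    """
    # Assumption: a negative threshold means broadcasting is disabled.
    if threshold_bytes < 0:
        return None

    # Assumption: when a side qualifies, the smaller one is broadcast.
    side, size = min(
        (("left", left_bytes), ("right", right_bytes)), key=lambda s: s[1]
    )

    # "Smaller than the threshold", following the wording of the doc sentence.
    return side if size < threshold_bytes else None


if __name__ == "__main__":
    ten_mib = 10 * 1024**2
    print(should_convert_to_broadcast(5 * 1024**2, 2 * 1024**3, ten_mib))
    print(should_convert_to_broadcast(2 * 1024**3, 3 * 1024**3, ten_mib))
```

The point of the wording change is that the size comparison happens at runtime against the adaptive threshold, which may differ from the threshold used at planning time.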