cloud-fan commented on a change in pull request #24164: [SPARK-27225][SQL] Implement join strategy hints
URL: https://github.com/apache/spark/pull/24164#discussion_r273090423
##########
File path:
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/hints.scala
##########
@@ -66,17 +66,89 @@ object JoinHint {
/**
* The hint attributes to be applied on a specific node.
*
- * @param broadcast If set to true, it indicates that the broadcast hash join is the preferred join
- * strategy and the node with this hint is preferred to be the build side.
+ * @param strategy The preferred join strategy.
*/
-case class HintInfo(broadcast: Boolean = false) {
+case class HintInfo(strategy: Option[JoinStrategyHint] = None) {
+
+ /**
+ * Combine two [[HintInfo]]s into one [[HintInfo]], in which the new strategy will be the strategy
+ * in this [[HintInfo]] if defined, otherwise the strategy in the other [[HintInfo]].
Review comment:
Shouldn't we consider the priority here?
> Spark prioritizes the `BROADCAST` hint over the `MERGE` hint over the `SHUFFLE_HASH` hint over the `SHUFFLE_REPLICATE_NL` hint.
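
To make the question concrete, here is a minimal sketch of what a priority-aware merge could look like. It is not the PR's implementation: the `priority` field, the `merge` method name, and the case-object definitions below are hypothetical stand-ins, assuming only the ordering quoted above (a lower number means higher priority). Whether merging should behave this way at all is the open question.

```scala
// Hypothetical stand-ins for the PR's hint types; the `priority` field is an
// assumption made for illustration (lower number = higher priority).
sealed abstract class JoinStrategyHint(val priority: Int)
case object BROADCAST extends JoinStrategyHint(0)
case object MERGE extends JoinStrategyHint(1)
case object SHUFFLE_HASH extends JoinStrategyHint(2)
case object SHUFFLE_REPLICATE_NL extends JoinStrategyHint(3)

case class HintInfo(strategy: Option[JoinStrategyHint] = None) {
  // Keep whichever defined strategy ranks highest, instead of always
  // preferring the strategy in `this`.
  def merge(other: HintInfo): HintInfo = {
    val candidates = strategy.toSeq ++ other.strategy.toSeq
    HintInfo(candidates.sortBy(_.priority).headOption)
  }
}
```

With this sketch, merging a `MERGE` hint with a `BROADCAST` hint yields `BROADCAST` regardless of which side calls `merge`, whereas the "this if defined, otherwise other" rule in the diff would keep `MERGE` when it is on the left.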