Martin Andersson created SEDONA-217:
---------------------------------------
Summary: Automatically broadcast small datasets
Key: SEDONA-217
URL: https://issues.apache.org/jira/browse/SEDONA-217
Project: Apache Sedona
Issue Type: Improvement
Reporter: Martin Andersson
Spark uses stats for each side of a join and automatically switches to a
broadcast join if size is below threshold. Sedona could do the same.
This is where it's done in Spark.
https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/joins.scala
--
This message was sent by Atlassian Jira
(v8.20.10#820010)