[jira] [Created] (KUDU-2515) Implement Spark join optimization support

Mike Percy (JIRA) Thu, 26 Jul 2018 13:22:31 -0700

Mike Percy created KUDU-2515:
--------------------------------

             Summary: Implement Spark join optimization support
                 Key: KUDU-2515
                 URL: https://issues.apache.org/jira/browse/KUDU-2515
             Project: Kudu
          Issue Type: Improvement
    Affects Versions: 1.7.1
            Reporter: Mike Percy



At the time of writing, Spark is not able to properly optimize joins on Kudu 
tables because Kudu does not provide statistics for Spark to use to determine 
the optimal join strategy.

It would be a big improvement to find some way to help Spark optimize joins 
between Kudu tables or between Kudu tables and Parquet-on-HDFS tables. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Created] (KUDU-2515) Implement Spark join optimization support

Reply via email to