[
https://issues.apache.org/jira/browse/SOLR-8593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15372906#comment-15372906
]
Joel Bernstein commented on SOLR-8593:
--------------------------------------
Hi Julian!
Thanks for the offer to help out. [~risdenk] and I are very interested in using
Calcite to power Solr's Parallel SQL engine so we can use Calcites awesome
optimizer.
Kevin has been doing most of the work on this but I will be helping out more
following the next Solr release.
I think I our biggest struggle has been understanding how to apply the rules
properly to push-down distributed joins and aggregations. Solr supports fast
MapReduce shuffling, distributed joins and also has mature faceting analytics
so we'd like to take advantage of all this power from the SQL interface.
> Integrate Apache Calcite into the SQLHandler
> --------------------------------------------
>
> Key: SOLR-8593
> URL: https://issues.apache.org/jira/browse/SOLR-8593
> Project: Solr
> Issue Type: Improvement
> Reporter: Joel Bernstein
>
> The Presto SQL Parser was perfect for phase one of the SQLHandler. It was
> nicely split off from the larger Presto project and it did everything that
> was needed for the initial implementation.
> Phase two of the SQL work though will require an optimizer. Here is where
> Apache Calcite comes into play. It has a battle tested cost based optimizer
> and has been integrated into Apache Drill and Hive.
> This work can begin in trunk following the 6.0 release. The final query plans
> will continue to be translated to Streaming API objects (TupleStreams), so
> continued work on the JDBC driver should plug in nicely with the Calcite work.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]