[
https://issues.apache.org/jira/browse/DRILL-6755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16630910#comment-16630910
]
ASF GitHub Bot commented on DRILL-6755:
---------------------------------------
Ben-Zvi commented on issue #1480: DRILL-6755: Avoid building Hash Table for
inner/left join when probe side is empty
URL: https://github.com/apache/drill/pull/1480#issuecomment-425207228
Indeed, this PR only addresses the "NONE 1st" case. Handling the "NONE 2nd"
case (a NONE that follows an initial OK_NEW_SCHEMA) would reopen the can of
worms we saw before: sniffing the probe side early kicks off actual execution
there (e.g., another Hash Join below an Exchange) while other parts of the
query are still in schema discovery.
I am not sure that every operator that finds it has no data to return during
schema discovery returns a NONE first. Note that the NONE does carry a schema
(e.g., so a Right Join can return nulls for those missing columns).
Maybe we can increase the impact of this PR by checking all these cases, and
by documenting and testing that our protocol allows a NONE to be returned
first (i.e., during schema discovery).
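A minimal sketch of the "NONE 1st" check discussed above, for illustration
only. The enums and the canSkipBuild helper are hypothetical stand-ins, not
Drill's actual classes or the code in this PR. It assumes the probe side is the
left input, so only inner and left joins are guaranteed to produce no rows when
the probe side is empty.

```java
import java.util.EnumSet;
import java.util.Set;

/** Hypothetical sketch; none of these types are Drill's own. */
final class EmptyProbeSketch {

  /** Simplified stand-in for the outcomes an upstream operator can report. */
  enum Outcome { OK_NEW_SCHEMA, OK, NONE }

  /** Simplified stand-in for the join types mentioned in the comment. */
  enum JoinKind { INNER, LEFT, RIGHT, FULL }

  // Join kinds whose result is empty whenever the probe (left) side is empty.
  private static final Set<JoinKind> EMPTY_WHEN_PROBE_EMPTY =
      EnumSet.of(JoinKind.INNER, JoinKind.LEFT);

  private EmptyProbeSketch() {}

  /**
   * Returns true only for the "NONE 1st" case: the very first outcome seen
   * from the probe side is NONE and the join kind cannot emit build-side rows
   * without a probe match. A NONE that follows OK_NEW_SCHEMA ("NONE 2nd") is
   * deliberately not covered, since detecting it would require fetching
   * another probe batch before the build phase.
   */
  static boolean canSkipBuild(JoinKind kind, Outcome firstProbeOutcome) {
    return firstProbeOutcome == Outcome.NONE
        && EMPTY_WHEN_PROBE_EMPTY.contains(kind);
  }
}
```

Under these assumptions, a Right or Full join never skips the build, because
it still has to return the build-side rows padded with nulls.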
> HashJoin should not build hash tables when probe side is empty.
> ---------------------------------------------------------------
>
> Key: DRILL-6755
> URL: https://issues.apache.org/jira/browse/DRILL-6755
> Project: Apache Drill
> Issue Type: Improvement
> Reporter: Timothy Farkas
> Assignee: Boaz Ben-Zvi
> Priority: Major
>
> Currently, when doing an Inner or a Right join, we still build hash tables
> when the probe side is empty. A performance optimization would be to not
> build them.