osipovartem opened a new pull request, #18589: URL: https://github.com/apache/datafusion/pull/18589
## Which issue does this PR close? Related to https://github.com/apache/datafusion/issues/17267 Maybe some functionality can be removed since it was copied from HashJoin. Also filters are not applied to spilling data yet (we pass them to HashJoin itself, maybe we need to apply them here to reduce spilled data size, the same for projection). ## Rationale for this change This is a basic draft for GraceHashJoin/HybridHashJoin based on current HashJoin implementation https://en.wikipedia.org/wiki/Hash_join#Grace_hash_join ## What changes are included in this PR? Added new GraceHashJoinExec ## Are these changes tested? Simple tests, need much more tests here ## Are there any user-facing changes? No changes. We should add a physical plan optimizer rule to replace HashJoin by GraceHashJoin for inner join type only for now. Also maybe add some config flag to enbale/disable, also flag to set number of partitions -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
