Dandandan commented on a change in pull request #796: URL: https://github.com/apache/arrow-datafusion/pull/796#discussion_r680472226
########## File path: datafusion/src/sql/planner.rs ########## @@ -372,25 +373,91 @@ impl<'a, S: ContextProvider> SqlToRel<'a, S> { // extract join keys extract_join_keys(&expr, &mut keys, &mut filter); + let mut cols = HashSet::new(); + exprlist_to_columns(&filter, &mut cols)?; + let (left_keys, right_keys): (Vec<Column>, Vec<Column>) = keys.into_iter().unzip(); - // return the logical plan representing the join - let join = LogicalPlanBuilder::from(left).join( - right, - join_type, - (left_keys, right_keys), - )?; + // return the logical plan representing the join if filter.is_empty() { + let join = LogicalPlanBuilder::from(left).join( + &right, + join_type, + (left_keys, right_keys), + )?; join.build() } else if join_type == JoinType::Inner { + let join = LogicalPlanBuilder::from(left).join( + &right, + join_type, + (left_keys, right_keys), + )?; join.filter( filter .iter() .skip(1) .fold(filter[0].clone(), |acc, e| acc.and(e.clone())), )? .build() + } + // Left join with all non-equijoin expressions from the right Review comment: If you would push the filters to the left logical plan, it would filter out the left keys. For example: ``` left | right 1 | 2 2 | 4 ``` if we would push the filter `left > 2` to the left side it would filter out that row, but the left join requires to result in all left keys (and only match / not match to values on the right). -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org