Weston Pace created ARROW-16646:
-----------------------------------
Summary: [C++] HashJoin node can crash if a key column is a scalar
Key: ARROW-16646
URL: https://issues.apache.org/jira/browse/ARROW-16646
Project: Apache Arrow
Issue Type: Bug
Components: C++
Reporter: Weston Pace
This only happens when the node has a bloom filter pushed down into it. In
that case it will attempt to hash the key columns in
{{arrow::compute::HashJoinBasicImpl::ApplyBloomFiltersToBatch}} by calling
{{Hashing32::HashBatch}} on a batch made up only of key columns.
If one of those key columns happens to be a scalar, and not an array, then this
method triggers a {{DCHECK}} and crashes.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)