Ruben Q L created CALCITE-4995:
----------------------------------
Summary: ArrayIndexOutOfBoundsException caused by RelFieldTrimmer
on SEMI/ANTI join
Key: CALCITE-4995
URL: https://issues.apache.org/jira/browse/CALCITE-4995
Project: Calcite
Issue Type: Bug
Components: core
Affects Versions: 1.29.0
Reporter: Ruben Q L
Assignee: Ruben Q L
(Unit test to be provided)
It seems {{RelFieldTrimmer}} can cause an {{ArrayIndexOutOfBoundsException}} on
certain plans involving SEMI/ANTI join (i.e. joins that do NOT project the RHS
fields).
The root cause seems to be the "early return" in
{{RelFieldTrimmer#trimFields(Join join, ImmutableBitSet fieldsUsed,
Set<RelDataTypeField> extraFields)}} when nothing has been trimmed inside
join's inputs (so the join itself can be return as it is):
{code:java}
if (changeCount == 0
&& mapping.isIdentity()) {
return result(join, Mappings.createIdentity(fieldCount));
}
{code}
The problem is that this {{fieldCount}} is an addition of LHS + RHS fields (+
system fields); but in case of a SEMI/ANTI the mappings to be returned must not
consider RHS fields (since they are not projected by these join types).
The problem only happens here (when the trimmer does not trim the join). Notice
that, a few lines below, in the "other return scenario" of the method (when
something has been trimmed), there is a special treatment of the mapping for
ANTI/SEMI, so things will work fine in this case:
{code:java}
switch (join.getJoinType()) {
case SEMI:
case ANTI:
// For SemiJoins and AntiJoins only map fields from the left-side
if (join.getJoinType() == JoinRelType.SEMI) {
relBuilder.semiJoin(newConditionExpr);
} else {
relBuilder.antiJoin(newConditionExpr);
}
Mapping inputMapping = inputMappings.get(0);
mapping = Mappings.create(MappingType.INVERSE_SURJECTION,
join.getRowType().getFieldCount(),
newSystemFieldCount + inputMapping.getTargetCount());
for (int i = 0; i < newSystemFieldCount; ++i) {
mapping.set(i, i);
}
offset = systemFieldCount;
newOffset = newSystemFieldCount;
for (IntPair pair : inputMapping) {
mapping.set(pair.source + offset, pair.target + newOffset);
}
break;
default:
relBuilder.join(join.getJoinType(), newConditionExpr);
}
relBuilder.hints(join.getHints());
return result(relBuilder.build(), mapping);
{code}
--
This message was sent by Atlassian Jira
(v8.20.1#820001)