Yi Zhang created HIVE-26447:
-------------------------------
Summary: Vectorization: wrong results when filter on repeating map
key
Key: HIVE-26447
URL: https://issues.apache.org/jira/browse/HIVE-26447
Project: Hive
Issue Type: Bug
Components: Hive
Affects Versions: 3.1.3, 4.0.0
Reporter: Yi Zhang
Assignee: Yi Zhang
Example reproducible case:
set hive.vectorized.execution.enabled=true;
set hive.fetch.task.conversion=none;
create temporary table foo (id int, x map<string,int>) stored as orc;
insert into foo values(1, map('ABC', 9)), (2, map('ABC', 7)), (3, map('ABC',
8)), (4, map('ABC', 9));
select id from foo where x['ABC']=9;
this only gives 1, when correct result should be 1,4
For every VectorizedRowBatch, only the first row is checked.
This seems to be a corner case of ORC table have repeating string type key for
map field in the MapColumnVector.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)