loukey-lj opened a new issue, #37006:
URL: https://github.com/apache/arrow/issues/37006
### Describe the bug, including details regarding any error messages, version, and platform.
1. The Parquet file was created with Apache Hive 2.1.1:
```sql
create table all_f_hive_sn2 stored as PARQUET as
select '22323' as f1, 'abc' as f2, '在在在' as f3, 'abc' as f4, '22323' as f5, 'abc' as f6
union
select '22323' as f1, 'abc' as f2, '在在在' as f3, '22323' as f4, '在在在' as f5, '在在在' as f6
```
2. Queries run through Gluten with Spark 3.2.1:

- `select * from all_f_hive_sn2 where f1 = '22323'`: result is correct.
- `select * from all_f_hive_sn2 where f2 = 'abc'`: result is correct.
- `select * from all_f_hive_sn2 where f3 = '在在在'`: result is correct.
- `select * from all_f_hive_sn2 where f4 = 'abc'`: result is correct.
- `select * from all_f_hive_sn2 where f5 = '22323'`: returns no rows.
- `select * from all_f_hive_sn2 where f6 = 'abc'`: returns no rows.

`000000_0.zip` (attached below) contains the Parquet file backing table `all_f_hive_sn2`.
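
To make the expected results explicit, here is a small sketch that evaluates the six predicates from step 2 against the two rows defined in step 1. It uses plain Python only and does not read the Parquet file; the names `ROWS`, `PREDICATES`, and `expected_row_count` are introduced here for illustration. It assumes the `union` keeps both rows, which holds because they differ in `f4`, `f5`, and `f6`.

```python
# The two rows produced by the CREATE TABLE ... UNION statement in step 1.
ROWS = [
    {"f1": "22323", "f2": "abc", "f3": "在在在",
     "f4": "abc", "f5": "22323", "f6": "abc"},
    {"f1": "22323", "f2": "abc", "f3": "在在在",
     "f4": "22323", "f5": "在在在", "f6": "在在在"},
]

# The six equality predicates from step 2, as (column, literal) pairs.
PREDICATES = [
    ("f1", "22323"),
    ("f2", "abc"),
    ("f3", "在在在"),
    ("f4", "abc"),
    ("f5", "22323"),
    ("f6", "abc"),
]


def expected_row_count(column, literal):
    """Count the rows in ROWS whose `column` equals `literal`."""
    return sum(1 for row in ROWS if row[column] == literal)


# Number of rows a correct engine should return for each predicate.
EXPECTED = {col: expected_row_count(col, lit) for col, lit in PREDICATES}

for column, literal in PREDICATES:
    print(f"{column} = {literal!r}: {EXPECTED[column]} row(s) expected")
```

By this count the `f5` and `f6` queries should each return one row, so an empty result for them is the incorrect behavior being reported.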

3. Since Gluten depends on Arrow, could someone run a Parquet query with the equivalent pushed-down filters (`PushedFilters`) directly through Arrow, to check whether this is an Arrow bug?
[000000_0.zip](https://github.com/apache/arrow/files/12246604/000000_0.zip)
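
One possible way to run the check requested in step 3 is with pyarrow, sketched below. It reads the file once with a filter passed to `pyarrow.parquet.read_table` and once without, then compares the row counts. The helper names `equality_filter` and `compare_counts` are hypothetical, as is the assumption that the attachment unpacks to a local file named `000000_0`. This exercises pyarrow's own filter path, which is not necessarily the code path Gluten uses, so a matching count here would not by itself rule out a problem elsewhere.

```python
def equality_filter(column, literal):
    """Build a one-predicate filter in the list-of-tuples form
    accepted by the `filters` argument of pyarrow.parquet.read_table."""
    return [(column, "==", literal)]


def compare_counts(path, column, literal):
    """Return (filtered_rows, brute_force_rows) for `column == literal`.

    filtered_rows    : rows returned when the predicate is handed to pyarrow
    brute_force_rows : rows that match when the whole file is read and
                       compared in Python, with no filter passed to the reader
    """
    # Imported here so equality_filter can be used without pyarrow installed.
    import pyarrow.parquet as pq

    filtered = pq.read_table(path, filters=equality_filter(column, literal))

    unfiltered = pq.read_table(path)
    values = unfiltered.column(column).to_pylist()
    brute_force = sum(1 for value in values if value == literal)

    return filtered.num_rows, brute_force
```

For example, `compare_counts("000000_0", "f5", "22323")` and `compare_counts("000000_0", "f6", "abc")` cover the two failing queries. If the two numbers in a returned pair differ, the filtered read is dropping rows that are present in the file.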
### Component(s)
Parquet