loukey-lj opened a new issue, #37006:
URL: https://github.com/apache/arrow/issues/37006

   ### Describe the bug, including details regarding any error messages, 
version, and platform.
   
   1、parquet come from
   use apache hive 2.1.1 : 
   ``
   create table all_f_hive_sn2 stored as PARQUET as
   select '22323' as f1, 'abc'  as f2, '在在在'  as f3, 'abc'   as f4, '22323'  as 
f5, 'abc'    as f6
   union 
   select '22323' as f1, 'abc'  as f2, '在在在'  as f3, '22323' as f4, '在在在' as 
f5, '在在在' as f6`
   ``
   
   2、query by gluten with spark 3.2.1
    select * from all_f_hive_sn2  where f1= '22323' , result right;
    select * from all_f_hive_sn2  where f2= 'abc'    , result right;
    select * from all_f_hive_sn2  where f3= '在在在'    , result right;
    select * from all_f_hive_sn2  where f4= 'abc'    , result right;
   
    select * from all_f_hive_sn2  where f5= '22323'    , no result ;
    select * from all_f_hive_sn2  where f6= 'abc'       , no result ;
   
   000000_0.zip is the parquet file stored in table all_f_hive_sn2 ;
   
   3、Since gluten depends on arrow, can someone help build a PushedFilters 
query parquet using arrow? See if it's an arrow bug
   
   [000000_0.zip](https://github.com/apache/arrow/files/12246604/000000_0.zip)
   
   
   
   
   ### Component(s)
   
   Parquet


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to