PengleiShi opened a new issue #1061:
URL: https://github.com/apache/orc/issues/1061


   orc version: 1.6.11, sql: `select xxx from xxx where str is not null`
   
   Recently i found some orc files wrote by presto didn't have complete 
statistics in files meta(maybe a presto bug), this causes 
`OrcProto.ColumnStatistics` can't be deserialized to any specific 
`ColumnStatisticsImpl` such as `StringStatisticsImpl`, then 
`RecordReaderImpl.getValueRange()` returns `ValueRange` with null `lower` and 
`RecordReaderImpl.pickRowGroups()` skips this row group, which should not be 
skipped. In normal conditions except above, everything is ok. And i found 
orc-1.5.x can handle above case according to  `RecordReaderImpl.UNKNOWN_VALUE`, 
 which is removed in 1.6.x. Maybe we could add it back for better 
compatibility. @dongjoon-hyun @omalley 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@orc.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Reply via email to