alamb opened a new issue, #10926:
URL: https://github.com/apache/datafusion/issues/10926

   ### Describe the bug
   
    I noticed this while working on 
https://github.com/apache/datafusion/pull/10852 with @marvinlanhenke
   
   Basially, when generating statistics for a non existent column, the 
StatisticsExtractor will return a null array of the type of the column not a 
UInt64Array
   
   Specifically 
https://github.com/apache/datafusion/blob/2f4347647172f6997448b2e24d322b50c856f3a0/datafusion/core/src/datasource/physical_plan/parquet/statistics.rs#L871-L886
   
   The same problem exists for  `data_page_null_counts` and 
`data_page_row_counts` (not for `row_group_row_counts`
   
   ### To Reproduce
   
   Try to call row_group_null_counts for a column that isn't in the parquet file
   
   ### Expected behavior
   
   1. row_group_null_counts should always return an `UInt64Array` (not an 
`ArrayRef`)
   2. If there is not a column, the UInt64Array should be all nulls
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to