wgtmac commented on issue #46205: URL: https://github.com/apache/arrow/issues/46205#issuecomment-2822845340
The current `parquet::Statistics` implementation is tied to `TypeDefinedOrder`. We will also add an `IEEE754TotalOrder`, as proposed in https://github.com/apache/parquet-format/pull/221. Perhaps we can refactor `parquet::Statistics` to be aware of the column order? For `TypeDefinedOrder` and `IEEE754TotalOrder`, `parquet::Statistics` would collect all fields. For `UndefinedOrder`, the statistics would leave `min` and `max` empty but keep the other fields, including `null_count`. cc @mapleFU
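
To make the idea concrete, here is a rough sketch of what order-aware statistics could look like for a `double` column. Everything in it is hypothetical: `SketchColumnOrder`, `SketchDoubleStatistics`, and `TotalOrderKey` are illustrative names, not the existing parquet-cpp API, and skipping NaN under the type-defined order is an assumption made for this sketch only.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <cstring>
#include <optional>

// Hypothetical names throughout; this is NOT the parquet-cpp API.
enum class SketchColumnOrder { kTypeDefined, kIeee754Total, kUndefined };

// Maps a double to an integer key whose signed ordering follows the
// IEEE 754 total order (-NaN < -inf < ... < -0 < +0 < ... < +inf < +NaN).
inline int64_t TotalOrderKey(double v) {
  int64_t bits;
  std::memcpy(&bits, &v, sizeof(bits));
  // For negative values, flip every bit except the sign bit so that
  // larger magnitudes sort lower.
  const uint64_t mask = bits < 0 ? 0x7FFFFFFFFFFFFFFFULL : 0ULL;
  return bits ^ static_cast<int64_t>(mask);
}

class SketchDoubleStatistics {
 public:
  explicit SketchDoubleStatistics(SketchColumnOrder order) : order_(order) {}

  // null_count is tracked for every column order.
  void UpdateNull() { ++null_count_; }

  void Update(double v) {
    ++num_values_;
    // Undefined order: keep the counts but never produce min/max.
    if (order_ == SketchColumnOrder::kUndefined) return;
    // Assumption for this sketch: NaN is ignored under the type-defined order.
    if (order_ == SketchColumnOrder::kTypeDefined && std::isnan(v)) return;
    if (!min_ || Less(v, *min_)) min_ = v;
    if (!max_ || Less(*max_, v)) max_ = v;
  }

  std::optional<double> min() const { return min_; }
  std::optional<double> max() const { return max_; }
  int64_t null_count() const { return null_count_; }
  int64_t num_values() const { return num_values_; }

 private:
  bool Less(double a, double b) const {
    if (order_ == SketchColumnOrder::kIeee754Total) {
      return TotalOrderKey(a) < TotalOrderKey(b);
    }
    return a < b;
  }

  SketchColumnOrder order_;
  std::optional<double> min_;
  std::optional<double> max_;
  int64_t null_count_ = 0;
  int64_t num_values_ = 0;
};

// Usage: with an undefined order, min/max stay empty but null_count survives.
inline void SketchUsage() {
  SketchDoubleStatistics stats(SketchColumnOrder::kUndefined);
  stats.Update(1.0);
  stats.UpdateNull();
  assert(!stats.min().has_value() && !stats.max().has_value());
  assert(stats.null_count() == 1);
}
```

The point of the sketch is only that the column order becomes an input to the statistics object, so a single class can decide whether and how to compare values.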