wgtmac opened a new pull request, #50157:
URL: https://github.com/apache/arrow/pull/50157

   ### Rationale for this change
   
   Parquet column order defines how min/max statistics should be interpreted. 
If a reader sees an unsupported ColumnOrder, it cannot safely use chunk 
statistics or page index min/max values for that column.
   
   ### What changes are included in this PR?
   
   - Added an internal `ColumnOrder::UNKNOWN` state for unsupported thrift 
column order.
   - Kept missing ColumnOrder as `UNDEFINED`, preserving legacy min/max 
behavior.
   - Ignored chunk-level min/max statistics when column order or sort order is 
unknown.
   - Ignored page indexes when column order is missing/unsupported or sort 
order is unknown.
   
   ### Are these changes tested?
   
   Added regression tests for unsupported column order, missing column order, 
and page index guards.
   
   ### Are there any user-facing changes?
   
   No.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to