emcake opened a new pull request, #5076:
URL: https://github.com/apache/arrow-rs/pull/5076

   # Which issue does this PR close?
   
   Closes #5037.
   
   # Rationale for this change
    
   Similar to parquet-mr (https://github.com/apache/parquet-mr/pull/696), we 
allow truncation of statistics for binary and fixed-length binary columns.
   
   # What changes are included in this PR?
   
   `7b37fd4` introduces the min/max exactness parameters, parses them for the 
various statistics types, and ensures that they round-trip.
   
   `e57634d` creates a new writer property and implements the truncation. It is 
tested for both strings and decimals; in the decimal case, we ensure that min 
and max decimals reconstructed at the correct byte length still bound the true 
value.
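
To illustrate the idea behind truncating binary statistics, here is a minimal sketch, not the code from this PR. The function names `truncate_min` and `truncate_max` are hypothetical, and the sketch assumes values are compared as raw bytes in unsigned lexicographic order. A truncated lower bound is just a prefix. A truncated upper bound must be incremented so that it still sorts at or after the original value. Bounds shortened this way would presumably be the ones flagged as inexact. UTF-8 strings (character boundaries) and decimals (fixed byte length, sign) need extra care that is not shown here.

```rust
/// Hypothetical helper: truncate a lower bound to at most `len` bytes.
/// Any prefix of a byte string sorts at or before the full string,
/// so the prefix is still a valid minimum.
fn truncate_min(value: &[u8], len: usize) -> Vec<u8> {
    value[..value.len().min(len)].to_vec()
}

/// Hypothetical helper: truncate an upper bound to at most `len` bytes.
/// A plain prefix would sort before the original, so the prefix is
/// incremented. Returns `None` when no shorter upper bound exists
/// (every byte of the prefix is 0xFF).
fn truncate_max(value: &[u8], len: usize) -> Option<Vec<u8>> {
    if value.len() <= len {
        // Nothing to truncate; the bound stays exact.
        return Some(value.to_vec());
    }
    let mut out = value[..len].to_vec();
    // Drop trailing 0xFF bytes (they cannot be incremented), then bump
    // the last remaining byte by one.
    while let Some(&last) = out.last() {
        if last == 0xFF {
            out.pop();
        } else {
            *out.last_mut().unwrap() += 1;
            return Some(out);
        }
    }
    None
}
```

In this sketch, a caller that receives `None` from `truncate_max` would have to fall back to the untruncated value or omit the maximum.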
   
   # Are there any user-facing changes?
   
   Adds new functionality to set the truncation length; there are no breaking 
changes.
   

