alamb commented on issue #6867:
URL: https://github.com/apache/arrow-rs/issues/6867#issuecomment-2538926316

   > We could do the same in parquet-rs if we were willing to have a truncated 
max statistic that's 1 byte larger than requested (which seems ok to me as long 
as it's communicated that the truncation is a best effort, just like with page 
and row group sizes).
   
   This seems fine with me
   
   Another approach might be to fall back to one fewer character if 
incrementing the truncated character at `T` bytes would push the result past 
`T` bytes due to promotion to a wider UTF-8 encoding.
   
   maybe something like
   
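   ```rust
   // Try the full truncated prefix first; if incrementing pushes it past
   // `max_len` bytes, drop the last character and increment the shorter prefix.
   match increment_utf8(input) {
     Some(incremented) if incremented.len() > max_len => {
       increment_utf8(remove_last_char(input))
     }
     other => other,
   }
   ```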
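
   To make the fallback concrete, here is a minimal self-contained sketch. The
   names `increment_last_char`, `remove_last_char`, and `truncated_upper_bound`
   are hypothetical and are not the existing parquet-rs helpers; the sketch
   also assumes `input` has already been truncated on a character boundary.

   ```rust
   /// Hypothetical helper: replace the last character of `s` with the next
   /// Unicode scalar value. Returns `None` if `s` is empty or ends in `char::MAX`.
   fn increment_last_char(s: &str) -> Option<String> {
       let last = s.chars().next_back()?;
       let prefix = &s[..s.len() - last.len_utf8()];
       let next = match last {
           // Skip the surrogate range, which holds no valid scalar values.
           '\u{D7FF}' => '\u{E000}',
           c => char::from_u32(c as u32 + 1)?,
       };
       let mut out = String::with_capacity(prefix.len() + next.len_utf8());
       out.push_str(prefix);
       out.push(next);
       Some(out)
   }

   /// Hypothetical helper: drop the final character (not byte) of `s`.
   fn remove_last_char(s: &str) -> &str {
       match s.char_indices().next_back() {
           Some((idx, _)) => &s[..idx],
           None => s,
       }
   }

   /// Sketch of the fallback: increment the last character, and whenever the
   /// result would exceed `max_len` bytes (or cannot be incremented at all),
   /// retry with one fewer character.
   fn truncated_upper_bound(input: &str, max_len: usize) -> Option<String> {
       let mut candidate = input;
       while !candidate.is_empty() {
           if let Some(incremented) = increment_last_char(candidate) {
               if incremented.len() <= max_len {
                   return Some(incremented);
               }
           }
           candidate = remove_last_char(candidate);
       }
       None
   }
   ```

   For example, with `max_len = 2` the input `"a\u{7F}"` cannot simply be
   incremented, because `U+0080` needs two bytes and the result would be three
   bytes long. The sketch drops the last character and returns `"b"`, which
   still sorts after every string that starts with `"a\u{7F}"`.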

