alamb commented on issue #6867:
URL: https://github.com/apache/arrow-rs/issues/6867#issuecomment-2538926316
> We could do the same in parquet-rs if we were willing to have a truncated
max statistic that's 1 byte larger than requested (which seems ok to me as long
as it's communicated that the truncation is a best effort, just like with page
and row group sizes).
This seems fine with me
Another approach might be to fallback to a one fewer characters if
incrementing the truncated character at `T` bytes would increase past `T` due
to promotion.
maybe something like
```rust
if let Some(incremented) = increment_utf8(input) {
if incremented.len() > max_len {
increment_utf8(remove_last_char(input))
}
}
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]