HawaiianSpork opened a new pull request, #6805:
URL: https://github.com/apache/arrow-rs/pull/6805
When encoding offsets the code had an optimization to reuse the offsets if
the first offset was zero assuming the slice already pointed to first element.
But the offset can also be zero if all previous lists were empty. When this
occured it mold make all lists in the slice as empty, even if they shouldn't be.
# Which issue does this PR close?
Closes #6803 .
# Rationale for this change
Fixing a bug that can lead to loss of information when encoding record
batches.
# What changes are included in this PR?
- Change the encoding and a test to demonstrate the fix.
# Are there any user-facing changes?
No, though some clients will experience smaller encoded messages since only
the relevant slice will be sent.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]