Dandandan opened a new issue, #9581:
URL: https://github.com/apache/arrow-rs/issues/9581

   ## Description
   
   The RLE dictionary decoding path uses if/else branching to select between 
checked and unchecked indexing. This can be replaced with a single branchless 
`.min(max_idx)` clamp that:
   - Prevents UB on corrupt parquet files (indices clamped to valid range)
   - Removes the if/else branch, simplifying codegen
   - Improves i32 dict perf by ~13%
   
   Additionally, adding a `get_batch_direct` method to `RleDecoder` that 
exposes RLE vs bit-packed batches via a callback allows callers to handle each 
case optimally (e.g., using `repeat_n` for RLE runs).
   
   🤖 Generated with [Claude Code](https://claude.com/claude-code)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to