alamb opened a new pull request, #8159:
URL: https://github.com/apache/arrow-rs/pull/8159

   # Which issue does this PR close?
   
   - Part of https://github.com/apache/arrow-rs/issues/7983
   
   
   
   # Rationale for this change
   
   I am refactoring the control logic of the Parquet reader so that the async 
reader and the sync reader no longer have separate code paths. The push decoder 
is in:
   - https://github.com/apache/arrow-rs/pull/7997
   
   I need to refactor the async and sync readers to use the new push decoder in 
order to:
   
   1. Avoid the [xkcd standards effect](https://xkcd.com/927/) (aka three 
decoders)
   2. Prove that the push decoder works (by passing all the tests of the other 
two)
   
   <img width="400" alt="image" 
src="https://github.com/user-attachments/assets/e6886ee9-58b3-4a1e-8e88-9d2d03132b19" 
/>
   
   To keep the review manageable, this PR contains the refactor of the async 
Parquet Arrow reader builder, stacked on top of 
https://github.com/apache/arrow-rs/pull/7997
   
   
   
   # What changes are included in this PR?
   
   1. Refactor the `ParquetRecordBatchStream` to use `ParquetPushDecoder`
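
For readers unfamiliar with the push style, here is a minimal, self-contained sketch of the general idea. It is not the arrow-rs API: `ToyPushDecoder`, `DecodeStep`, and `read_all_sync` are hypothetical names invented for illustration, and the "decoding" is just summing bytes. It shows a decoder that does no IO itself and only reports which byte ranges it needs, so one decoder could sit under both a sync and an async driver.

```rust
use std::collections::VecDeque;
use std::ops::Range;

/// Hypothetical outcome of asking the decoder to make progress.
#[derive(Debug, PartialEq)]
enum DecodeStep {
    /// The decoder cannot proceed until these byte ranges are pushed.
    NeedsData(Vec<Range<usize>>),
    /// One decoded "batch" (in this toy: the sum of the chunk's bytes).
    Batch(u64),
    /// Every chunk has been decoded.
    Finished,
}

/// Toy push decoder: it performs no IO and only tracks what it needs next.
struct ToyPushDecoder {
    /// Chunks that still have to be decoded, in order.
    pending: VecDeque<Range<usize>>,
    /// Data most recently pushed by the caller, tagged with its range.
    buffered: Option<(Range<usize>, Vec<u8>)>,
}

impl ToyPushDecoder {
    fn new(chunks: Vec<Range<usize>>) -> Self {
        Self { pending: chunks.into(), buffered: None }
    }

    /// The caller hands over bytes it fetched for `range`.
    fn push_range(&mut self, range: Range<usize>, data: Vec<u8>) {
        self.buffered = Some((range, data));
    }

    /// Decode the next chunk if its data is available, else request it.
    fn try_decode(&mut self) -> DecodeStep {
        let Some(next) = self.pending.front().cloned() else {
            return DecodeStep::Finished;
        };
        match self.buffered.take() {
            // The buffered data matches the chunk we are waiting for.
            Some((range, data)) if range == next => {
                self.pending.pop_front();
                DecodeStep::Batch(data.iter().map(|&b| u64::from(b)).sum())
            }
            // Nothing usable is buffered: ask the caller for the next chunk.
            _ => DecodeStep::NeedsData(vec![next]),
        }
    }
}

/// Synchronous driver: the "IO" is a plain slice read. An async driver would
/// run the same loop but await its fetch before calling `push_range`.
fn read_all_sync(file: &[u8], chunks: Vec<Range<usize>>) -> Vec<u64> {
    let mut decoder = ToyPushDecoder::new(chunks);
    let mut batches = Vec::new();
    loop {
        match decoder.try_decode() {
            DecodeStep::NeedsData(ranges) => {
                // This toy decoder only ever requests one range at a time.
                for r in ranges {
                    let data = file[r.clone()].to_vec();
                    decoder.push_range(r, data);
                }
            }
            DecodeStep::Batch(b) => batches.push(b),
            DecodeStep::Finished => return batches,
        }
    }
}
```

In this sketch only the driver loop knows how bytes are obtained, which is why a single decoder can serve both readers.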
   
   # Are these changes tested?
   Yes, by the existing CI tests
   
   # Are there any user-facing changes?
   
   No

