westonpace opened a new pull request, #35889:
URL: https://github.com/apache/arrow/pull/35889

   ### Rationale for this change
   
   This ended up being considerably more change than just connecting parquet to 
the new scan node.  In order to do this I had to refactor the scan node itself 
somewhat.  It introduces the concept of scan tasks (or maybe scan streams would 
be a better name) to help clarify the concept of a row group (which I didn't 
have to worry about with CSV).  I also introduced the staging area which is a 
slightly different approach to sequencing that I think will be much simpler.
   
   ### What changes are included in this PR?
   
   The new scan node now supports the parquet format.
   
   ### Are these changes tested?
   
   Yes
   
   ### Are there any user-facing changes?
   
   There are breaking changes to the scan2 node but this feature hasn't really 
been released yet.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to