westonpace opened a new pull request, #35889: URL: https://github.com/apache/arrow/pull/35889
### Rationale for this change This ended up being considerably more change than just connecting parquet to the new scan node. In order to do this I had to refactor the scan node itself somewhat. It introduces the concept of scan tasks (or maybe scan streams would be a better name) to help clarify the concept of a row group (which I didn't have to worry about with CSV). I also introduced the staging area which is a slightly different approach to sequencing that I think will be much simpler. ### What changes are included in this PR? The new scan node now supports the parquet format. ### Are these changes tested? Yes ### Are there any user-facing changes? There are breaking changes to the scan2 node but this feature hasn't really been released yet. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
