[ https://issues.apache.org/jira/browse/ARROW-11797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
David Li resolved ARROW-11797. ------------------------------ Fix Version/s: (was: 5.0.0) 4.0.0 Resolution: Fixed Issue resolved by pull request 9589 [https://github.com/apache/arrow/pull/9589] > [C++][Dataset] Provide Scanner methods to yield/visit scanned batches > --------------------------------------------------------------------- > > Key: ARROW-11797 > URL: https://issues.apache.org/jira/browse/ARROW-11797 > Project: Apache Arrow > Issue Type: Improvement > Components: C++ > Affects Versions: 3.0.0 > Reporter: Ben Kietzman > Assignee: Ben Kietzman > Priority: Major > Labels: dataset, pull-request-available > Fix For: 4.0.0 > > Time Spent: 4.5h > Remaining Estimate: 0h > > From discussion in https://issues.apache.org/jira/browse/ARROW-11782 > It'd be useful to consumers of Scanner to receive an iterator of scanned > record batches or apply a visitor to batches as they are scanned without > handling ScanTasks. For example, this could enable aggregations or other > computations which don't require the entire table to be materialized. -- This message was sent by Atlassian Jira (v8.3.4#803005)