binmahone opened a new issue, #38865:
URL: https://github.com/apache/arrow/issues/38865
### Describe the enhancement requested
Currently GetRecordBatchReader API accepts row_group_indices and
column_indices. It would be nice to extend the API to accept one more
parameter: A row_ranges indicating a subset of rows to be retrieved. With the
provided row_ranges, RecordBatchReader can skip unnecessary pages (by comparing
the row_ranges with the might-exist page index) as well as unwanted rows.
```c++
::arrow::Status GetRecordBatchReader(const std::vector<int>&
row_group_indices,
const std::vector<int>&
column_indices,
std::shared_ptr<::arrow::RecordBatchReader>* out);
```
API clients can query page index or other kinds of index (e.g. external
secondary index) to construct the row_ranges.
### Component(s)
C++
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]