binmahone opened a new issue, #38865:
URL: https://github.com/apache/arrow/issues/38865

   ### Describe the enhancement requested
   
   Currently GetRecordBatchReader API accepts row_group_indices and 
column_indices. It would be nice to extend the API to accept one more 
parameter: A row_ranges indicating a subset of rows to be retrieved.  With the 
provided row_ranges, RecordBatchReader can skip unnecessary pages (by comparing 
the row_ranges with the might-exist page index) as well as unwanted rows.
   
   ```c++
     ::arrow::Status GetRecordBatchReader(const std::vector<int>& 
row_group_indices,
                                          const std::vector<int>& 
column_indices,
                                          
std::shared_ptr<::arrow::RecordBatchReader>* out);
   ``` 
   
   API clients can query page index or other kinds of index (e.g. external 
secondary index) to construct the row_ranges.
   
   ### Component(s)
   
   C++


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to