[ 
https://issues.apache.org/jira/browse/ARROW-8391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17080601#comment-17080601
 ] 

Radu Teodorescu commented on ARROW-8391:
----------------------------------------

Fair point

I guess I was doing two things at once:
 # Trying to understand the scope of the ticket - based on your comments it 
sound like the preferred approach would be something along the lines of binary 
searching the recordBatches from the file footer.
 # Exploring the notion of adding broader support for efficiently identifying a 
row range within the arrow API in general (as in ChunkedArray and beyond) - 
this might fall out of the scope of this ticket but I am intrigued wrt the 
current state of the conversation on the topic. 

> [C++] Implement row range read API for IPC file (and Feather)
> -------------------------------------------------------------
>
>                 Key: ARROW-8391
>                 URL: https://issues.apache.org/jira/browse/ARROW-8391
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Wes McKinney
>            Priority: Major
>
> The objective would be able to read a range of rows from the middle of a 
> file. It's not as easy as it might sound since all the record batch metadata 
> must be examined to determine the start and end point of the row range



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to