Weston Pace created ARROW-15589:
-----------------------------------

             Summary: [C++] Add support for sliced Substrait reads
                 Key: ARROW-15589
                 URL: https://issues.apache.org/jira/browse/ARROW-15589
             Project: Apache Arrow
          Issue Type: Improvement
          Components: C++
            Reporter: Weston Pace


The Substrait format allows for "sliced reads", which read only part of a
file and would most likely be used if a read operation were distributed across
multiple files.

For each file, a start byte and a length are specified.  For files that contain
indivisible "groups" (e.g. Parquet row groups), the reader has to pick some
heuristic to decide which groups belong to a slice.  For example, read all row
groups whose midpoint falls within the requested byte interval.
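
A minimal sketch of the midpoint heuristic, to make the idea concrete.  The
names GroupExtent and SelectGroupsByMidpoint are hypothetical and are not part
of Arrow or Substrait, and the sketch assumes the slice is the half-open byte
interval [start, start + length), which the description above does not
specify.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical description of one indivisible group (e.g. a Parquet row
// group). Not an Arrow or Parquet type.
struct GroupExtent {
  int64_t offset;  // byte offset of the group's first byte within the file
  int64_t length;  // size of the group in bytes
};

// Returns the indices of the groups whose midpoint lies in the byte interval
// [start, start + length). Treating the interval as half-open is an
// assumption made for this sketch.
std::vector<int> SelectGroupsByMidpoint(const std::vector<GroupExtent>& groups,
                                        int64_t start, int64_t length) {
  std::vector<int> selected;
  const int64_t end = start + length;
  for (int i = 0; i < static_cast<int>(groups.size()); ++i) {
    const int64_t midpoint = groups[i].offset + groups[i].length / 2;
    if (midpoint >= start && midpoint < end) {
      selected.push_back(i);
    }
  }
  return selected;
}
```

Under that assumption, slices that tile the file without gaps or overlap
assign every group to exactly one slice, because each group has a single
midpoint and that midpoint falls in exactly one interval.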



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
