Weston Pace created ARROW-14357:
-----------------------------------

             Summary: [C++] Improve array size estimation to account for shared 
buffers
                 Key: ARROW-14357
                 URL: https://issues.apache.org/jira/browse/ARROW-14357
             Project: Apache Arrow
          Issue Type: Sub-task
          Components: C++
            Reporter: Weston Pace


Overlapping buffers could be detected using some kind of sorted list of ranges 
and then detecting and subtracting overlaps.  This could provide a more 
accurate size estimation when tables or record batches share the same buffers.

This should be controlled by an option as sometimes it may be important to know 
how much space in memory a table is occupying and somehow it is more important 
to instead know how much data a table represents (e.g. the amount of CPU work 
necessary to process a table is going to depend on the latter).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to