Jacek Pliszka created ARROW-12637:
-------------------------------------

             Summary: Range type (similar to Pandas intervals)
                 Key: ARROW-12637
                 URL: https://issues.apache.org/jira/browse/ARROW-12637
             Project: Apache Arrow
          Issue Type: Wish
          Components: C, C++
            Reporter: Jacek Pliszka


This is wish for range(interval) extension type.

 

Type would describe range/interval with defined start and end and would have 
metadata

whether range is open/close/left-open/right-open.

 

Considerations:
 * should it focus on integers and like or on floats as well?
 * are open/semi-open cases worth the effort? Maybe not.

 

The most useful case might be closed range of 32bit/64bit and maybe 8bit/16bit 
unsigned integers packed in twice larger integer.

 

This would allow computation kernels like:

 
 * overlap joins (see R foverlaps)
 * range algebra: unions, intersections, differences
 * classification in Allen's interval algebra
 * conversion to/from 2 columns/rows
 * exploding (see Pandas explode)

 

 

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to