[
https://issues.apache.org/jira/browse/ARROW-4757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rok Mihevc updated ARROW-4757:
------------------------------
External issue URL: https://github.com/apache/arrow/issues/21276
> [C++] Nested chunked array support
> ----------------------------------
>
> Key: ARROW-4757
> URL: https://issues.apache.org/jira/browse/ARROW-4757
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Reporter: Philipp Moritz
> Priority: Major
>
> Dear all,
> I'm currently trying to lift the 2GB limit on the python serialization. For
> this, I implemented a chunked union builder to split the array into smaller
> arrays.
> However, some of the children of the union array can be ListArrays, which can
> themselves contain UnionArrays which can contain ListArrays etc. I'm at a bit
> of a loss how to handle this. In principle I'd like to chunk the children
> too. However, currently UnionArrays can only have children of type Array, and
> there is no way to treat a chunked array (which is a vector of Arrays) as an
> Array to store it as a child of a UnionArray. Any ideas how to best support
> this use case?
> -- Philipp.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)