[ https://issues.apache.org/jira/browse/ARROW-4083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Wes McKinney closed ARROW-4083. ------------------------------- Resolution: Won't Fix I will take care of this elsewhere when it is actually needed > [C++] Allowing ChunkedArrays to contain a mix of DictionaryArray and dense > Array (of the dictionary type) > --------------------------------------------------------------------------------------------------------- > > Key: ARROW-4083 > URL: https://issues.apache.org/jira/browse/ARROW-4083 > Project: Apache Arrow > Issue Type: Improvement > Components: C++ > Reporter: Wes McKinney > Priority: Major > Labels: dataframe > > In some applications we may receive a stream of some dictionary encoded data > followed by some non-dictionary encoded data. For example this happens in > Parquet files when the dictionary reaches a certain configurable size > threshold. > We should think about how we can model this in our in-memory data structures, > and how it can flow through to relevant computational components (i.e. > certain data flow observers -- like an Aggregation -- might need to be able > to process either a dense or dictionary encoded version of a particular array > in the same stream) -- This message was sent by Atlassian Jira (v8.3.2#803003)