[
https://issues.apache.org/jira/browse/ARROW-692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16048150#comment-16048150
]
Bryan Cutler commented on ARROW-692:
------------------------------------
Thanks Wes, that will help things on the Java side. Currently, Java also
stores the dictionaries in a structure similar to DictionaryMemo, called
"DictionaryProvider". This is an interface that is implemented by the
ArrowFileReader and ArrowJsonReader, so these end up storing the dictionaries
and allow looking one up by its id. The Schema is modified after the fact to
change the type of the fields between the index type and dictionary type. So
there is a small difference, but I don't know if it would have any impact on
usage. I'll have to look into it more to see about support for nested
subfields.
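
To make the lookup-by-id idea concrete, here is a minimal sketch in plain Java. It is not the actual Arrow code: SketchDictionary, SketchDictionaryProvider and MapSketchProvider are hypothetical names made up for illustration, and the dictionary values are simplified to strings. It only shows the general shape of a reader-side store that resolves a dictionary from its id and uses it to decode index values.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical, simplified stand-in for a dictionary: an id plus its values.
final class SketchDictionary {
    final long id;
    final List<String> values;

    SketchDictionary(long id, List<String> values) {
        this.id = id;
        this.values = values;
    }
}

// Hypothetical provider interface: resolve a dictionary from its id.
interface SketchDictionaryProvider {
    SketchDictionary lookup(long id);
}

// Map-backed provider, as a reader might fill in while it reads dictionaries.
final class MapSketchProvider implements SketchDictionaryProvider {
    private final Map<Long, SketchDictionary> byId = new HashMap<>();

    void put(SketchDictionary dictionary) {
        byId.put(dictionary.id, dictionary);
    }

    @Override
    public SketchDictionary lookup(long id) {
        return byId.get(id); // null when no dictionary has this id
    }

    // Replace each index with the dictionary value it points to.
    static List<String> decode(SketchDictionaryProvider provider, long id, int[] indices) {
        SketchDictionary dictionary = provider.lookup(id);
        if (dictionary == null) {
            throw new IllegalArgumentException("No dictionary with id " + id);
        }
        List<String> decoded = new ArrayList<>();
        for (int index : indices) {
            decoded.add(dictionary.values.get(index));
        }
        return decoded;
    }
}
```

In this sketch the field only needs to carry the dictionary id; the provider supplies the values when the indices are decoded.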
> Java<->C++ Integration tests for dictionary-encoded vectors
> -----------------------------------------------------------
>
> Key: ARROW-692
> URL: https://issues.apache.org/jira/browse/ARROW-692
> Project: Apache Arrow
> Issue Type: New Feature
> Components: C++, Java - Vectors
> Reporter: Wes McKinney
>