[ 
https://issues.apache.org/jira/browse/ARROW-692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16048150#comment-16048150
 ] 

Bryan Cutler commented on ARROW-692:
------------------------------------

Thanks Wes, that will help things on the Java side.  Currently, Java also 
stores the dictionaries in a similar DictionaryMemo called 
"DictionaryProvider".  This is an interface that is implemented by the 
ArrowFileReader and ArrowJsonReader, so these end up storing the dictionaries 
and allow to lookup one from a given id.  The Schema is modified post-facto to 
change the type of the fields between the index type and dictionary type.  So 
there is a little difference, but I don't know if there would be any impact to 
usage.  I'll have to look into it more to see about support for nested 
subfields.

> Java<->C++ Integration tests for dictionary-encoded vectors
> -----------------------------------------------------------
>
>                 Key: ARROW-692
>                 URL: https://issues.apache.org/jira/browse/ARROW-692
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++, Java - Vectors
>            Reporter: Wes McKinney
>




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to