[ 
https://issues.apache.org/jira/browse/ARROW-692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16046830#comment-16046830
 ] 

Bryan Cutler edited comment on ARROW-692 at 6/12/17 5:46 PM:
-------------------------------------------------------------

[~wesmckinn] By dictionaries, you are asking about the dictionary record 
batches right?  Adding the dictionaries to the {{schema}} object would make 
things more difficult on the Java side.  Right now the {{Schema}} is written 
and read back as a single object ({{generator.writeObjectField("schema", 
schema);}}) with the class members that go into JSON having an annotation that 
lets the generator/parser know.  So to put the dictionaries into the Schema, I 
would need to either re-write the JSON parts to not use annotations or add 
another member in the {{Schema}} class to hold the dictionaries - but that 
doesn't seem right.  There might be someway to manipulate the built-in parser 
to do this, but I'm not sure.  Would it be too difficult to keep the format 
above with dictionaries on the same level as the schema?


was (Author: bryanc):
[~wesmckinn]] By dictionaries, you are asking about the dictionary record 
batches right?  Adding the dictionaries to the {{schema}} object would make 
things more difficult on the Java side.  Right now the {{Schema}} is written 
and read back as a single object ({{generator.writeObjectField("schema", 
schema);}}) with the class members that go into JSON having an annotation that 
lets the generator/parser know.  So to put the dictionaries into the Schema, I 
would need to either re-write the JSON parts to not use annotations or add 
another member in the {{Schema}} class to hold the dictionaries - but that 
doesn't seem right.  There might be someway to manipulate the built-in parser 
to do this, but I'm not sure.  Would it be too difficult to keep the format 
above with dictionaries on the same level as the schema?

> Java<->C++ Integration tests for dictionary-encoded vectors
> -----------------------------------------------------------
>
>                 Key: ARROW-692
>                 URL: https://issues.apache.org/jira/browse/ARROW-692
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++, Java - Vectors
>            Reporter: Wes McKinney
>




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to