[
https://issues.apache.org/jira/browse/ARROW-692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16046830#comment-16046830
]
Bryan Cutler edited comment on ARROW-692 at 6/12/17 5:46 PM:
-------------------------------------------------------------
[~wesmckinn] By dictionaries, you are asking about the dictionary record
batches right? Adding the dictionaries to the {{schema}} object would make
things more difficult on the Java side. Right now the {{Schema}} is written
and read back as a single object ({{generator.writeObjectField("schema",
schema);}}) with the class members that go into JSON having an annotation that
lets the generator/parser know. So to put the dictionaries into the Schema, I
would need to either re-write the JSON parts to not use annotations or add
another member in the {{Schema}} class to hold the dictionaries - but that
doesn't seem right. There might be someway to manipulate the built-in parser
to do this, but I'm not sure. Would it be too difficult to keep the format
above with dictionaries on the same level as the schema?
was (Author: bryanc):
[~wesmckinn]] By dictionaries, you are asking about the dictionary record
batches right? Adding the dictionaries to the {{schema}} object would make
things more difficult on the Java side. Right now the {{Schema}} is written
and read back as a single object ({{generator.writeObjectField("schema",
schema);}}) with the class members that go into JSON having an annotation that
lets the generator/parser know. So to put the dictionaries into the Schema, I
would need to either re-write the JSON parts to not use annotations or add
another member in the {{Schema}} class to hold the dictionaries - but that
doesn't seem right. There might be someway to manipulate the built-in parser
to do this, but I'm not sure. Would it be too difficult to keep the format
above with dictionaries on the same level as the schema?
> Java<->C++ Integration tests for dictionary-encoded vectors
> -----------------------------------------------------------
>
> Key: ARROW-692
> URL: https://issues.apache.org/jira/browse/ARROW-692
> Project: Apache Arrow
> Issue Type: New Feature
> Components: C++, Java - Vectors
> Reporter: Wes McKinney
>
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)