[ https://issues.apache.org/jira/browse/ARROW-542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15858295#comment-15858295 ]

Emilio Lahr-Vivaz commented on ARROW-542:
-----------------------------------------
[~wesmckinn] I'm looking into how dictionary vectors will be encoded in the
file format. In the current message definitions, it appears dictionary batches
are distinct from regular batches, and have an ID associated with them:
https://github.com/apache/arrow/blob/b99d049c3d1894908b7e52774eb657675dc1f439/format/Message.fbs#L284
Wouldn't the dictionary already be defined by the Field? I'm unclear on what
the ID in the DictionaryBatch is supposed to represent.
Thanks,
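
To make the question concrete, here is a minimal sketch of one possible reading: the Field carries only a dictionary ID, the dictionary values arrive separately in a DictionaryBatch tagged with the same ID, and a reader joins the two by that ID. This is an illustration under assumptions, not the Arrow Java API. Every class, field, and method name below (DictionaryIdSketch, Field, DictionaryBatch, accept, decode) is hypothetical, and the dictionary values are simplified to strings.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch, not Arrow code: shows how an ID could link a schema
// field to a dictionary that is transmitted in a separate batch.
class DictionaryIdSketch {

    // Hypothetical stand-in for a schema field that names its dictionary by ID.
    static final class Field {
        final String name;
        final long dictionaryId;

        Field(String name, long dictionaryId) {
            this.name = name;
            this.dictionaryId = dictionaryId;
        }
    }

    // Hypothetical stand-in for a dictionary batch: an ID plus the values.
    static final class DictionaryBatch {
        final long id;
        final List<String> values;

        DictionaryBatch(long id, List<String> values) {
            this.id = id;
            this.values = values;
        }
    }

    // Dictionaries seen so far, keyed by the ID carried in each batch.
    private final Map<Long, List<String>> dictionaries = new HashMap<>();

    // Register a dictionary batch under its ID.
    void accept(DictionaryBatch batch) {
        dictionaries.put(batch.id, batch.values);
    }

    // Resolve encoded indices for a field by looking up its dictionary ID.
    List<String> decode(Field field, int[] indices) {
        List<String> dictionary = dictionaries.get(field.dictionaryId);
        if (dictionary == null) {
            throw new IllegalStateException(
                "No dictionary batch seen for ID " + field.dictionaryId);
        }
        List<String> decoded = new ArrayList<>();
        for (int index : indices) {
            decoded.add(dictionary.get(index));
        }
        return decoded;
    }

    public static void main(String[] args) {
        DictionaryIdSketch reader = new DictionaryIdSketch();
        List<String> colors = new ArrayList<>();
        colors.add("red");
        colors.add("green");
        reader.accept(new DictionaryBatch(0L, colors));

        Field field = new Field("color", 0L);
        System.out.println(reader.decode(field, new int[] {0, 1, 0}));
    }
}
```

Under this reading, two fields could share one dictionary by carrying the same ID, which would be one reason to keep the ID on the batch rather than tying the dictionary to a single Field. Whether that matches the intent of the format is exactly the open question here.
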
> [Java] Implement dictionaries in stream/file encoding
> -----------------------------------------------------
>
> Key: ARROW-542
> URL: https://issues.apache.org/jira/browse/ARROW-542
> Project: Apache Arrow
> Issue Type: Improvement
> Components: Java - Vectors
> Reporter: Emilio Lahr-Vivaz
> Assignee: Emilio Lahr-Vivaz
>