[
https://issues.apache.org/jira/browse/HIVE-20932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16692549#comment-16692549
]
slim bouguerra edited comment on HIVE-20932 at 11/20/18 2:13 AM:
-----------------------------------------------------------------
[~t3rmin4t0r] and [~teddy.choi] the most important classes and methods to look at are:
druid-handler/src/java/org/apache/hadoop/hive/druid/io/DruidVectorizedWrapper.java
org.apache.hadoop.hive.druid.serde.DruidSerDe#deserializeAsPrimitive
org.apache.hadoop.hive.druid.io.DruidQueryBasedInputFormat#getRecordReader
All the rest is code style cleanup.
was (Author: bslim):
[~t3rmin4t0r] and [~teddy.choi] the most important class to look at
druid-handler/src/java/org/apache/hadoop/hive/druid/io/DruidVectorizedWrapper.java
org.apache.hadoop.hive.druid.serde.DruidSerDe#deserializeAsPrimitive
org.apache.hadoop.hive.druid.io.DruidQueryBasedInputFormat#getRecordReader
> Vectorize Druid Storage Handler Reader
> --------------------------------------
>
> Key: HIVE-20932
> URL: https://issues.apache.org/jira/browse/HIVE-20932
> Project: Hive
> Issue Type: Improvement
> Reporter: slim bouguerra
> Assignee: slim bouguerra
> Priority: Major
> Attachments: HIVE-20932.3.patch, HIVE-20932.4.patch, HIVE-20932.patch
>
>
> This patch aims to add support for vectorized reads of data from Druid into
> Hive.
> [~t3rmin4t0r] suggested that this will improve the performance of the
> top-level operators that support vectorization.
> As a first cut, I am just adding a wrapper around the existing record reader
> to read up to 1024 rows at a time.
> Future work will be to avoid going through the old reader and to convert the
> JSON (Smile format) straight to vector primitive types.
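
The wrapper idea described above can be illustrated with a minimal sketch. It is not the code from the patch: BatchingReader, LongBatch and the Iterator<Long> standing in for the existing record reader are hypothetical names, and a single long column stands in for Hive's real batch type. It only shows the general pattern of draining a row-at-a-time reader into a columnar batch of at most 1024 rows.

```java
import java.util.Iterator;

/** Hypothetical sketch: wraps a row-at-a-time source and fills fixed-size columnar batches. */
final class BatchingReader {
    /** Maximum rows per batch, matching the 1024 mentioned in the issue description. */
    static final int BATCH_SIZE = 1024;

    /** Minimal stand-in for a vectorized batch holding a single long column. */
    static final class LongBatch {
        final long[] values = new long[BATCH_SIZE];
        int size; // number of valid rows currently stored in `values`
    }

    // Assumption: a plain iterator plays the role of the existing record reader.
    private final Iterator<Long> rows;

    BatchingReader(Iterator<Long> rows) {
        this.rows = rows;
    }

    /**
     * Resets `batch`, then copies up to BATCH_SIZE rows into it.
     * Returns false when no rows were left to read.
     */
    boolean next(LongBatch batch) {
        batch.size = 0;
        while (batch.size < BATCH_SIZE && rows.hasNext()) {
            batch.values[batch.size++] = rows.next();
        }
        return batch.size > 0;
    }
}
```

A caller would loop on next(batch) and hand each filled batch to the downstream operator. The per-row cost of the old reader remains in this scheme, which is consistent with the future work of converting directly to vector primitive types.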