[ 
https://issues.apache.org/jira/browse/HIVE-20932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16692549#comment-16692549
 ] 

slim bouguerra edited comment on HIVE-20932 at 11/20/18 2:13 AM:
-----------------------------------------------------------------

[~t3rmin4t0r] and [~teddy.choi] the most important classes methods to look at

 
druid-handler/src/java/org/apache/hadoop/hive/druid/io/DruidVectorizedWrapper.java

org.apache.hadoop.hive.druid.serde.DruidSerDe#deserializeAsPrimitive

org.apache.hadoop.hive.druid.io.DruidQueryBasedInputFormat#getRecordReader

 

all the reset is cleaning the code style. 


was (Author: bslim):
[~t3rmin4t0r] and [~teddy.choi] the most important class to look at

 
druid-handler/src/java/org/apache/hadoop/hive/druid/io/DruidVectorizedWrapper.java

org.apache.hadoop.hive.druid.serde.DruidSerDe#deserializeAsPrimitive

org.apache.hadoop.hive.druid.io.DruidQueryBasedInputFormat#getRecordReader

 

> Vectorize Druid Storage Handler Reader
> --------------------------------------
>
>                 Key: HIVE-20932
>                 URL: https://issues.apache.org/jira/browse/HIVE-20932
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: slim bouguerra
>            Assignee: slim bouguerra
>            Priority: Major
>         Attachments: HIVE-20932.3.patch, HIVE-20932.4.patch, HIVE-20932.patch
>
>
> This patch aims at adding support for vectorize read of data from Druid to 
> Hive.
> [~t3rmin4t0r] suggested that this will improve the performance of the top 
> level operators that supports vectorization.
> As a first cut am just adding a wrapper around the existing Record Reader to 
> read up to 1024 row at a time. 
> Future work will be to avoid going via old reader and convert straight the 
> Json (smile format) to Vector primitive types. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to