[ 
https://issues.apache.org/jira/browse/HIVE-17931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16244711#comment-16244711
 ] 

Vihang Karajgaonkar commented on HIVE-17931:
--------------------------------------------

Hi [~colinma] I left some comments on the review board. Since the patch is not 
committed yet, can we refactor the code in this patch itself to remove the 
duplicate code. Having duplicate code causing problems like HIVE-17961 where 
the bug was fixed only in one part of the code and the test was really testing 
duplicate code which didn't catch the regression.

Also, can you add some qtests which for the list types? Specifically, can add 
q-tests where the elements in the List are <1024, =1024 and >1024 as well to 
make we don't have corner conditions. Its unclear to me from the code how does 
the vectorized batch handle these cases.

Thanks a lot for the patch!

> Implement Parquet vectorization reader for Array type
> -----------------------------------------------------
>
>                 Key: HIVE-17931
>                 URL: https://issues.apache.org/jira/browse/HIVE-17931
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Colin Ma
>            Assignee: Colin Ma
>         Attachments: HIVE-17931.001.patch, HIVE-17931.002.patch
>
>
> Parquet vectorized reader can't support array type, it should be supported to 
> improve the performance when the query with array type. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to