[jira] [Commented] (DRILL-315) Reading only select columns from a parquet file

Jason Altekruse (JIRA) Sat, 30 Nov 2013 11:13:37 -0800

    [ 
https://issues.apache.org/jira/browse/DRILL-315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13835868#comment-13835868
 ]


Jason Altekruse commented on DRILL-315:
---------------------------------------

I do apologize, I had thought I finished posting to review board, but 
apparently it did not go through. It should not be available publicly for 
comments. I did put in syntax for predicate push-down while I was adding the 
plan syntax for limit and column selection. I did not plan on implementing it 
right away fro parquet, because we likely will not get that great a performance 
gain from doing the predicates on individual values before coping them into 
VVs. The problem is right now we seem to able to get objects or byte arrays out 
of the interface how we are using them now. If we copy out individual values to 
evaluate a predicate, I believe we'll get worse performance than just copying 
out all of the data and sending it along to the filter operation in a lot of 
cases. This is definitely something worth implementing for json though, as the 
files will likely be smaller, and we have to copy/translate individual values 
anyway.

> Reading only select columns from a parquet file
> -----------------------------------------------
>
>                 Key: DRILL-315
>                 URL: https://issues.apache.org/jira/browse/DRILL-315
>             Project: Apache Drill
>          Issue Type: New Feature
>            Reporter: Jason Altekruse
>            Assignee: Jason Altekruse
>         Attachments: 
> 0001-Parquet-files-now-read-by-column-rather-than-buffer-.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (DRILL-315) Reading only select columns from a parquet file

Reply via email to