[ 
https://issues.apache.org/jira/browse/DRILL-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14126075#comment-14126075
 ] 

Jason Altekruse commented on DRILL-1388:
----------------------------------------

I generated the select list from all of the schema elements 
listed by parquet. pig_schema is not actually a column in the file, so the 
parquet reader currently produces a null-filled column with that name. 
It appears that the project operator might not be handling this column 
correctly, so it should be reviewed. I downgraded the priority because there is 
no issue reading the real data.
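
To make the expected behavior concrete, here is a rough sketch in Python. It is not Drill code: the scan and project functions, the toy column values, and the row layout are all assumptions made up for illustration. It only shows the intent described above, that a requested column missing from the file comes back null filled, and that a project over the same select list should leave the scan output unchanged.

```python
from typing import Dict, List, Optional

Row = Dict[str, Optional[int]]


def scan(file_columns: Dict[str, List[Optional[int]]],
         requested: List[str]) -> List[Row]:
    """Hypothetical scan: a requested column that is absent from the
    file is returned as a null-filled column of the same length."""
    row_count = max((len(values) for values in file_columns.values()), default=0)
    rows: List[Row] = []
    for i in range(row_count):
        rows.append({
            name: file_columns[name][i] if name in file_columns else None
            for name in requested
        })
    return rows


def project(rows: List[Row], columns: List[str]) -> List[Row]:
    """Hypothetical project: keeps the listed columns and changes nothing else."""
    return [{name: row[name] for name in columns} for row in rows]


# Toy stand-in for store_sales; the values are invented for this sketch.
store_sales = {
    "ss_sold_date_sk": [2451813, None],
    "ss_item_sk": [1, 2],
}
select_list = ["pig_schema", "ss_sold_date_sk", "ss_item_sk"]

scanned = scan(store_sales, select_list)
projected = project(scanned, select_list)
```

In this sketch, running the project after the scan gives the same rows as the scan alone, which matches what I saw when I removed the extra project from the physical plan.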

> Incorrect results when projecting nulls
> ---------------------------------------
>
>                 Key: DRILL-1388
>                 URL: https://issues.apache.org/jira/browse/DRILL-1388
>             Project: Apache Drill
>          Issue Type: Bug
>            Reporter: Jason Altekruse
>
> While testing fixes for the parquet nullable support I ran into an issue with 
> unexpected results. I was selecting several columns out of a parquet file, 
> which supports project pushdown. Currently the planner still includes a 
> project operation after the scan in this case (to properly modify the schema in 
> the case of array indexing, since project pushdown into scans is currently not 
> supposed to change structure). I pulled the physical plan from the query 
> and ran it without the extra project (as I was not selecting any array 
> values) and got the expected results.
> Here is the query I ran. The file is too large to attach, so you can e-mail me 
> to get a copy of it.
> select pig_schema,ss_sold_date_sk,ss_item_sk,ss_cdemo_sk,ss_addr_sk, 
> ss_hdemo_sk from store_sales



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
