[ 
https://issues.apache.org/jira/browse/HIVE-8654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14189324#comment-14189324
 ] 

Sergey Shelukhin commented on HIVE-8654:
----------------------------------------

Looks like changing the output schema, as we do now, is not enough for parquet. 
Patching AST column names in addition fixes parquet, but that is problematic 
because it breaks other things (e.g. references from order by to select 
expressions), and patching those also is too hacky. I'll look for some other fix

> CBO: parquet_ctas test returns incorrect results
> ------------------------------------------------
>
>                 Key: HIVE-8654
>                 URL: https://issues.apache.org/jira/browse/HIVE-8654
>             Project: Hive
>          Issue Type: Sub-task
>          Components: CBO
>            Reporter: Sergey Shelukhin
>            Assignee: Sergey Shelukhin
>             Fix For: 0.14.0
>
>
> I am investigating right now. 
> The issue is specific to Parquet:
> {noformat}
> set hive.cbo.enable=true;
> drop table staging;
> drop table parquet_ctas;
> create table staging (key int, value string) stored as textfile;
> insert into table staging select * from src order by key limit 10;
> select * from staging;
> create table parquet_ctas stored as parquet as select * from staging;
> select * from parquet_ctas;
> create table orc_ctas stored as orc as select * from staging;
> select * from orc_ctas;
> create table txt_ctas stored as textfile as select * from staging;
> select * from txt_ctas;
> {noformat}
> The parquet query returns all NULLs with CBO on.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to