[ https://issues.apache.org/jira/browse/DRILL-4323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15124470#comment-15124470 ]
ASF GitHub Bot commented on DRILL-4323: --------------------------------------- GitHub user hsuanyi opened a pull request: https://github.com/apache/drill/pull/349 DRILL-4323: When converting HiveParquetScan To DrillParquetScan, do n… …ot add Project when no column is needed to be read out from Scan (e.g., select count(*) from hive.table) You can merge this pull request into a Git repository by running: $ git pull https://github.com/hsuanyi/incubator-drill DRILL-4323 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/drill/pull/349.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #349 ---- commit f06ef505628e12131c48bc7747ca21b007c2d2b4 Author: Hsuan-Yi Chu <hsua...@usc.edu> Date: 2016-01-29T21:20:12Z DRILL-4323: When converting HiveParquetScan To DrillParquetScan, do not add Project when no column is needed to be read out from Scan (e.g., select count(*) from hive.table) ---- > Hive Native Reader : A simple count(*) throws Incoming batch has an empty > schema error > -------------------------------------------------------------------------------------- > > Key: DRILL-4323 > URL: https://issues.apache.org/jira/browse/DRILL-4323 > Project: Apache Drill > Issue Type: Bug > Components: Storage - Hive > Affects Versions: 1.5.0 > Reporter: Rahul Challapalli > Assignee: Sean Hsuan-Yi Chu > Priority: Critical > Attachments: error.log > > > git.commit.id.abbrev=3d0b4b0 > A simple count(*) query does not work when hive native reader is enabled > {code} > 0: jdbc:drill:zk=10.10.100.190:5181> select count(*) from customer; > +---------+ > | EXPR$0 | > +---------+ > | 100000 | > +---------+ > 1 row selected (3.074 seconds) > 0: jdbc:drill:zk=10.10.100.190:5181> alter session set > `store.hive.optimize_scan_with_native_readers` = true; > +-------+--------------------------------------------------------+ > | ok | summary | > +-------+--------------------------------------------------------+ > | true | store.hive.optimize_scan_with_native_readers updated. | > +-------+--------------------------------------------------------+ > 1 row selected (0.2 seconds) > 0: jdbc:drill:zk=10.10.100.190:5181> select count(*) from customer; > Error: SYSTEM ERROR: IllegalStateException: Incoming batch [#1341, > ProjectRecordBatch] has an empty schema. This is not allowed. > Fragment 0:0 > [Error Id: 4c867440-0fd3-4eda-922f-0f5eadcb1463 on qa-node191.qa.lab:31010] > (state=,code=0) > {code} > Hive DDL for the table : > {code} > create table customer > ( > c_customer_sk int, > c_customer_id string, > c_current_cdemo_sk int, > c_current_hdemo_sk int, > c_current_addr_sk int, > c_first_shipto_date_sk int, > c_first_sales_date_sk int, > c_salutation string, > c_first_name string, > c_last_name string, > c_preferred_cust_flag string, > c_birth_day int, > c_birth_month int, > c_birth_year int, > c_birth_country string, > c_login string, > c_email_address string, > c_last_review_date string > ) > STORED AS PARQUET > LOCATION '/drill/testdata/customer' > {code} > Attached the log file with the stacktrace -- This message was sent by Atlassian JIRA (v6.3.4#6332)