armitage420 commented on PR #6035: URL: https://github.com/apache/hive/pull/6035#issuecomment-3199588695
@Aggarwal-Raghav @zhangbutao @deniskuzZ Thanks for your time!

@deniskuzZ The issue is not reproducible in q files, but it is reproducible on my cluster with Hadoop 3.4.1 and Tez 0.10.4. These are the steps I followed:

```
set hive.execution.engine=tez;
drop database if exists hive_header_footer_db cascade;
create database if not exists hive_header_footer_db;
use hive_header_footer_db;
CREATE EXTERNAL TABLE `testcase1`(id int, name string)
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.OpenCSVSerde'
LOCATION '/user/hive/hiveData/hive_header_footer_compressed/testcase1'
TBLPROPERTIES ("skip.header.line.count"="1", "skip.footer.line.count"="1");
LOAD DATA INPATH '/tmp/testcase1.csv' INTO TABLE testcase1;
select count(*) from testcase1;
```

The error is thrown during count(*), not during simple reads. The content of testcase1.csv is:

```
1,2019-12-31
2,2019-12-31
3,2019-12-31
```

Here is the error, reproduced without the changes:

<img width="1719" height="917" alt="Screenshot 2025-08-19 at 12 51 52 PM" src="https://github.com/user-attachments/assets/5cd324d9-cd90-4351-a7d3-f49c10d33315" />

Here is the result of the same command after replacing the hive-exec jar with one that includes the new change:

<img width="3450" height="2098" alt="image" src="https://github.com/user-attachments/assets/59474eea-96f2-4948-887f-340d83db4346" />

--
This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at: us...@infra.apache.org