[
https://issues.apache.org/jira/browse/IMPALA-11900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17684986#comment-17684986
]
Joe McDonnell commented on IMPALA-11900:
----------------------------------------
There are test failures on our S3 job when selecting from
iceberg_partitioned_orc that look related to this. Here is the output on
query_test/test_scanners.py's test_iceberg_old_fileformat:
{noformat}
query_test/test_scanners.py:496: in test_iceberg_old_fileformat
self.run_test_case('QueryTest/iceberg-old-fileformat', vector,
use_db=unique_database)
common/impala_test_suite.py:772: in run_test_case
user=test_section.get('USER', '').strip() or None)
common/impala_test_suite.py:694: in __exec_in_impala
result = self.__execute_query(target_impalad_client, query, user=user)
common/impala_test_suite.py:1048: in __execute_query
return impalad_client.execute(query, user=user)
common/impala_connection.py:215: in execute
return self.__beeswax_client.execute(sql_stmt, user=user)
beeswax/impala_beeswax.py:189: in execute
handle = self.__execute_query(query_string.strip(), user=user)
beeswax/impala_beeswax.py:367: in __execute_query
self.wait_for_finished(handle)
beeswax/impala_beeswax.py:388: in wait_for_finished
raise ImpalaBeeswaxException("Query aborted:" + error_log, None)
E ImpalaBeeswaxException: ImpalaBeeswaxException:
E Query aborted:Encountered parse error in tail of ORC file
s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/iceberg_partitioned_orc/functional_parquet/iceberg_partitioned_orc/data/action=download/00007-7-fc53a761-1ee2-4ce4-9507-c9917917d2c9-00000.orc:
Failed to parse the postscript from
s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/iceberg_partitioned_orc/functional_parquet/iceberg_partitioned_orc/data/action=download/00007-7-fc53a761-1ee2-4ce4-9507-c9917917d2c9-00000.orc{noformat}
There is a similar error on query_test/test_scanners.py's test_iceberg_query.
Let me know if this is not actually related.
I'm going to bump the priority of this, just because tests are failing.
> Test table iceberg_partitioned_orc has wrong metadata
> -----------------------------------------------------
>
> Key: IMPALA-11900
> URL: https://issues.apache.org/jira/browse/IMPALA-11900
> Project: IMPALA
> Issue Type: Bug
> Reporter: Zoltán Borók-Nagy
> Assignee: Zoltán Borók-Nagy
> Priority: Major
> Labels: impala-iceberg
>
> Iceberg table iceberg_partitioned_orc has wrong metadata.
> The field 'file_size_in_bytes' is wrong for the data files.
> This causes issues on object stores where we rely more on Iceberg metadata
> since
> IMPALA-11798.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]