[ 
https://issues.apache.org/jira/browse/IMPALA-11900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17684986#comment-17684986
 ] 

Joe McDonnell commented on IMPALA-11900:
----------------------------------------

There are test failures on our S3 job when selecting from 
iceberg_partitioned_orc that look related to this. Here is the output on 
query_test/test_scanners.py's test_iceberg_old_fileformat:
{noformat}
query_test/test_scanners.py:496: in test_iceberg_old_fileformat
    self.run_test_case('QueryTest/iceberg-old-fileformat', vector, 
use_db=unique_database)
common/impala_test_suite.py:772: in run_test_case
    user=test_section.get('USER', '').strip() or None)
common/impala_test_suite.py:694: in __exec_in_impala
    result = self.__execute_query(target_impalad_client, query, user=user)
common/impala_test_suite.py:1048: in __execute_query
    return impalad_client.execute(query, user=user)
common/impala_connection.py:215: in execute
    return self.__beeswax_client.execute(sql_stmt, user=user)
beeswax/impala_beeswax.py:189: in execute
    handle = self.__execute_query(query_string.strip(), user=user)
beeswax/impala_beeswax.py:367: in __execute_query
    self.wait_for_finished(handle)
beeswax/impala_beeswax.py:388: in wait_for_finished
    raise ImpalaBeeswaxException("Query aborted:" + error_log, None)
E   ImpalaBeeswaxException: ImpalaBeeswaxException:
E    Query aborted:Encountered parse error in tail of ORC file 
s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/iceberg_partitioned_orc/functional_parquet/iceberg_partitioned_orc/data/action=download/00007-7-fc53a761-1ee2-4ce4-9507-c9917917d2c9-00000.orc:
 Failed to parse the postscript from 
s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/iceberg_partitioned_orc/functional_parquet/iceberg_partitioned_orc/data/action=download/00007-7-fc53a761-1ee2-4ce4-9507-c9917917d2c9-00000.orc{noformat}
There is a similar error on query_test/test_scanners.py's test_iceberg_query. 
Let me know if this is not actually related.

I'm going to bump the priority of this, just because tests are failing.

> Test table iceberg_partitioned_orc has wrong metadata
> -----------------------------------------------------
>
>                 Key: IMPALA-11900
>                 URL: https://issues.apache.org/jira/browse/IMPALA-11900
>             Project: IMPALA
>          Issue Type: Bug
>            Reporter: Zoltán Borók-Nagy
>            Assignee: Zoltán Borók-Nagy
>            Priority: Major
>              Labels: impala-iceberg
>
> Iceberg table iceberg_partitioned_orc has wrong metadata.
> The field 'file_size_in_bytes' is wrong for the data files.
> This causes issues on object stores where we rely more on Iceberg metadata 
> since 
> IMPALA-11798.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to