Laszlo Gaal created IMPALA-11576:
------------------------------------

             Summary: query_test.test_iceberg.test_multiple_storage_locations 
fails on S3
                 Key: IMPALA-11576
                 URL: https://issues.apache.org/jira/browse/IMPALA-11576
             Project: IMPALA
          Issue Type: Bug
            Reporter: Laszlo Gaal
            Assignee: Gergely Fürnstáhl


The test seems to fail on a badly constructed file name.
Stack trace:{code}
query_test.test_iceberg.TestIcebergTable.test_multiple_storage_locations[protocol:
 beeswax | exec_option: {'test_replan': 1, 'batch_size': 0, 'num_nodes': 0, 
'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 
'abort_on_error': 1, 'exec_single_node_rows_threshold': 0} | table_format: 
parquet/none]

query_test/test_iceberg.py:785: in test_multiple_storage_locations
    vector, unique_database)
common/impala_test_suite.py:706: in run_test_case
    result = exec_fn(query, user=test_section.get('USER', '').strip() or None)
common/impala_test_suite.py:644: in __exec_in_impala
    result = self.__execute_query(target_impalad_client, query, user=user)
common/impala_test_suite.py:980: in __execute_query
    return impalad_client.execute(query, user=user)
common/impala_connection.py:212: in execute
    return self.__beeswax_client.execute(sql_stmt, user=user)
beeswax/impala_beeswax.py:189: in execute
    handle = self.__execute_query(query_string.strip(), user=user)
beeswax/impala_beeswax.py:367: in __execute_query
    self.wait_for_finished(handle)
beeswax/impala_beeswax.py:388: in wait_for_finished
    raise ImpalaBeeswaxException("Query aborted:" + error_log, None)
E   ImpalaBeeswaxException: ImpalaBeeswaxException:
E    Query aborted:Disk I/O error on 
impala-ec2-centos79-m6i-4xlarge-ondemand-1313.vpc.cloudera.com:27001: Failed to 
open HDFS file 
s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/ice/iceberg_multiple_storage_locations/s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/ice/iceberg_multiple_storage_locations_data02/col_int=2/00001-1-26bc91ef-b403-4b65-a6b0-566396b8d097-00001.parquet
E   Error(2): No such file or directory
E   Root cause: FileNotFoundException: No such file or directory: 
s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/ice/iceberg_multiple_storage_locations/s3a:/impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/ice/iceberg_multiple_storage_locations_data02/col_int=2/00001-1-26bc91ef-b403-4b65-a6b0-566396b8d097-00001.parquet{code}
The file name below is a single, contiguous string despite the apparent 
line breaks:
s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/ice/iceberg_multiple_storage_locations/s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/ice/iceberg_multiple_storage_locations_data02/col_int=2/00001-1-26bc91ef-b403-4b65-a6b0-566396b8d097-00001.parquet
It contains the s3a:// scheme prefix a second time in the middle of the 
string: the absolute path of the data file (under 
iceberg_multiple_storage_locations_data02) appears to have been appended to 
the table location (iceberg_multiple_storage_locations) instead of being used 
as-is, which points to a badly set up concatenation.
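
To illustrate the suspected failure mode, here is a minimal Python sketch. It is 
not Impala's actual code: the helper names naive_join and resolve_data_file and 
the shortened example paths are hypothetical, and the real path handling may 
live in a different layer and language. The sketch only shows how 
unconditionally prefixing a table location onto an already-absolute data file 
URI yields a path like the one in the error, and how checking for a URI scheme 
first would avoid it.

```python
from urllib.parse import urlparse


def naive_join(table_location, file_path):
    """Hypothetical buggy join: always prefixes the table location."""
    return table_location.rstrip("/") + "/" + file_path


def resolve_data_file(table_location, file_path):
    """Hypothetical fix: keep file_path unchanged if it already has a URI scheme."""
    if urlparse(file_path).scheme:
        # Already an absolute URI such as s3a://bucket/...; use it as-is.
        return file_path
    # Otherwise treat it as relative to the table location.
    return table_location.rstrip("/") + "/" + file_path.lstrip("/")


# Shortened, made-up paths that mirror the shape of the ones in the stack trace.
table = "s3a://bucket/warehouse/ice/tbl"
absolute = "s3a://bucket/warehouse/ice/tbl_data02/col_int=2/f.parquet"
relative = "data/col_int=1/f.parquet"

broken = naive_join(table, absolute)
fixed = resolve_data_file(table, absolute)
joined = resolve_data_file(table, relative)
```

With these inputs, broken embeds a second "s3a://" in the middle of the path, 
as seen in the error message, while fixed equals the original absolute URI and 
joined is the relative path placed under the table location.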



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
