Joe McDonnell created IMPALA-10062:
--------------------------------------
Summary: TestCompressedNonText.test_insensitivity_to_extension can
fail due to wrong filename
Key: IMPALA-10062
URL: https://issues.apache.org/jira/browse/IMPALA-10062
Project: IMPALA
Issue Type: Bug
Components: Frontend
Affects Versions: Impala 4.0
Reporter: Joe McDonnell
The fix for IMPALA-10005 added a new TestCompressedNonText test. The test
relies on Hive generating a specific file name when writing these compressed
tables: it expects a file named 000000_0. Dataload does not appear to
guarantee that name, which can lead to failures like this:
{noformat}
query_test/test_compressed_formats.py:142: in test_insensitivity_to_extension
unique_database, 'tinytable', db_suffix, '000000_0', src_extension, ext)
query_test/test_compressed_formats.py:86: in _copy_and_query_compressed_file
self.filesystem_client.copy(src_file, dest_file, overwrite=True)
util/hdfs_util.py:79: in copy
self.hdfs_filesystem_client.copy(src, dst, overwrite)
util/hdfs_util.py:241: in copy
'{0} copy failed: '.format(self.filesystem_type) + stderr + "; " + stdout
E AssertionError: HDFS copy failed: cp:
`/test-warehouse/tinytable_avro_snap/000000_0': No such file or directory
E ;{noformat}
The file listing shows that the file is actually named
"/test-warehouse/tinytable_avro_snap/000000_1".
We should update the test to tolerate this. The actual base filename doesn't
matter for this test.
I have seen this exactly once so far.
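One possible way to make the test tolerant, sketched under assumptions: rather
than hard-coding '000000_0', list the table directory and use whatever single
data file is there. The helper name pick_data_file is hypothetical and not part
of the Impala test suite; the listing is assumed to be a list of base file
names, and skipping entries that start with "_" or "." is also an assumption.

```python
def pick_data_file(listing):
    """Return the only data file name found in a table directory listing.

    `listing` is assumed to be a list of base file names (no directory part).
    """
    # Assumption: entries starting with "_" or "." (e.g. _SUCCESS) are not data.
    candidates = sorted(
        name for name in listing if not name.startswith(("_", "."))
    )
    # The test only needs one source file; fail loudly on anything else.
    if len(candidates) != 1:
        raise ValueError(
            "expected exactly one data file, found: %r" % (candidates,))
    return candidates[0]


if __name__ == "__main__":
    # Works regardless of whether Hive wrote 000000_0 or 000000_1.
    print(pick_data_file(["000000_1"]))
```

The test could then pass the returned name where it currently passes
'000000_0', since the actual base filename does not matter to it.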