[ 
https://issues.apache.org/jira/browse/IMPALA-8754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17153806#comment-17153806
 ] 

Wenzhe Zhou edited comment on IMPALA-8754 at 7/8/20, 5:55 PM:
--------------------------------------------------------------

Saw a similar error while loading tpch data for Kudu.

The console output shows the data loading failure for Kudu:

Loading workload 'tpch' using exploration strategy 'core' in table formats 'kudu/none/none'. Logging to /data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/logs/data_loading/data-load-tpch-core-kudu_none_none.log

*04:14:39* Error loading data. 

 

The log file 
"logs/data_loading/sql/tpch/load-tpch-core-impala-generated-kudu-none-none.sql.log"
 shows that the root cause was "Requested resource not found". See the log excerpt below:

 

ERROR: INSERT INTO TABLE tpch_kudu.lineitem SELECT * FROM tpch.lineitem
Traceback (most recent call last):
  File "/data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/bin/load-data.py", line 208, in exec_impala_query_from_file
    result = impala_client.execute(query)
  File "/data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/beeswax/impala_beeswax.py", line 187, in execute
    handle = self.__execute_query(query_string.strip(), user=user)
  File "/data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/beeswax/impala_beeswax.py", line 365, in __execute_query
    self.wait_for_finished(handle)
  File "/data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/tests/beeswax/impala_beeswax.py", line 386, in wait_for_finished
    raise ImpalaBeeswaxException("Query aborted:" + error_log, None)
ImpalaBeeswaxException: ImpalaBeeswaxException:
 Query aborted:Disk I/O error on impala-ec2-centos74-m5-4xlarge-ondemand-1b61.vpc.cloudera.com:22000: Failed to open HDFS file s3a://impala-test-uswest2-1/test-warehouse/tpch.lineitem/lineitem.tbl
 Error(2): No such file or directory
 Root cause: ResourceNotFoundException: Requested resource not found (Service: AmazonDynamoDBv2; Status Code: 400; Error Code: ResourceNotFoundException; Request ID: JNG4VO47DND18BRTPCROK2H38FVV4KQNSO5AEMVJF66Q9ASUAAJG)
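
For triaging further occurrences, a small script could scan a data loading log for this signature. The following is only a rough sketch: the function names find_root_cause and is_s3guard_dynamodb_miss are hypothetical and are not part of load-data.py or the Impala test framework, and the pattern assumes the "Root cause:" line always has the shape seen in the log above.

```python
import re

# Matches the "Root cause:" line in the shape shown in the log above.
# Assumption: the exception name, service and status code always appear
# in this order; other AWS SDK messages may be formatted differently.
ROOT_CAUSE_RE = re.compile(
    r"Root cause:\s*(?P<exc>\w+):.*?\(Service:\s*(?P<service>\w+);"
    r"\s*Status Code:\s*(?P<status>\d+);"
)


def find_root_cause(log_text):
    """Return (exception, service, status) for the first root cause, or None."""
    # Log lines may be wrapped (as in this email), so collapse all
    # whitespace into single spaces before matching.
    flat = " ".join(log_text.split())
    match = ROOT_CAUSE_RE.search(flat)
    if match is None:
        return None
    return match.group("exc"), match.group("service"), int(match.group("status"))


def is_s3guard_dynamodb_miss(log_text):
    """True if the log shows the DynamoDB ResourceNotFoundException seen here."""
    return find_root_cause(log_text) == (
        "ResourceNotFoundException",
        "AmazonDynamoDBv2",
        400,
    )


if __name__ == "__main__":
    # Excerpt copied from the log in this comment, including the wrapping.
    sample = """
     Error(2): No such file or directory
     Root cause: ResourceNotFoundException: Requested resource not found (Service:
    AmazonDynamoDBv2; Status Code: 400; Error Code: ResourceNotFoundException;
    Request ID: JNG4VO47DND18BRTPCROK2H38FVV4KQNSO5AEMVJF66Q9ASUAAJG)
    """
    print(find_root_cause(sample))
    print(is_s3guard_dynamodb_miss(sample))
```

Collapsing whitespace first keeps the match working whether the log line is intact or wrapped across several lines.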



> S3 with S3Guard tests encounter "ResourceNotFoundException" from DynamoDB
> -------------------------------------------------------------------------
>
>                 Key: IMPALA-8754
>                 URL: https://issues.apache.org/jira/browse/IMPALA-8754
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Infrastructure
>    Affects Versions: Impala 3.3.0
>            Reporter: Joe McDonnell
>            Assignee: Joe McDonnell
>            Priority: Critical
>              Labels: broken-build, flaky
>         Attachments: load-tpch-core-impala-generated-kudu-none-none.sql.log
>
>
> When running tests on S3 with S3Guard, various tests can encounter the 
> following error coming from DynamoDB:
> {noformat}
> E    Query aborted:Disk I/O error on 
> impala-ec2-centos74-m5-4xlarge-ondemand-02c8.vpc.cloudera.com:22002: Failed 
> to open HDFS file 
> s3a://impala-test-uswest2-1/test-warehouse/tpcds.store_sales_parquet/ss_sold_date_sk=2451718/6843d8a91fc5ae1d-88b2af4b00000004_156969840_data.0.parq
> E   Error(2): No such file or directory
> E   Root cause: ResourceNotFoundException: Requested resource not found 
> (Service: AmazonDynamoDBv2; Status Code: 400; Error Code: 
> ResourceNotFoundException; Request ID: 
> XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX){noformat}
> Tests that have seen this (this is flaky):
>  * TestTpcdsQuery.test_tpcds_count
>  * TestHdfsFdCaching.test_caching_disabled_by_param
>  * TestMtDop.test_compute_stats
>  * TestScanRangeLengths.test_scan_ranges


