Michael Smith created IMPALA-13350:
--------------------------------------

             Summary: Workload Management flush on interval test failed
                 Key: IMPALA-13350
                 URL: https://issues.apache.org/jira/browse/IMPALA-13350
             Project: IMPALA
          Issue Type: Bug
          Components: Backend
            Reporter: Michael Smith
         Attachments: failure.tgz

After IMPALA-12737 "Refactor the Workload Management Initialization Process", 
custom_cluster/test_query_log.py::TestQueryLogTableHS2::test_flush_on_interval 
failed with
{code:java}
-- 2024-09-03 11:50:35,147 INFO     MainThread: Starting cluster with command: 
/data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/bin/start-impala-cluster.py
 '--state_store_args=--statestore_update_frequency_ms=50     
--statestore_priority_update_frequency_ms=50     
--statestore_heartbeat_frequency_ms=50' --cluster_size=3 --num_coordinators=3 
--log_dir=/data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/logs/custom_cluster_tests
 --log_level=1 '--impalad=--shutdown_grace_period_s=0 --shutdown_deadline_s=15' 
'--impalad_args=--enable_workload_mgmt --query_log_write_interval_s=15 ' 
'--state_store_args=None ' '--catalogd_args=--enable_workload_mgmt ' 
--impalad_args=--default_query_options=
11:50:35 MainThread: Found 0 impalad/0 statestored/0 catalogd process(es)
11:50:35 MainThread: Starting State Store logging to 
/data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/logs/custom_cluster_tests/statestored.INFO
11:50:35 MainThread: Starting Catalog Service logging to 
/data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/logs/custom_cluster_tests/catalogd.INFO
11:50:35 MainThread: Starting Impala Daemon logging to 
/data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/logs/custom_cluster_tests/impalad.INFO
11:50:35 MainThread: Starting Impala Daemon logging to 
/data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/logs/custom_cluster_tests/impalad_node1.INFO
11:50:35 MainThread: Starting Impala Daemon logging to 
/data/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/logs/custom_cluster_tests/impalad_node2.INFO
11:50:38 MainThread: Found 3 impalad/1 statestored/1 catalogd process(es)
11:50:38 MainThread: Found 3 impalad/1 statestored/1 catalogd process(es)
11:50:38 MainThread: Getting num_known_live_backends from 
impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25000
11:50:38 MainThread: Debug webpage not yet available: 
HTTPConnectionPool(host='impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com',
 port=25000): Max retries exceeded with url: /backends?json (Caused by 
NewConnectionError('<urllib3.connection.HTTPConnection object at 
0x7f99f9704d10>: Failed to establish a new connection: [Errno 111] Connection 
refused',))
11:50:40 MainThread: Debug webpage did not become available in expected time.
11:50:40 MainThread: Waiting for num_known_live_backends=3. Current value: None
11:50:41 MainThread: Found 3 impalad/1 statestored/1 catalogd process(es)
11:50:41 MainThread: Getting num_known_live_backends from 
impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25000
11:50:41 MainThread: Debug webpage not yet available: 
HTTPConnectionPool(host='impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com',
 port=25000): Max retries exceeded with url: /backends?json (Caused by 
NewConnectionError('<urllib3.connection.HTTPConnection object at 
0x7f99f97037d0>: Failed to establish a new connection: [Errno 111] Connection 
refused',))
11:50:43 MainThread: Debug webpage did not become available in expected time.
11:50:43 MainThread: Waiting for num_known_live_backends=3. Current value: None
11:50:44 MainThread: Found 3 impalad/1 statestored/1 catalogd process(es)
11:50:44 MainThread: Getting num_known_live_backends from 
impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25000
11:50:44 MainThread: num_known_live_backends has reached value: 3
11:50:45 MainThread: Found 3 impalad/1 statestored/1 catalogd process(es)
11:50:45 MainThread: Getting num_known_live_backends from 
impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25001
11:50:45 MainThread: num_known_live_backends has reached value: 3
11:50:45 MainThread: Found 3 impalad/1 statestored/1 catalogd process(es)
11:50:45 MainThread: Getting num_known_live_backends from 
impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25002
11:50:45 MainThread: num_known_live_backends has reached value: 3
11:50:46 MainThread: Impala Cluster Running with 3 nodes (3 coordinators, 3 
executors).
-- 2024-09-03 11:50:46,553 DEBUG    MainThread: Found 3 impalad/1 statestored/1 
catalogd process(es)
-- 2024-09-03 11:50:46,553 INFO     MainThread: Getting metric: 
statestore.live-backends from 
impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25010
-- 2024-09-03 11:50:46,556 INFO     MainThread: Metric 
'statestore.live-backends' has reached desired value: 4
-- 2024-09-03 11:50:46,556 DEBUG    MainThread: Getting num_known_live_backends 
from impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25000
-- 2024-09-03 11:50:46,558 INFO     MainThread: num_known_live_backends has 
reached value: 3
-- 2024-09-03 11:50:46,558 DEBUG    MainThread: Getting num_known_live_backends 
from impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25001
-- 2024-09-03 11:50:46,559 INFO     MainThread: num_known_live_backends has 
reached value: 3
-- 2024-09-03 11:50:46,559 DEBUG    MainThread: Getting num_known_live_backends 
from impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25002
-- 2024-09-03 11:50:46,561 INFO     MainThread: num_known_live_backends has 
reached value: 3
SET 
client_identifier=custom_cluster/test_query_log.py::TestQueryLogTableHS2::()::test_flush_on_interval[protocol:hs2|exec_option:{'test_replan':1;'batch_size':0;'num_nodes':0;'disable_codegen_rows_threshold':5000;'disable_codegen':False;'abort_on_error':1;'exec_single_node_ro;
-- connecting to: localhost:21000
-- 2024-09-03 11:50:46,561 INFO     MainThread: Could not connect to ('::1', 
21000, 0, 0)
Traceback (most recent call last):
  File 
"/data0/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/infra/python/env-gcc10.4.0/lib/python2.7/site-packages/thrift/transport/TSocket.py",
 line 137, in open
    handle.connect(sockaddr)
  File 
"/data/jenkins/workspace/impala-asf-master-core-s3/Impala-Toolchain/toolchain-packages-gcc10.4.0/python-2.7.16/lib/python2.7/socket.py",
 line 228, in meth
    return getattr(self._sock,name)(*args)
error: [Errno 111] Connection refused
-- connecting to localhost:21050 with impyla
-- 2024-09-03 11:50:46,562 INFO     MainThread: Could not connect to ('::1', 
21050, 0, 0)
Traceback (most recent call last):
  File 
"/data0/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/infra/python/env-gcc10.4.0/lib/python2.7/site-packages/thrift/transport/TSocket.py",
 line 137, in open
    handle.connect(sockaddr)
  File 
"/data/jenkins/workspace/impala-asf-master-core-s3/Impala-Toolchain/toolchain-packages-gcc10.4.0/python-2.7.16/lib/python2.7/socket.py",
 line 228, in meth
    return getattr(self._sock,name)(*args)
error: [Errno 111] Connection refused
-- 2024-09-03 11:50:46,638 INFO     MainThread: Closing active operation
-- connecting to localhost:28000 with impyla
-- 2024-09-03 11:50:46,661 INFO     MainThread: Closing active operation
-- 2024-09-03 11:50:46,663 INFO     MainThread: Expected log lines could not be 
found, sleeping before retrying: Expected 1 lines in file 
/data0/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/logs/custom_cluster_tests/impalad.impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com.jenkins.log.INFO.20240903-115035.12755
 matching regex 'Completed workload management initialization', but found 0 
lines. Last line was: 
            }
-- 2024-09-03 11:50:47,665 INFO     MainThread: Expected log lines could not be 
found, sleeping before retrying: Expected 1 lines in file 
/data0/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/logs/custom_cluster_tests/impalad.impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com.jenkins.log.INFO.20240903-115035.12755
 matching regex 'Completed workload management initialization', but found 0 
lines. Last line was: 
I0903 11:50:46.911687 14130 workload-man
-- 2024-09-03 11:50:48,668 INFO     MainThread: Expected log lines could not be 
found, sleeping before retrying: Expected 1 lines in file 
/data0/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/logs/custom_cluster_tests/impalad.impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com.jenkins.log.INFO.20240903-115035.12755
 matching regex 'Completed workload management initialization', but found 0 
lines. Last line was: 
I0903 11:50:46.911687 14130 workload-man
-- 2024-09-03 11:50:49,670 INFO     MainThread: Expected log lines could not be 
found, sleeping before retrying: Expected 1 lines in file 
/data0/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/logs/custom_cluster_tests/impalad.impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com.jenkins.log.INFO.20240903-115035.12755
 matching regex 'Completed workload management initialization', but found 0 
lines. Last line was: 
I0903 11:50:46.911687 14130 workload-man
-- 2024-09-03 11:50:50,673 INFO     MainThread: Expected log lines could not be 
found, sleeping before retrying: Expected 1 lines in file 
/data0/jenkins/workspace/impala-asf-master-core-s3/repos/Impala/logs/custom_cluster_tests/impalad.impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com.jenkins.log.INFO.20240903-115035.12755
 matching regex 'Completed workload management initialization', but found 0 
lines. Last line was: 
I0903 11:50:46.911687 14130 workload-man
-- getting log for operation: <tests.common.impala_connection.OperationHandle 
object at 0x7f8150074f10>
-- getting runtime profile operation: 
<tests.common.impala_connection.OperationHandle object at 0x7f8150074f10>
-- closing query for operation handle: 
<tests.common.impala_connection.OperationHandle object at 0x7f8150074f10>
-- 2024-09-03 11:50:52,884 INFO     MainThread: Closing active operation
-- getting log for operation: <tests.common.impala_connection.OperationHandle 
object at 0x7f8150074810>
-- getting runtime profile operation: 
<tests.common.impala_connection.OperationHandle object at 0x7f8150074810>
-- closing query for operation handle: 
<tests.common.impala_connection.OperationHandle object at 0x7f8150074810>
-- 2024-09-03 11:50:53,898 INFO     MainThread: Closing active operation
-- getting log for operation: <tests.common.impala_connection.OperationHandle 
object at 0x7f81500746d0>
-- getting runtime profile operation: 
<tests.common.impala_connection.OperationHandle object at 0x7f81500746d0>
-- closing query for operation handle: 
<tests.common.impala_connection.OperationHandle object at 0x7f81500746d0>
-- 2024-09-03 11:50:54,913 INFO     MainThread: Closing active operation
-- getting log for operation: <tests.common.impala_connection.OperationHandle 
object at 0x7f8150074210>
-- getting runtime profile operation: 
<tests.common.impala_connection.OperationHandle object at 0x7f8150074210>
-- closing query for operation handle: 
<tests.common.impala_connection.OperationHandle object at 0x7f8150074210>
-- 2024-09-03 11:50:55,927 INFO     MainThread: Closing active operation
-- getting log for operation: <tests.common.impala_connection.OperationHandle 
object at 0x7f8150074dd0>
-- getting runtime profile operation: 
<tests.common.impala_connection.OperationHandle object at 0x7f8150074dd0>
-- closing query for operation handle: 
<tests.common.impala_connection.OperationHandle object at 0x7f8150074dd0>
-- 2024-09-03 11:50:56,941 INFO     MainThread: Closing active operation
-- getting log for operation: <tests.common.impala_connection.OperationHandle 
object at 0x7f81500746d0>
-- getting runtime profile operation: 
<tests.common.impala_connection.OperationHandle object at 0x7f81500746d0>
-- closing query for operation handle: 
<tests.common.impala_connection.OperationHandle object at 0x7f81500746d0>
-- 2024-09-03 11:50:57,956 INFO     MainThread: Closing active operation
-- getting log for operation: <tests.common.impala_connection.OperationHandle 
object at 0x7f8150074210>
-- getting runtime profile operation: 
<tests.common.impala_connection.OperationHandle object at 0x7f8150074210>
-- closing query for operation handle: 
<tests.common.impala_connection.OperationHandle object at 0x7f8150074210>
-- 2024-09-03 11:50:58,970 INFO     MainThread: Closing active operation
-- getting log for operation: <tests.common.impala_connection.OperationHandle 
object at 0x7f8150074dd0>
-- getting runtime profile operation: 
<tests.common.impala_connection.OperationHandle object at 0x7f8150074dd0>
-- closing query for operation handle: 
<tests.common.impala_connection.OperationHandle object at 0x7f8150074dd0>
-- 2024-09-03 11:50:59,984 INFO     MainThread: Closing active operation
-- getting log for operation: <tests.common.impala_connection.OperationHandle 
object at 0x7f81500746d0>
-- getting runtime profile operation: 
<tests.common.impala_connection.OperationHandle object at 0x7f81500746d0>
-- closing query for operation handle: 
<tests.common.impala_connection.OperationHandle object at 0x7f81500746d0>
-- 2024-09-03 11:51:00,998 INFO     MainThread: Closing active operation
-- getting log for operation: <tests.common.impala_connection.OperationHandle 
object at 0x7f8150074210>
-- getting runtime profile operation: 
<tests.common.impala_connection.OperationHandle object at 0x7f8150074210>
-- closing query for operation handle: 
<tests.common.impala_connection.OperationHandle object at 0x7f8150074210>
-- 2024-09-03 11:51:02,011 INFO     MainThread: Closing active operation
-- 2024-09-03 11:51:02,012 INFO     MainThread: Getting metric: 
impala-server.completed-queries.written from 
impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25000
-- 2024-09-03 11:51:02,015 INFO     MainThread: Waiting for metric value 
'impala-server.completed-queries.written'=10. Current value: 0
-- 2024-09-03 11:51:02,015 INFO     MainThread: Sleeping 1s before next retry.
-- 2024-09-03 11:51:03,016 INFO     MainThread: Getting metric: 
impala-server.completed-queries.written from 
impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25000
-- 2024-09-03 11:51:03,029 INFO     MainThread: Waiting for metric value 
'impala-server.completed-queries.written'=10. Current value: 0
-- 2024-09-03 11:51:03,029 INFO     MainThread: Sleeping 1s before next retry.
-- 2024-09-03 11:51:04,030 INFO     MainThread: Getting metric: 
impala-server.completed-queries.written from 
impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25000
-- 2024-09-03 11:51:04,033 INFO     MainThread: Waiting for metric value 
'impala-server.completed-queries.written'=10. Current value: 9
-- 2024-09-03 11:51:04,033 INFO     MainThread: Sleeping 1s before next retry.
-- 2024-09-03 11:51:05,034 INFO     MainThread: Getting metric: 
impala-server.completed-queries.written from 
impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25000
-- 2024-09-03 11:51:05,037 INFO     MainThread: Waiting for metric value 
'impala-server.completed-queries.written'=10. Current value: 9
-- 2024-09-03 11:51:05,037 INFO     MainThread: Sleeping 1s before next retry.
-- 2024-09-03 11:51:06,038 INFO     MainThread: Getting metric: 
impala-server.completed-queries.written from 
impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25000
-- 2024-09-03 11:51:06,041 INFO     MainThread: Waiting for metric value 
'impala-server.completed-queries.written'=10. Current value: 9
-- 2024-09-03 11:51:06,041 INFO     MainThread: Sleeping 1s before next retry.
-- 2024-09-03 11:51:07,042 INFO     MainThread: Getting metric: 
impala-server.completed-queries.written from 
impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25000
-- 2024-09-03 11:51:07,045 INFO     MainThread: Waiting for metric value 
'impala-server.completed-queries.written'=10. Current value: 9
-- 2024-09-03 11:51:07,045 INFO     MainThread: Sleeping 1s before next retry.
-- 2024-09-03 11:51:08,046 INFO     MainThread: Getting metric: 
impala-server.completed-queries.written from 
impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25000
-- 2024-09-03 11:51:08,049 INFO     MainThread: Waiting for metric value 
'impala-server.completed-queries.written'=10. Current value: 9
-- 2024-09-03 11:51:08,049 INFO     MainThread: Sleeping 1s before next retry.
-- 2024-09-03 11:51:09,050 INFO     MainThread: Getting metric: 
impala-server.completed-queries.written from 
impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25000
-- 2024-09-03 11:51:09,054 INFO     MainThread: Waiting for metric value 
'impala-server.completed-queries.written'=10. Current value: 9
-- 2024-09-03 11:51:09,054 INFO     MainThread: Sleeping 1s before next retry.
-- 2024-09-03 11:51:10,055 INFO     MainThread: Getting metric: 
impala-server.completed-queries.written from 
impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25000
-- 2024-09-03 11:51:10,058 INFO     MainThread: Waiting for metric value 
'impala-server.completed-queries.written'=10. Current value: 9
-- 2024-09-03 11:51:10,058 INFO     MainThread: Sleeping 1s before next retry.
-- 2024-09-03 11:51:11,059 INFO     MainThread: Getting metric: 
impala-server.completed-queries.written from 
impala-ec2-centos79-m6i-4xlarge-xldisk-0f51.vpc.cloudera.com:25000
-- 2024-09-03 11:51:11,062 INFO     MainThread: Waiting for metric value 
'impala-server.completed-queries.written'=10. Current value: 9
-- 2024-09-03 11:51:11,062 INFO     MainThread: Sleeping 1s before next retry.
-- 2024-09-03 11:51:12,063 INFO     MainThread: Metric 
impala-server.completed-queries.written did not reach value 10 in 10s. 
Failing...{code}
Logs attached.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to