[ 
https://issues.apache.org/jira/browse/IMPALA-12556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17785348#comment-17785348
 ] 

Wenzhe Zhou commented on IMPALA-12556:
--------------------------------------

Thanks [~laszlog] to file this Jira. I will work on it.

Catalogd log messages.
{code:java}
F1111 23:36:24.798915 2270244 statestore-subscriber.cc:336] Check failed: 
!statestore_is_active || !statestore2_is_active 
*** Check failure stack trace: ***
    @          0x389f66d  google::LogMessage::Fail()
    @          0x38a15a4  google::LogMessage::SendToLog()
    @          0x389f04c  google::LogMessage::Flush()

I1111 23:36:24.834599 2270278 statestore-subscriber.cc:743] Starting statestore 
subscriber
I1111 23:36:24.835072 2272522 TAcceptQueueServer.cpp:355] New connection to 
server StatestoreSubscriber from client <Host: 127.0.0.1 Port: 48662>
I1111 23:36:24.835194 2272522 TAcceptQueueServer.cpp:355] New connection to 
server StatestoreSubscriber from client <Host: 127.0.0.1 Port: 48668>
I1111 23:36:24.835242 2272522 TAcceptQueueServer.cpp:355] New connection to 
server StatestoreSubscriber from client <Host: 127.0.0.1 Port: 48674>
I1111 23:36:24.835269 2270278 statestore-subscriber.cc:684] Statestore protocol 
version: V2
I1111 23:36:24.835278 2270278 statestore-subscriber.cc:693] Subscriber 
registration ID: b7427b8c816720ad:597a82d4368d7792
I1111 23:36:24.835280 2270278 statestore-subscriber.cc:701] Statestore ID: 
4b4c5e6402fee001:ddd1f305755eb79f
I1111 23:36:24.835281 2270278 statestore-subscriber.cc:719] Active catalogd 
address: ip-172-31-52-128:26000
I1111 23:36:24.835283 2270278 statestore-subscriber.cc:753] statestore 
registration successful on startup
I1111 23:36:24.835287 2270278 exec-env.cc:715] The address of Catalog service 
is changed from localhost:26000 to ip-172-31-52-128:26000
I1111 23:36:24.835299 2272522 TAcceptQueueServer.cpp:355] New connection to 
server StatestoreSubscriber from client <Host: 127.0.0.1 Port: 48676>
I1111 23:36:24.835448 2270278 statestore-subscriber.cc:743] Starting statestore 
subscriber
I1111 23:36:24.835820 2270278 statestore-subscriber.cc:684] Statestore protocol 
version: V2
I1111 23:36:24.835829 2270278 statestore-subscriber.cc:693] Subscriber 
registration ID: ae4e3d7426091e84:2d5e2ba69055ca89
I1111 23:36:24.835831 2270278 statestore-subscriber.cc:701] Statestore ID: 
6d493a7087e09773:97eb4a1e889718b7
I1111 23:36:24.835834 2270278 statestore-subscriber.cc:719] Active catalogd 
address: ip-172-31-52-128:26000
I1111 23:36:24.835836 2270278 statestore-subscriber.cc:753] statestore 
registration successful on startup
I1111 23:36:24.835909 2272522 TAcceptQueueServer.cpp:355] New connection to 
server StatestoreSubscriber from client <Host: 127.0.0.1 Port: 48682>
F1111 23:36:24.835925 2270278 statestore-subscriber.cc:336] Check failed: 
!statestore_is_active || !statestore2_is_active 

    @          0x38a1ac9  google::LogMessageFatal::~LogMessageFatal()
    @          0x1811a13  impala::StatestoreSubscriber::Start()
    @           0xfb419a  impala::CatalogServer::Start()
    @           0xeffb37  CatalogdMain()
    @           0xefe4bf  main
    @     0x7f2a2c2ec083  __libc_start_main
    @           0xefe2fe  _start
Minidump with no thread info available.
{code}

impalad log messages:
{code:java}
F1111 23:36:24.835925 2270278 statestore-subscriber.cc:336] Check failed: 
!statestore_is_active || !statestore2_is_active 
*** Check failure stack trace: ***
    @          0x389f66d  google::LogMessage::Fail()
    @          0x38a15a4  google::LogMessage::SendToLog()
    @          0x389f04c  google::LogMessage::Flush()
    @          0x38a1ac9  google::LogMessageFatal::~LogMessageFatal()
    @          0x1811a13  impala::StatestoreSubscriber::Start()
    @          0x1391180  impala::ExecEnv::StartStatestoreSubscriberService()
    @          0x1625138  impala::ImpalaServer::Start()
    @          0x1609175  ImpaladMain()
    @           0xefe49c  main
    @     0x7f77fc990083  __libc_start_main
    @           0xefe2fe  _start
Minidump with no thread info available.
{code}

statestored log messages:

{code:java}
I1111 23:36:22.327819 2270287 TAcceptQueueServer.cpp:355] New connection to 
server StatestoreHaService from client <Host: 127.0.0.1 Port: 36042>
I1111 23:36:22.327908 2270228 statestore.cc:1878] Send Statestore HA handshake 
request
I1111 23:36:22.327960 2270287 TAcceptQueueServer.cpp:355] New connection to 
server StatestoreHaService from client <Host: 127.0.0.1 Port: 36050>
I1111 23:36:22.328279 2270299 statestore.cc:1910] Receive Statestore HA 
handshake request
I1111 23:36:22.328285 2270299 statestore.cc:1917] Set the statestored as 
standby since the peer is started with force active flag
I1111 23:36:22.328310 2270228 statestore.cc:1851] Receive Statestore HA 
handshake response, set the statestore as active
I1111 23:36:22.328430 2270302 TAcceptQueueServer.cpp:329] 
connection_setup_thread_pool_size is set to 2
I1111 23:36:22.328433 2270228 thrift-server.cc:419] ThriftServer 
'StatestoreService' started on port: 24001
I1111 23:36:22.427978 2270296 statestore.cc:1640] Send notification for active 
statestored version: 1699745782328309
I1111 23:36:22.427994 2270296 statestore.cc:1714] Successfully sent 
UpdateStatestoredRole RPCs to all subscribers
{code}


> test_two_statestored_with_force_active fails or flaky
> -----------------------------------------------------
>
>                 Key: IMPALA-12556
>                 URL: https://issues.apache.org/jira/browse/IMPALA-12556
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Distributed Exec
>    Affects Versions: Impala 4.4.0
>            Reporter: Laszlo Gaal
>            Assignee: Wenzhe Zhou
>            Priority: Blocker
>
> custom_cluster.test_statestored_ha.TestStatestoredHA.test_two_statestored_with_force_active
>  failed in a precommit run.
> Symptom:
> {code}
> common/custom_cluster_test_suite.py:208: in setup_method
>     self._start_impala_cluster(cluster_args, **kwargs)
> common/custom_cluster_test_suite.py:330: in _start_impala_cluster
>     check_call(cmd + options, close_fds=True)
> ../toolchain/toolchain-packages-gcc10.4.0/python-2.7.16/lib/python2.7/subprocess.py:190:
>  in check_call
>     raise CalledProcessError(retcode, cmd)
> E   CalledProcessError: Command 
> '['/home/ubuntu/Impala/bin/start-impala-cluster.py', 
> '--state_store_args=--statestore_update_frequency_ms=50     
> --statestore_priority_update_frequency_ms=50     
> --statestore_heartbeat_frequency_ms=50', '--cluster_size=3', 
> '--num_coordinators=3', 
> '--log_dir=/home/ubuntu/Impala/logs/custom_cluster_tests', '--log_level=1', 
> '--state_store_args=--statestore_force_active=true ', 
> '--enable_statestored_ha', '--impalad_args=--default_query_options=']' 
> returned non-zero exit status 1
> {code}
> The test dies with a FATAL log entry in catalogd's log:
> {code}
> DCHECK found in log file: 
> /home/ubuntu/Impala/logs/custom_cluster_tests/catalogd.FATAL
> {code}
> {code}
> Log file created at: 2023/11/11 23:36:24
> Running on machine: ip-172-31-52-128
> Log line format: [IWEF]mmdd hh:mm:ss.uuuuuu threadid file:line] msg
> F1111 23:36:24.798915 2270244 statestore-subscriber.cc:336] Check failed: 
> !statestore_is_active || !statestore2_is_active 
> {code}
> Offending precommit run: 
> https://jenkins.impala.io/job/ubuntu-20.04-from-scratch/874/ (preserved).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to