[ 
https://issues.apache.org/jira/browse/CURATOR-723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amr Elazhary updated CURATOR-723:
---------------------------------
    Attachment: Status Change Logs Anonymized.csv

>
>                 Key: CURATOR-723
>                 URL: https://issues.apache.org/jira/browse/CURATOR-723
>             Project: Apache Curator
>          Issue Type: Bug
>          Components: Framework
>    Affects Versions: 4.3.0
>            Reporter: Amr Elazhary
>            Priority: Major
>         Attachments: Status Change Logs Anonymized.csv
>
>
> We have a JAR application that uses "curator-framework-4.3.0" and
> "zookeeper-3.5.7" on its client servers. We are aware of the following two
> bugs:
> * CURATOR-525: There is a race condition in Curator which might lead to fake
> SUSPENDED event and ruin CuratorFrameworkImpl inner state (state bug)
> * CURATOR-526: Error logged for valid config - "Invalid config event
> received: \{properties}" (logging bug)
> Both bugs were fixed in 5.0.0/5.2.0.
>  
> Our application produces the logs below (probably a combination of these
> two bugs). It ends up consuming a lot of CPU and then fails, due to
> connections being opened and closed:
> {noformat}
> 2024-10-18T06:35:27.855+11:00 -    o.a.c.f.state.ConnectionStateManager - State change: SUSPENDED
> 2024-10-18T06:35:27.855+11:00 -  - New connection state : SUSPENDED
> 2024-10-18T06:35:27.855+11:00 -  - Connection to server has been suspended..
> 2024-10-18T06:35:28.638+11:00 -    org.apache.curator.ConnectionState - Negotiated session timeout: 10000
> 2024-10-18T06:35:28.639+11:00 - o.a.c.f.state.ConnectionStateManager - State change: RECONNECTED
> 2024-10-18T06:35:28.639+11:00 -    o.a.c.f.imps.CuratorFrameworkImpl - Clearing sleep for 0 operations
> 2024-10-18T06:35:28.639+11:00 -  - New connection state : RECONNECTED
> 2024-10-18T06:35:28.639+11:00 -  - Connection to server has been unsuspended...
> 2024-10-18T06:35:28.640+11:00 - o.a.c.framework.imps.EnsembleTracker - New config event received: {}
> 2024-10-18T06:35:28.640+11:00 -  - Reconnect worker starting
> 2024-10-18T06:35:28.640+11:00 -  - Attempting to reconnect to the Herd
> 2024-10-18T06:35:28.640+11:00 -  - Waiting for connection to server
> 2024-10-18T06:35:28.640+11:00 -  - Connected to server
> 2024-10-18T06:35:28.640+11:00 -  - Reconnect worker completed
> 2024-10-18T06:35:28.647+11:00 - o.a.c.framework.imps.EnsembleTracker - Invalid config event received: {}
> {noformat}
>  
> +*Our Questions*+
> 1. For the logging bug "Invalid config event received", we need to upgrade
> Curator as per https://issues.apache.org/jira/browse/CURATOR-526. Please
> confirm.
> 2. For the reconnections, we suspect that they are related to
> https://issues.apache.org/jira/browse/CURATOR-525, but we need to confirm
> this. How can we do so? Also, if our assumption is correct, what triggers
> this bug? We have other JAR applications using the same Curator/ZooKeeper
> versions that do not face it. Please advise on the trigger and on why it
> happens in some applications and not in others. If this is the root cause,
> is there any solution other than upgrading the Curator version?
> 3. Can we upgrade Curator to version 5.2.0 without upgrading ZooKeeper,
> given that we use "zookeeper-3.5.7"?
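
To compare the reconnect churn between the affected application and the unaffected ones, the SUSPENDED/RECONNECTED cycles in the logs can be measured. The following is a minimal sketch, not a way to confirm CURATOR-525 itself. It assumes one log entry per line, with the timestamp and ConnectionStateManager message format seen in the excerpt above; the function name `suspension_durations` and the regular expression are illustrative choices, not part of Curator.

```python
import re
from datetime import datetime

# Assumed line shape, taken from the log excerpt in this issue:
# 2024-10-18T06:35:27.855+11:00 - o.a.c.f.state.ConnectionStateManager - State change: SUSPENDED
STATE_CHANGE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}\.\d{3}[+-]\d{2}:\d{2})"
    r".*ConnectionStateManager - State change: (?P<state>[A-Z_]+)"
)


def suspension_durations(lines):
    """Return the seconds between each SUSPENDED and the next RECONNECTED."""
    durations = []
    suspended_at = None
    for line in lines:
        match = STATE_CHANGE.match(line.strip())
        if not match:
            continue  # not a ConnectionStateManager state-change entry
        timestamp = datetime.fromisoformat(match.group("ts"))
        state = match.group("state")
        if state == "SUSPENDED":
            suspended_at = timestamp
        elif state == "RECONNECTED" and suspended_at is not None:
            durations.append((timestamp - suspended_at).total_seconds())
            suspended_at = None
    return durations


# Sample entries copied from the excerpt (joined to one entry per line).
sample = [
    "2024-10-18T06:35:27.855+11:00 - o.a.c.f.state.ConnectionStateManager - State change: SUSPENDED",
    "2024-10-18T06:35:28.638+11:00 - org.apache.curator.ConnectionState - Negotiated session timeout: 10000",
    "2024-10-18T06:35:28.639+11:00 - o.a.c.f.state.ConnectionStateManager - State change: RECONNECTED",
]

if __name__ == "__main__":
    print(suspension_durations(sample))
```

Running the same function over the logs of an unaffected application gives a baseline: a much higher count of short cycles in the affected application would be consistent with connection flapping, though it does not by itself identify the cause.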



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
