Ronald Macmaster created HBASE-29377:
----------------------------------------

             Summary: ChaosService: Doesn't Support ZK Quorum Strings with Ports
                 Key: HBASE-29377
                 URL: https://issues.apache.org/jira/browse/HBASE-29377
             Project: HBase
          Issue Type: Bug
          Components: integration tests
    Affects Versions: 2.6.2, 2.5.11
            Reporter: Ronald Macmaster


The ChaosAgent doesn't support ZooKeeper quorum strings that specify custom ports.

Example:
{code:java}
# hbase.zookeeper.quorum
localhost:2181,localhost:2182,localhost:2183{code}
Starting the chaosagent service with this configuration produces the log messages below. The ChaosAgent is set up with a modified ZooKeeper quorum string that always appends hbase.zookeeper.property.clientPort (2181) to each quorum server, even though the original quorum property already specifies a port.
{code:java}
bin/hbase chaosagent -c start

2025-06-04T19:27:39,932 INFO [main] zookeeper.ZooKeeper: Initiating client connection, connectString=localhost:2181:2181,localhost:2182:2181,localhost:2183:2181 sessionTimeout=600000 watcher=org.apache.hadoop.hbase.chaos.ChaosAgent@67389cb8
2025-06-04T21:42:38,698 INFO [main-EventThread] chaos.ChaosAgent: Processing event: WatchedEvent state:Closed type:None path:null
2025-06-04T21:42:38,697 ERROR [main-SendThread()] client.StaticHostProvider: Unable to resolve address: localhost:2181/<unresolved>:2181
java.net.UnknownHostException: localhost:2181: invalid IPv6 address literal
    at java.net.InetAddress.invalidIPv6LiteralException(InetAddress.java:1693) ~[?:?]
    at java.net.InetAddress.getAllByName(InetAddress.java:1663) ~[?:?]
    at org.apache.zookeeper.client.StaticHostProvider$1.getAllByName(StaticHostProvider.java:88) ~[zookeeper-3.8.4.jar:3.8.4]
    at org.apache.zookeeper.client.StaticHostProvider.resolve(StaticHostProvider.java:141) ~[zookeeper-3.8.4.jar:3.8.4]
    at org.apache.zookeeper.client.StaticHostProvider.next(StaticHostProvider.java:368) ~[zookeeper-3.8.4.jar:3.8.4]
    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1204) ~[zookeeper-3.8.4.jar:3.8.4]{code}
The behavior comes from the [getZKQuorum() method in the ChaosUtils class|https://github.com/apache/hbase/blob/d93102ccf631f11028d466acaa1ea3862c185b44/hbase-it/src/main/java/org/apache/hadoop/hbase/chaos/ChaosUtils.java#L36-L43], which is only used in the [ChaosService setup|https://github.com/apache/hbase/blob/d93102ccf631f11028d466acaa1ea3862c185b44/hbase-it/src/main/java/org/apache/hadoop/hbase/chaos/ChaosService.java#L79-L80].

This approach deviates from the convention in other areas of the codebase, which rely on the [ZKConfig helper methods|https://github.com/apache/hbase/blob/master/hbase-client/src/main/java/org/apache/hadoop/hbase/zookeeper/ReadOnlyZKClient.java#L136-L141] to fetch quorum strings.

We should use the ZKConfig methods instead, since they are used elsewhere and [account for the case where the port is already specified|https://github.com/apache/hbase/blob/d93102ccf631f11028d466acaa1ea3862c185b44/hbase-common/src/main/java/org/apache/hadoop/hbase/zookeeper/ZKConfig.java#L165-L202] in configuration.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
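
For illustration only, the port-aware behavior this issue asks for can be sketched as follows. This is a minimal standalone sketch, not the HBase implementation: the class QuorumStringSketch and the method buildQuorum are hypothetical names, and the real fix would presumably delegate to the ZKConfig helpers linked in the description rather than reimplement the logic. It assumes any ':' in a server entry marks an explicit port, so bare IPv6 literals are not handled.

```java
import java.util.ArrayList;
import java.util.List;

public class QuorumStringSketch {

  /**
   * Hypothetical helper (not part of HBase): builds a ZooKeeper connect
   * string, appending defaultPort only to servers that lack an explicit port.
   */
  public static String buildQuorum(String quorum, int defaultPort) {
    List<String> servers = new ArrayList<>();
    for (String server : quorum.split(",")) {
      String s = server.trim();
      if (s.isEmpty()) {
        continue; // skip empty entries such as a trailing comma
      }
      // Simplifying assumption: a ':' means the port is already present.
      servers.add(s.contains(":") ? s : s + ":" + defaultPort);
    }
    return String.join(",", servers);
  }

  public static void main(String[] args) {
    // Ports already specified: the quorum string is left unchanged.
    System.out.println(buildQuorum("localhost:2181,localhost:2182,localhost:2183", 2181));
    // Mixed entries: only the host without a port receives the default.
    System.out.println(buildQuorum("zk1,zk2:2182", 2181));
  }
}
```

With the quorum string from the example above, the sketch returns the input unchanged instead of producing entries like localhost:2181:2181.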