Hi,
I have two Karaf instances in a Cellar cluster setup, and both have Cellar
installed.
One node is a management node and the other is a worker node. The management
and worker nodes are not supposed to share any configuration. However, there
could be multiple worker nodes, and those might share configuration with each
other (based on their group membership).
Currently, I have isolated the management node in a separate "management"
group. I started this instance first, created the group, and joined the
management node to it. Additionally, I had to add
`org.ops4j.pax.transx.tm.geronimo` to the value of
`config.excluded.properties` in `etc/org.apache.karaf.cellar.node.cfg`, because
I was seeing a transaction lock issue. Because of it, the Transaction service
never started, which in turn affected the dependent services and broke both
nodes.
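For clarity, the change looks roughly like the sketch below. `<existing entries>` is only a placeholder for whatever values were already listed in the file (not reproduced here), and the comma-separated list format is an assumption about how this property is written:

```properties
# etc/org.apache.karaf.cellar.node.cfg
# <existing entries> is a placeholder for the values already present;
# only the last entry is the one added. Comma-separated format is assumed.
config.excluded.properties = <existing entries>, org.ops4j.pax.transx.tm.geronimo
```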
Then, from the other instance, I tried to join an existing group and got the
following error (my steps are included as well):
karaf@root()> cluster:group-list
   | Group      | Members
---+------------+----------------------
 x | default    | 192.168.1.26:5702(x)
   | TST        |
   | test       |
   | management | 192.168.1.26:5701
karaf@root()> cluster:group-join TST
No result received within given timeout
karaf@root()> log:tail
10:35:17.609 ERROR [pool-13-thread-4] Error while dispatching task
java.lang.NullPointerException: null
    at java.util.HashSet.<init>(HashSet.java:118) ~[?:?]
    at org.apache.karaf.cellar.hazelcast.HazelcastGroupManager.registerGroup(HazelcastGroupManager.java:467) ~[67:org.apache.karaf.cellar.hazelcast:4.1.2]
    at org.apache.karaf.cellar.core.control.ManageGroupCommandHandler.joinGroup(ManageGroupCommandHandler.java:91) ~[65:org.apache.karaf.cellar.core:4.1.2]
    at org.apache.karaf.cellar.core.control.ManageGroupCommandHandler.execute(ManageGroupCommandHandler.java:41) ~[65:org.apache.karaf.cellar.core:4.1.2]
    at org.apache.karaf.cellar.core.control.ManageGroupCommandHandler.execute(ManageGroupCommandHandler.java:27) ~[65:org.apache.karaf.cellar.core:4.1.2]
    at org.apache.karaf.cellar.core.command.CommandHandler.handle(CommandHandler.java:40) ~[65:org.apache.karaf.cellar.core:4.1.2]
    at org.apache.karaf.cellar.core.command.CommandHandler.handle(CommandHandler.java:28) ~[65:org.apache.karaf.cellar.core:4.1.2]
    at org.apache.karaf.cellar.core.event.EventDispatchTask.run(EventDispatchTask.java:67) [65:org.apache.karaf.cellar.core:4.1.2]
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [?:?]
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [?:?]
    at java.lang.Thread.run(Thread.java:745) [?:?]
Now my management node cannot join any group either; it fails with the same
error.
I am out of ideas on this issue.
Any help would be much appreciated.
Thanks,
Cooshal.