Hi,
I have installed clearwater manually using single node. I also scale up
using 2 more node for sprout 1 for vellum and dime. After some time my ip
is lost and i reset the ip in local_config file restart the services and the*
scale up node is no more (they are deleted)*


*Please reply, i want to fix it. I am waiting for your reply,, thanks in
advance*

*i have doubt that the scale up node is not remove properly. I just deleted
them*

I am getting the following errors in every node:



*[vellum]ubuntu@vellum:~$ cw-config download shared_config*
Unable to contact the etcd cluster.




*[bono]ubuntu@bono:~$ sudo monit summary*
[sudo] password for ubuntu:
Monit 5.18.1 uptime: 2h 38m
 Service Name                     Status                      Type
 node-bono                        Running                     System
 restund_process                  Running                     Process
 ntp_process                      Running                     Process
 clearwater_queue_manager_pro...  Running                     Process
 etcd_process                     Execution failed | Does...  Process
 clearwater_diags_monitor_pro...  Running                     Process
 clearwater_config_manager_pr...  Running                     Process
 clearwater_cluster_manager_p...  Running                     Process
 bono_process                     Running                     Process
 poll_restund                     Status ok                   Program
 monit_uptime                     Status ok                   Program
 clearwater_queue_manager_uptime  Status ok                   Program
* etcd_uptime                      Wait parent                 Program*
* poll_etcd_cluster                Wait parent                 Program*
* poll_etcd                        Wait parent                 Program*
 poll_bono                        Status ok                   Program







 *[vellum]ubuntu@vellum:~$ sudo cw-check_config_sync*
[sudo] password for ubuntu:
Traceback (most recent call last):
  File 
"/usr/share/clearwater/clearwater-config-manager/scripts/check_config_sync.py",
line 29, in <module>
    result = client.get("/" + etcd_key + "/" + site + "/configuration/" +
plugin.key())
  File "/usr/share/clearwater/clearwater-config-manager/env/local/
lib/python2.7/site-packages/etcd/client.py", line 679, in get
    return self.read(key)
  File "/usr/share/clearwater/clearwater-config-manager/env/local/
lib/python2.7/site-packages/etcd/client.py", line 536, in read
    timeout=timeout)
  File "/usr/share/clearwater/clearwater-config-manager/env/local/
lib/python2.7/site-packages/etcd/client.py", line 834, in wrapper
    cause=e
etcd.EtcdConnectionFailed: Connection to etcd failed due to
MaxRetryError("HTTPConnectionPool(host='10.224.61.45', port=4000): Max
retries exceeded with url: /v2/keys/clearwater/site1/configuration/dns_json
(Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at
0x7f479c1d3910>: Failed to establish a new connection: [Errno 111]
Connection refused',))",)






*[vellum]ubuntu@vellum:~$ clearwater-etcdctl cluster-health*
cluster may be unhealthy: failed to list members
Error:  client: etcd cluster is unavailable or misconfigured; error #0:
dial tcp 10.224.61.45:4000: getsockopt: connection refused

error #0: dial tcp 10.224.61.45:4000: getsockopt: connection refused



*[vellum]ubuntu@vellum:~$ clearwater-etcdctl member list*
Error:  client: etcd cluster is unavailable or misconfigured; error #0:
dial tcp 10.224.61.45:4000: getsockopt: connection refused

error #0: dial tcp 10.224.61.45:4000: getsockopt: connection refused








*[bono]ubuntu@bono:/var/log/clearwater-etcd$ cat clearwater-etcd-initd.log*
context deadline exceeded
2018-04-19 06:25:52.190441844 etcdctl returned 1
2018-04-19 06:25:52.203217708 Joining existing cluster...
2018-04-19 06:26:21.211523618 Configured ETCDCTL_PEERS: 10.224.61.112:4000,
10.224.61.8:4000,10.224.61.109:4000,10.224.61.19:4000,10.224.61.47:4000,
10.224.61.24:4000,10.224.61.22:4000,10.224.61.115:4000,10.224.61.27:4000,
10.224.61.45:4000,
2018-04-19 06:26:21.213445229 Check cluster is healthy
2018-04-19 06:26:21.217376667 Running etcdctl cluster-health
2018-04-19 06:26:22.331785696 Found leaked etcd 2414 (correct is ) -
killing 2414
2018-04-19 06:26:22.335431738 etcdctl returned 137
2018-04-19 06:26:22.337320589 Restarting etcd clearwater-etcd
2018-04-19 06:26:22.343212899 Configured ETCDCTL_PEERS: 10.224.61.112:4000,
10.224.61.8:4000,10.224.61.109:4000,10.224.61.19:4000,10.224.61.47:4000,
10.224.61.24:4000,10.224.61.22:4000,10.224.61.115:4000,10.224.61.27:4000,
10.224.61.45:4000,
2018-04-19 06:26:22.343830818 Check for previous failed startup attempt
2018-04-19 06:26:22.344751074 Running etcdctl member list
context deadline exceeded
2018-04-19 06:26:32.357759475 etcdctl returned 1
2018-04-19 06:26:32.371886200 Joining existing cluster...
2018-04-19 06:26:37.382282245 Configured ETCDCTL_PEERS: 10.224.61.112:4000,
10.224.61.8:4000,10.224.61.109:4000,10.224.61.19:4000,10.224.61.47:4000,
10.224.61.24:4000,10.224.61.22:4000,10.224.61.115:4000,10.224.61.27:4000,
10.224.61.45:4000,
2018-04-19 06:26:37.383949088 Check cluster is healthy
2018-04-19 06:26:37.386734717 Running etcdctl cluster-health
cluster may be unhealthy: failed to list members
Error:  client: etcd cluster is unavailable or misconfigured; error #0:
client: endpoint http://10.224.61.109:4000 exceeded header timeout
; error #1: dial tcp 10.224.61.19:4000: getsockopt: connection refused
; error #2: dial tcp 10.224.61.47:4000: getsockopt: connection refused
; error #3: http: no Host in request URL
; error #4: dial tcp 10.224.61.24:4000: getsockopt: connection refused
; error #5: dial tcp 10.224.61.45:4000: getsockopt: connection refused
; error #6: dial tcp 10.224.61.8:4000: getsockopt: connection refused
; error #7: client: endpoint http://10.224.61.115:4000 exceeded header
timeout
; error #8: client: endpoint http://10.224.61.112:4000 exceeded header
timeout
; error #9: dial tcp 10.224.61.27:4000: getsockopt: connection refused
; error #10: dial tcp 10.224.61.22:4000: getsockopt: connection refused

error #0: client: endpoint http://10.224.61.109:4000 exceeded header timeout
error #1: dial tcp 10.224.61.19:4000: getsockopt: connection refused
error #2: dial tcp 10.224.61.47:4000: getsockopt: connection refused
error #3: http: no Host in request URL
error #4: dial tcp 10.224.61.24:4000: getsockopt: connection refused
error #5: dial tcp 10.224.61.45:4000: getsockopt: connection refused
error #6: dial tcp 10.224.61.8:4000: getsockopt: connection refused
error #7: client: endpoint http://10.224.61.115:4000 exceeded header timeout
error #8: client: endpoint http://10.224.61.112:4000 exceeded header timeout
error #9: dial tcp 10.224.61.27:4000: getsockopt: connection refused
error #10: dial tcp 10.224.61.22:4000: getsockopt: connection refused

2018-04-19 06:26:43.409826012 etcdctl returned 4
2018-04-19 06:26:43.412523362 Not joining an unhealthy cluster
2018-04-19 06:27:03.126255654 Restarting etcd clearwater-etcd
2018-04-19 06:27:03.133102368 Configured ETCDCTL_PEERS: 10.224.61.112:4000,
10.224.61.8:4000,10.224.61.109:4000,10.224.61.19:4000,10.224.61.47:4000,
10.224.61.24:4000,10.224.61.22:4000,10.224.61.115:4000,10.224.61.27:4000,
10.224.61.45:4000,
2018-04-19 06:27:03.133904517 Check for previous failed startup attempt
2018-04-19 06:27:03.135105646 Running etcdctl member list
context deadline exceeded
2018-04-19 06:27:13.148679161 etcdctl returned 1
2018-04-19 06:27:13.162058102 Joining existing cluster...
2018-04-19 06:27:34.170718991 Configured ETCDCTL_PEERS: 10.224.61.112:4000,
10.224.61.8:4000,10.224.61.109:4000,10.224.61.19:4000,10.224.61.47:4000,
10.224.61.24:4000,10.224.61.22:4000,10.224.61.115:4000,10.224.61.27:4000,
10.224.61.45:4000,
2018-04-19 06:27:34.172721201 Check cluster is healthy
2018-04-19 06:27:34.176441659 Running etcdctl cluster-health
cluster may be unhealthy: failed to list members
Error:  client: etcd cluster is unavailable or misconfigured; error #0:
client: endpoint http://10.224.61.109:4000 exceeded header timeout
; error #1: dial tcp 10.224.61.19:4000: getsockopt: connection refused
; error #2: client: endpoint http://10.224.61.112:4000 exceeded header
timeout
; error #3: dial tcp 10.224.61.47:4000: getsockopt: connection refused
; error #4: dial tcp 10.224.61.45:4000: getsockopt: connection refused
; error #5: dial tcp 10.224.61.8:4000: getsockopt: connection refused
; error #6: dial tcp 10.224.61.27:4000: getsockopt: connection refused
; error #7: client: endpoint http://10.224.61.115:4000 exceeded header
timeout
; error #8: dial tcp 10.224.61.22:4000: getsockopt: connection refused
; error #9: http: no Host in request URL
; error #10: dial tcp 10.224.61.24:4000: getsockopt: connection refused

error #0: client: endpoint http://10.224.61.109:4000 exceeded header timeout
error #1: dial tcp 10.224.61.19:4000: getsockopt: connection refused
error #2: client: endpoint http://10.224.61.112:4000 exceeded header timeout
error #3: dial tcp 10.224.61.47:4000: getsockopt: connection refused
error #4: dial tcp 10.224.61.45:4000: getsockopt: connection refused
error #5: dial tcp 10.224.61.8:4000: getsockopt: connection refused
error #6: dial tcp 10.224.61.27:4000: getsockopt: connection refused
error #7: client: endpoint http://10.224.61.115:4000 exceeded header timeout
error #8: dial tcp 10.224.61.22:4000: getsockopt: connection refused
error #9: http: no Host in request URL
error #10: dial tcp 10.224.61.24:4000: getsockopt: connection refused

thanks,
sunil
_______________________________________________
Clearwater mailing list
[email protected]
http://lists.projectclearwater.org/mailman/listinfo/clearwater_lists.projectclearwater.org

Reply via email to