Dear all,

I am also seeing a similar error with OOM Casablanca:

ubuntu@rancher:~/oom/kubernetes$ kubectl get pods --all-namespaces -o=wide
No resources found.
Error from server: client: etcd cluster is unavailable or misconfigured; error #0: client: etcd member https://etcd.kubernetes.rancher.internal:2379 has no leader

The very first time I deployed Casablanca it went fine. Then, because of some
issues while trying new helm commands and plugins, I decided to do a completely
new install from scratch (ubuntu-server and so on).
On the second and third attempts I hit the error above (the 2nd time the master
node crashed, the 3rd time the slave node crashed). In every case I did a full
installation from scratch, i.e., ubuntu-server on the racks and then the OOM
guide. I have 2 physical nodes for K8s (64 GB and 32 GB of RAM). The crash
happens early on: after 30 min to 1 h of waiting for the ONAP components to
reach the Running state, one node crashes and becomes almost unresponsive. In
theory, it is possible to delete/add and recover a node in a cluster:

https://rancher.com/docs/rancher/v1.6/en/kubernetes/disaster-recovery/

However, the crashed node barely responds to commands, so there is no way to do
that (I can't ssh in, and everything is extremely slow).
This looks like a real issue. I am considering using a single node for K8s, but
I am worried about this behavior.
I thought it might be a RAM issue (not enough RAM), but I am not deploying all
ONAP components. I tried to deploy the same ONAP components as in Beijing,
where I had no issue.
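
One way to test the RAM hypothesis while a node still responds is to watch its
available memory before it degrades. The following is a minimal Python sketch,
assuming a Linux node that exposes MemTotal and MemAvailable in /proc/meminfo.
The function names and the 10% threshold are illustrative assumptions, not part
of OOM, Rancher, or Kubernetes.

```python
def parse_meminfo(text):
    """Parse /proc/meminfo-style text into a dict of integer kB values."""
    values = {}
    for line in text.splitlines():
        key, sep, rest = line.partition(":")
        if not sep:
            continue  # skip lines that are not "Key: value" pairs
        fields = rest.split()
        if fields and fields[0].isdigit():
            values[key.strip()] = int(fields[0])
    return values


def available_fraction(text):
    """Return MemAvailable / MemTotal for the given meminfo text."""
    info = parse_meminfo(text)
    return info["MemAvailable"] / info["MemTotal"]


def is_low_on_memory(text, threshold=0.10):
    """True if available memory is below the threshold (assumed: 10%)."""
    return available_fraction(text) < threshold


if __name__ == "__main__":
    # Read the live values on the node being checked.
    with open("/proc/meminfo") as f:
        meminfo = f.read()
    print("available fraction: %.2f" % available_fraction(meminfo))
    print("low on memory:", is_low_on_memory(meminfo))
```

Running this periodically on both nodes while the ONAP pods start would show
whether available memory drops steadily before a node becomes unresponsive.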

Can someone help?

KR,

Xoan
