Dear all, I am experiencing a similar error too with OOM Casablanca:
ubuntu@rancher:~/oom/kubernetes$ kubectl get pods --all-namespaces -o=wide No resources found. Error from server: client: etcd cluster is unavailable or misconfigured; error #0: client: etcd member https://etcd.kubernetes.rancher.internal:2379 has no leader The very first time I deployed Casablanca it went fine. Then, due to some issues trying new helm commands and plugins I decided to make a complete new install (from scratch, i.e., ubuntu-server and so on). Then, the second time and third time (2nd time master node crashed, 3rd time slave node crashed) I had this issue above and I did a full installation from scratch in all cases, i.e., ubuntu-server on racks and then OOM guide. I have 2 physical nodes for K8s (64 + 32 of RAM). This happens almost at the beginning, i.e., after 30min-1h waiting for ONAP components to reach running state, one node crushes (and it is almost unresponsive...). In theory, it is possible to delete/add and recover a node in a cluster: https://rancher.com/docs/rancher/v1.6/en/kubernetes/disaster-recovery/ However, the crashed node is almost unresponsive to commands so...no way to do that (can't ssh, anything "hyper-mega slow"). It seems an issue...thinking to use a single node for K8s...worried about this behavior. I thought it might be a RAM issue (not enough RAM) but I don't deploy all ONAP components. I tried to deploy the same ONAP components as in Beijing (no issue there). Can someone help? KR, Xoan -=-=-=-=-=-=-=-=-=-=-=- Links: You receive all messages sent to this group. View/Reply Online (#14321): https://lists.onap.org/g/onap-discuss/message/14321 Mute This Topic: https://lists.onap.org/mt/27404722/21656 Group Owner: onap-discuss+ow...@lists.onap.org Unsubscribe: https://lists.onap.org/g/onap-discuss/unsub [arch...@mail-archive.com] -=-=-=-=-=-=-=-=-=-=-=-