Thanks for the tips, Joel, but no luck so far with 4.3.0-0.nightly-2019-12-13-180405.
After the following: - destroy cluster - copy backup install-config.yaml with my CA cert at additionalTrustBundle to empty osp-nightly/ dir - generate manifests `openshift-install create manifests --dir osp-nightly` - update osp-nightly/manifests/cluster-proxy-01-config.yaml setting spec/trustedCA/name=user-ca-bundle - run install `openshift-install create cluster --dir=osp-nightly --log-level=debug` I still see cert errors from machine-api controller ``` $ export KUBECONFIG=osp-nightly/auth/kubeconfig $ oc logs -c machine-controller -f -n openshift-machine-api $(oc get pods -n openshift-machine-api -l k8s-app=controller -o name) ... I1214 07:34:19.124112 1 controller.go:164] Reconciling Machine "osp-nightly-rrzv5-worker-tk495" I1214 07:34:19.124188 1 controller.go:376] Machine "osp-nightly-rrzv5-worker-tk495" in namespace "openshift-machine-api" doesn't specify "cluster.k8s.io/cluster-name" label, assuming nil cluster E1214 07:34:19.132925 1 controller.go:279] Failed to check if machine "osp-nightly-rrzv5-worker-tk495" exists: Error checking if instance exists (machine/actuator.go 346): Error getting a new instance service from the machine (machine/actuator.go 467): Create providerClient err: Post https://openstack.domain.com:13000/v3/auth/tokens: x509: certificate signed by unknown authority ```` I can confirm my cert is here: $ oc get cm user-ca-bundle -n openshift-config -o json | jq -r '.data."ca-bundle.crt"' And that the proxy received the configmap name from the custom manifest rather than default "": $ oc get proxy cluster -o json | jq .spec.trustedCA {"name": "user-ca-bundle"} I'm stuck with 3 masters and no workers while installer says: DEBUG Still waiting for the cluster to initialize: Some cluster operators are still updating: authentication, console, image-registry, ingress, monitoring I guess I'll keep watching https://bugzilla.redhat.com/show_bug.cgi?id=1769879 and https://github.com/openshift/enhancements/pull/115 and running 3.11 :) On Wed, Dec 4, 2019 at 9:29 PM Joel Pearson <japear...@agiledigital.com.au> wrote: > > > On Wed, 4 Dec 2019 at 08:02, Dale Bewley <d...@bewley.net> wrote: > >> >> On Tue, Nov 26, 2019 at 7:29 PM Joel Pearson < >> japear...@agiledigital.com.au> wrote: >> >> Thanks for taking the time to reply, Joel. >> >> >>> On Sat, 23 Nov 2019 at 13:21, Dale Bewley <d...@bewley.net> wrote: >>> >>>> Hello, >>>> I'm testing OCP 4.3 2019-11-19 nightly on OSP 13. >>>> >>>> I added my CA cert [1] to install-config.yaml [3] and the installer >>>> now progresses. I can even `oc get nodes` and see the masters. [2]. >>>> >>>> I still have the following errors and no worker nodes though. >>>> >>>> ERROR Cluster operator authentication Degraded is True with >>>> RouteStatusDegradedFailedHost: RouteStatusDegraded: route is not available >>>> at canonical host >>>> oauth-openshift.apps.osp-nightly.osp-nightly.domain.com: [] >>>> >>> >>> This sounds like ingress isn't deploying because the worker nodes are >>> not deployed or your load balancer isn't making ingress available. Are your >>> master nodes schedulable? Ie are your masters also workers? If not, then >>> ingress won't deploy. >>> >>> >> $ oc describe node osp-nightly-tfz6p-master-0 | grep -i schedul >> Taints: node-role.kubernetes.io/master:NoSchedule >> Unschedulable: false >> >> They are schedulable, but there are no matching tolerations in >> openshift-ingress/router-default deployment, so those pods are indeed stuck >> in _pending_ without any worker nodes. >> >> How is your load balancer configured for 80/443 traffic? If the masters >>> aren't targets of that, then even if ingress deploys you still won't be >>> able to use any routes >>> >>> >> >> No load balancer exists. I'm just trying to smoke test >> https://docs.openshift.com/container-platform/4.2/installing/installing_openstack/installing-openstack-installer-custom.html >> >> >>> >>>> >>>> This is likely a symptom of not yet having associated a floating IP to >>>> the app neutron port, and not having created an /etc/hosts entry on the >>>> installer host. I assume that's a nonfatal error. >>>> >>>> I assume this one is fatal, however: >>>> >>>> INFO Cluster operator image-registry Progressing is True with Error: >>>> Unable to apply resources: unable to sync storage configuration: Post >>>> https://openstack.domain.com:13000/v3/auth/tokens: x509: certificate >>>> signed by unknown authority >>>> >>> >>> Have you added the CA that covers openstack.domain.com >>> to install-config.yaml at .additionalTrustBundle like you mentioned in your >>> previous post? >>> >> >> Yep. >> >> >>> >>> Otherwise you might need to edit Proxy config and set >>> spec.trustedCA.name to user-ca-bundle >>> >>> apiVersion: config.openshift.io/v1 >>> kind: Proxy >>> metadata: >>> name: cluster >>> spec: >>> trustedCA: >>> name: user-ca-bundle >>> >>> I had to do this even though I don't have an explicit proxy. I do have a >>> transparent proxy though, which was doing MITM, essentially breaking >>> anything trying to talk to the internet. >>> >> >> Where did you make this change? >> > > I did this before installation, for convenience mostly, after running > "openshift-install create manifests --dir=ignition-files", I edited the > ignition-files/manifests/cluster-proxy-01-config.yaml file. > > Otherwise, it looks like you can do it after the fact using "oc edit > proxies cluster", then you'll need to wait for the masters to reboot I > think. Which for me sometimes takes like 10 minutes until it has done all > of them. > > FYI, I managed to find out what name to use to edit that proxy config by > running "oc api-resources --api-group=config.openshift.io" and then > finding the name for apigroup "config.openshift.io" and kind "Proxy". > > >> >> I was going to try the 12/02 4.3 nightly build, but based on the >> following 2 blockers it doesn't look like it will work: >> >> * https://bugzilla.redhat.com/show_bug.cgi?id=1769879 Machine-api cannot >> create workers on osp envs installed with self-signed certs >> > > There is a fair chance the above proxy config will fix this one > > >> * https://github.com/openshift/enhancements/pull/115 enhancements/x509-trust: >> Propose a new enhancement >> > > I triggered this whole discussion from here: > https://bugzilla.redhat.com/show_bug.cgi?id=1771564 originally, so the > above proxy config should help. > > >> >> It's disappointing that the 4.2 release notes claim that OpenStack is >> supported when it does not seem to be supported in what I presume to be the >> majority of OSP configurations. >> >> >>>> Is it safe to assume this BZ comment is related to that error? >>>> https://bugzilla.redhat.com/show_bug.cgi?id=1735192#c17 >>>> >>>> Bootstrap host has already been removed by the installer, so >>>> `openshift-install gather` does not seem usable, but the installer debug >>>> output can be found at >>>> https://paste.fedoraproject.org/paste/SzIqAMU4DWHN3Bw3WDKfTQ >>>> >>>> Any advice? >>>> >>>> Thanks! >>>> >>>> >>>> [1] >>>> https://lists.openshift.redhat.com/openshift-archives/users/2019-November/msg00073.html >>>> >>>> [2] >>>> $ export KUBECONFIG=osp-nightly/auth/kubeconfig >>>> $ oc get nodes >>>> NAME STATUS ROLES AGE VERSION >>>> osp-nightly-tfz6p-master-0 Ready master 102m v1.16.2 >>>> osp-nightly-tfz6p-master-1 Ready master 103m v1.16.2 >>>> osp-nightly-tfz6p-master-2 Ready master 103m v1.16.2 >>>> >>>> [3] install-config.yaml >>>> apiVersion: v1 >>>> baseDomain: ocp.domain.com >>>> additionalTrustBundle: | >>>> -----BEGIN CERTIFICATE----- >>>> MI... >>>> compute: >>>> - hyperthreading: Enabled >>>> name: worker >>>> platform: >>>> openstack: >>>> rootVolume: >>>> size: 10 >>>> replicas: 3 >>>> controlPlane: >>>> hyperthreading: Enabled >>>> name: master >>>> platform: {} >>>> replicas: 3 >>>> metadata: >>>> creationTimestamp: null >>>> name: osp-nightly >>>> networking: >>>> clusterNetwork: >>>> - cidr: 10.128.0.0/14 >>>> hostPrefix: 23 >>>> machineCIDR: 10.0.0.0/16 >>>> networkType: OpenShiftSDN >>>> serviceNetwork: >>>> - 172.30.0.0/16 >>>> platform: >>>> openstack: >>>> cloud: shiftstack >>>> computeFlavor: ocp4.worker.4x16 >>>> externalDNS: null >>>> externalNetwork: floating >>>> lbFloatingIP: 192.0.2.29 >>>> octaviaSupport: "0" >>>> region: "" >>>> trunkSupport: "1" >>>> publish: External >>>> pullSecret: '{"... >>>> sshKey: | >>>> ssh-rsa A... >>>> >>>> >>>> _______________________________________________ >>>> users mailing list >>>> users@lists.openshift.redhat.com >>>> http://lists.openshift.redhat.com/openshiftmm/listinfo/users >>>> >>> >>> >
_______________________________________________ users mailing list users@lists.openshift.redhat.com http://lists.openshift.redhat.com/openshiftmm/listinfo/users