-
Bug
-
Resolution: Not a Bug
-
Undefined
-
None
-
4.12
-
Low
-
None
-
False
-
-
Description of problem:
Invalid extra manifest only caught by the agent installer timeout
Version-Release number of selected component (if applicable):
How reproducible:
100%
Steps to Reproduce:
1. Install openshift using Agent Installer with OpenShift manifest directory, consume into iso, connect to BM and boot 2. During install get "failed to create some manifests: ..." 3. This keep repeating until openshift-install agent wait-for bootstrap-complete will time out in 60 minutes, openshift-install agent wait-for install-complete should time out in 90 minutes [core@sno-2 ~]$ journalctl -b -f -u release-image.service -u bootkube.service Jan 23 20:03:30 sno-2 bootkube.sh[13933]: [#3485] failed to create some manifests: Jan 23 20:03:30 sno-2 bootkube.sh[13933]: "lvms-operatorgroup-0.yml": failed to get operatorgroups.v1.operators.coreos.com/ -n openshift-storage: name is required ^^ In other test I see the iteration of 14K attempts before hour install failed. openshift/ ├── 00-disable-operator-hub.yaml ├── 00-kni-lvms.yaml ├── 01-lvms-namespace.yml ├── 02-lvms-operatorgroup.yml <-- offending yml └── 03-lvms-subscriptions.ym cat 00-kni-lvms.yaml apiVersion: operators.coreos.com/v1alpha1 kind: CatalogSource metadata: name: kni-lvms namespace: openshift-marketplace annotations: org.image: registry.redhat.io/redhat/redhat-operator-index:v4.12 (lvms-operator) spec: sourceType: grpc image: registry.kni-qe-31.lab.eng.rdu2.redhat.com:5000/olm/kni-lvms@sha256:7b5606311ec4a9fef2ad7aec06705a4b6ceccf23c0cbb5dcc22b38e3e8ab1ba5 displayName: kni-lvms-disconnected publisher: Red Hat ------ cat 00-kni-lvms.yaml === This is the offending yml, specifically the generateName== apiVersion: operators.coreos.com/v1 kind: OperatorGroup metadata: annotations: olm.providedAPIs: LVMCluster.v1alpha1.lvm.topolvm.io,LVMVolumeGroup.v1alpha1.lvm.topolvm.io,LVMVolumeGroupNodeStatus.v1alpha1.lvm.topolvm.io,LogicalVolume.v1.topolvm.io generateName: openshift-storage- namespace: openshift-storage spec: targetNamespaces: - openshift-storage upgradeStrategy: Default ===This one below is good working yml === apiVersion: operators.coreos.com/v1alpha1 kind: CatalogSource metadata: name: kni-lvms namespace: openshift-marketplace annotations: org.image: registry.redhat.io/redhat/redhat-operator-index:v4.12 (lvms-operator) spec: sourceType: grpc image: registry.kni-qe-31.lab.eng.rdu2.redhat.com:5000/olm/kni-lvms@sha256:7b5606311ec4a9fef2ad7aec06705a4b6ceccf23c0cbb5dcc22b38e3e8ab1ba5 displayName: kni-lvms-disconnected publisher: Red Hat [kni@registry.kni-qe-31 openshift]$ cat 01-lvms-namespace.yml apiVersion: v1 kind: Namespace metadata: labels: openshift.io/cluster-monitoring: "true" name: openshift-storage ------ cat 03-lvms-subscriptions.yml apiVersion: operators.coreos.com/v1alpha1 kind: Subscription metadata: labels: operators.coreos.com/lvms-operator.openshift-storage: "" name: lvms-operator namespace: openshift-storage spec: channel: stable-4.12 installPlanApproval: Automatic name: lvms-operator source: kni-lvms sourceNamespace: openshift-marketplace startingCSV: lvms-operator.v4.12.0
Actual results:
repetitive failure over 14K attempts before Agent Installer timeout at 60 minutes
Expected results:
Error the invalid manifest and continue
Additional info:
- relates to
-
CORS-2522 Improve bootstrapping/installation progress & error reporting
- To Do