Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-9461

ovirt-csi-driver-node and ovirt-csi-driver-controller pods in CrashLoopBackOff

XMLWordPrintable

    • None
    • Unspecified
    • If docs needed, set a value

      Description of problem:
      Fresh install of OCP 4.10.24 on RHV v4.5.0-3.el8ev (Red Hat Enterprise Linux release 8.6 (Ootpa)) installed following https://access.redhat.com/documentation/en-us/red_hat_virtualization/4.4/html-single/installing_red_hat_virtualization_as_a_standalone_manager_with_local_databases/index, storage on a shared NFS.

      Reproduced only on 4.10 (never hit this in <=4.9).

      Ensure /var/lib/libvirt/images/nfs is 777

      Version-Release number of selected component (if applicable):

      How reproducible:

      Steps to Reproduce:
      1. Install OCP4.10 with the install-config.yaml (see below)
      2. Configure the DNS Operator with DNS Forwarding (see below)
      3.

      Actual results:

      These resources never get up because apparently the csi-driver can't resolve correctly the ovirt hosted engine name:

      ~~~
      NAME READY UP-TO-DATE AVAILABLE AGE
      deployment.apps/ovirt-csi-driver-controller 0/2 2 0 15d
      deployment.apps/ovirt-csi-driver-operator 1/1 1 1 15d

      replicaset.apps/ovirt-csi-driver-controller-55687f6d98 0 0 0 2d3h
      replicaset.apps/ovirt-csi-driver-controller-5594f8f6f8 2 2 0 15d
      replicaset.apps/ovirt-csi-driver-controller-5fb9df6cf 0 0 0 2d
      replicaset.apps/ovirt-csi-driver-controller-856878949 0 0 0 2d1h
      replicaset.apps/ovirt-csi-driver-controller-8574877658 0 0 0 15d
      replicaset.apps/ovirt-csi-driver-controller-8647d785f5 0 0 0 13d
      replicaset.apps/ovirt-csi-driver-controller-ff4879f7 0 0 0 2d
      replicaset.apps/ovirt-csi-driver-operator-5f8dd8f5 0 0 0 13d
      replicaset.apps/ovirt-csi-driver-operator-6667467cb4 0 0 0 2d1h
      replicaset.apps/ovirt-csi-driver-operator-787964ff88 1 1 1 15d

      $ oc logs pod/ovirt-csi-driver-controller-5594f8f6f8-lmssh csi-driver
      I0811 14:14:59.380388 1 ovirt-csi-driver.go:43] Driver vendor csi.ovirt.org 0.1.1
      F0811 14:15:09.381693 1 ovirt-csi-driver.go:47] Failed to initialize ovirt client timeout: timeout while attempting to create connection (failed to validate the connection (Post "https://ovirt-engine.dummy.example.com/ovirt-engine/sso/oauth/token": dial tcp: lookup ovirt-engine.dummy.example.com on 10.37.195.193:53: no such host))
      ~~~

      PV Dump:
      N/A

      PVC Dump:
      N/A

      StorageClass Dump (if StorageClass used by PV/PVC):
      N/A

      Additional info:
      Similar to: https://access.redhat.com/solutions/5870461 but the solution here doesn't resolve the issue

      [1] install-config.yaml
      ~~~
      apiVersion: v1
      baseDomain: .....
      compute:

      • architecture: amd64
        hyperthreading: Enabled
        name: worker
        platform:
        ovirt:
        affinityGroupsNames: []
        cpu:
        cores: 4
        sockets: 2
        memoryMB: 16384
        osDisk:
        sizeGB: 100
        vmType: server
        replicas: 5
        controlPlane:
        architecture: amd64
        hyperthreading: Enabled
        name: master
        platform:
        ovirt:
        affinityGroupsNames: []
        cpu:
        cores: 4
        sockets: 2
        memoryMB: 16384
        osDisk:
        sizeGB: 100
        vmType: server
        replicas: 3
        metadata:
        creationTimestamp: null
        name: rocky
        networking:
        clusterNetwork:
      • cidr: 10.128.0.0/14
        hostPrefix: 23
        machineNetwork:
      • cidr: 10.0.0.0/16
        networkType: OpenShiftSDN
        serviceNetwork:
      • 172.30.0.0/16
        platform:
        ovirt:
        affinityGroups:
      • description: AffinityGroup for spreading each compute machine to a different
        host
        enforcing: true
        name: compute
        priority: 3
      • description: AffinityGroup for spreading each control plane machine to a different
        host
        enforcing: true
        name: controlplane
        priority: 5
        api_vip: 10.37.193.199
        ingress_vip: 10.37.193.200
        ovirt_cluster_id: a515872a-f631-11ec-a877-00163e1b5119
        ovirt_network_name: ovirtmgmt
        ovirt_storage_domain_id: 2aecbe4f-6f77-465b-b56a-b7ca135a9d9e
        vnicProfileID: 0000000a-000a-000a-000a-000000000398
        ~~~

      [2] DNS Operator configuration:
      ~~~
      ...
      spec:
      servers:

      • forwardPlugin:
        upstreams:
      • 10.37.193.6
        name: dummy.lab.example.com
        zones:
      • dummy.lab.example.com
        ~~~

            mskrivan@redhat.com Michal Skrivanek
            rhn-support-fsoppels Fabrizio Soppelsa
            Lucie Leistnerova Lucie Leistnerova
            Red Hat Employee
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

              Created:
              Updated:
              Resolved: