Uploaded image for project: 'Project Quay'
  1. Project Quay
  2. PROJQUAY-5853

Volume mount meet Multi-Attach error durning Quay operator upgrade

XMLWordPrintable

    • Approved

      Description:

      Durning Quay 3.9 Operator upgrade, pod quayregistry-clair-postgres-upgrade init error with 

      "Multi-Attach error for volume "pvc-38b6a080-2102-4305-8f19-9e77bd1ad75d" Volume is already used by pod(s)" 

       

      Index Image:

      quay-operator-bundle-container-v3.9.0-137

      Index image v4.13: registry-proxy.engineering.redhat.com/rh-osbs/iib:538740

       

      Reproduce Steps: 

      Actually it is Same test steps with PROJQUAY-5631 refer to "More Update-> 3, A 100% reproduce approach", sometimes, it shows this volume mount error. 

       

      1, Provision OCP cluster and Quay with flex job

      2, Deploy Quay latest 3.7 Operator on OCP, create a valid quay registry 

      - kind: postgres  
         managed: true
      - kind: clairpostgres
         managed: true
      

      3, Upgrade to latest v3.9 bundle image

      4, Whether it pass or fail, then uninstall Quay registry/Quay operator, make sure all pv/pvc released

      5,  Create a new project, repeat step 2 and step 3. The intention is to utilize the OCP cluster to test a different version Quay upgrade 

      6, Now Check upgrade status 

       

      Actual Result:

      [ec2-user@ip-10-0-12-54 ~]$ oc get pod
      NAME                        READY  STATUS    RESTARTS    AGE
      quay-operator.v3.9.0-645db47fd4-b8hnr       1/1   Running    0        38s
      quayregistry-clair-postgres-64d8c4b85-x2c94    1/1   Terminating  1 (4m58s ago)  5m28s
      quayregistry-clair-postgres-upgrade-v24lz     0/1   Init:0/1   0        20s
      
      [ec2-user@ip-10-0-12-54 ~]$ oc describe pod/quayregistry-clair-postgres-upgrade-v24lz 
      Events:
        Type     Reason                  Age                   From                     Message
        ----     ------                  ----                  ----                     -------
        Normal   Scheduled               15m                   default-scheduler        Successfully assigned quay-enterprise-14080/quayregistry-clair-postgres-upgrade-v24lz to ip-10-0-222-246.us-east-2.compute.internal
        Warning  FailedAttachVolume      15m                   attachdetach-controller  Multi-Attach error for volume "pvc-38b6a080-2102-4305-8f19-9e77bd1ad75d" Volume is already used by pod(s) quayregistry-clair-postgres-64d8c4b85-x2c94
        Normal   SuccessfulAttachVolume  15m                   attachdetach-controller  AttachVolume.Attach succeeded for volume "pvc-5900ad31-5654-4d94-bc29-c8a521282942"
        Warning  FailedMount             13m                   kubelet                  Unable to attach or mount volumes: unmounted volumes=[postgres-data], unattached volumes=[migration-data kube-api-access-tp7nm clair-postgres-conf-sample postgres-data]: timed out waiting for the condition
        Normal   SuccessfulAttachVolume  12m                   attachdetach-controller  AttachVolume.Attach succeeded for volume "pvc-38b6a080-2102-4305-8f19-9e77bd1ad75d"
        Normal   AddedInterface          12m                   multus                   Add eth0 [10.130.2.10/23] from openshift-sdn
        Normal   Pulled                  7m41s (x5 over 12m)   kubelet                  Container image "registry.redhat.io/rhscl/postgresql-12-rhel7@sha256:68b969143f4c638098a13c2db8693b60339122ef004f28875ebc4a4afc23035d" already present on machine
        Normal   Created                 7m41s (x5 over 12m)   kubelet                  Created container postgres-old
        Normal   Started                 7m41s (x5 over 12m)   kubelet                  Started container postgres-old
        Warning  BackOff                 3m11s (x20 over 11m)  kubelet                  Back-off restarting failed container postgres-old in pod quayregistry-clair-postgres-upgrade-v24lz_quay-enterprise-14080(61aa21b2-fe72-4ed9-a47b-aadedc638eac)
      [ec2-user@ip-10-0-12-54 ~]$   

      It shows resource conflict durning  pod quayregistry-clair-postgres-64d8c4b85-x2c94 terminate, but after this pod been terminated, this error still there, pls review full log attached.

       

        1. Downstream139ocp414quay38to39upgrade.log
          64 kB
          Sean Zhao
        2. quay38upgrade.log
          20 kB
          Sean Zhao
        3. upgradeoncadidate39.log
          38 kB
          Sean Zhao

              Unassigned Unassigned
              szhao@redhat.com Sean Zhao
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated:
                Resolved: