Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-14644

Errors when running must-gather for 4.12 Rosa/Hypershift cluster

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Done
    • Icon: Undefined Undefined
    • None
    • 4.12.z, 4.12
    • oc

      Description of problem:

      Trying to run must-gather in production, I encountered these errors:
      
      Error running must-gather collection:
          gather did not start for pod must-gather-sbrwj: timed out waiting for the condition
      Falling back to `oc adm inspect clusteroperators.v1.config.openshift.io` to collect basic cluster information.
      {"component":"entrypoint","file":"k8s.io/test-infra/prow/entrypoint/run.go:164","func":"k8s.io/test-infra/prow/entrypoint.Options.ExecuteProcess","level":"error","msg":"Process did not finish before 15m30s timeout","severity":"error","time":"2023-06-06T17:42:20Z"}
      error running backup collection: errors occurred while gathering data:
          [skipping gathering secrets/support due to error: secrets "support" not found, skipping gathering sharedconfigmaps.sharedresource.openshift.io due to error: the server doesn't have a resource type "sharedconfigmaps", skipping gathering sharedsecrets.sharedresource.openshift.io due to error: the server doesn't have a resource type "sharedsecrets"]
      error: gather did not start for pod must-gather-sbrwj: timed out waiting for the condition
      {"component":"entrypoint","file":"k8s.io/test-infra/prow/entrypoint/run.go:251","func":"k8s.io/test-infra/prow/entrypoint.gracefullyTerminate","level":"error","msg":"Process gracefully exited before 15s grace period","severity":"error","time":"2023-06-06T17:42:25Z"}
      {"component":"entrypoint","error":"process timed out","file":"k8s.io/test-infra/prow/entrypoint/run.go:79","func":"k8s.io/test-infra/prow/entrypoint.Options.Run","level":"error","msg":"Error executing test process","severity":"error","time":"2023-06-06T17:42:25Z"}
      error: failed to execute wrapped command: exit status 127 
      INFO[2023-06-06T17:42:26Z] Step XXXXXX-perfscale-ci-tests-rosa-hypershift-cluster-density-v2-gather-must-gather failed after 15m43s. 
      INFO[2023-06-06T17:42:26Z] Running step XXXXXX-perfscale-ci-tests-rosa-hypershift-cluster-density-v2-gather-extra. 
      
      followed by.. 
      
      error: the server doesn't have a resource type "machineconfigpools"
      ...
      error: the server doesn't have a resource type "machineconfigs"
      ...
      INFO: gathering the audit logs for each master
      error: the server doesn't have a resource type "machineconfigs"
      error: the server doesn't have a resource type "machinesets"
      error: the server doesn't have a resource type "machines"
      error: the server doesn't have a resource type "machineconfigpools"
      error: the server doesn't have a resource type "machines"
      error: the server doesn't have a resource type "machinesets"
      error: the server doesn't have a resource type "controlplanemachinesets"
      error: the server doesn't have a resource type "controlplanemachinesets"
      /logs/artifacts/network/multus_logs /

      Version-Release number of selected component (if applicable):

      4.12

      How reproducible:

      Always

      Steps to Reproduce:

      1. Run must-gather on a Rosa/Hypershift cluster 
      2. See the errors reported
      3.
      

      Actual results:

      Must-gather errors out.

      Expected results:

      Must-gather should run successfully.

      Additional info:

      Ran into issue when testing in production, but applies to all 4.12 

            jchaloup@redhat.com Jan Chaloupka
            svetsa@redhat.com Sharada Vetsa
            ying zhou ying zhou
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

              Created:
              Updated:
              Resolved: