  1. OCP Technical Release Team
  2. TRT-1898

periodic-ci-openshift-release-master-ci-4.19-e2e-aws-ovn-techpreview failing the payload


    • Type: Ticket
    • Resolution: Done
    • Priority: Blocker
    • None
    • 4.19
    • False
    • None
    • False

      Beginning with 4.19.0-0.nightly-2024-11-27-025041 this job failed with a pattern I don't recognize.

      I'll note that some other AWS jobs failed on the same payload with what looked like infra issues; however, this test re-ran in full, so its timing was very different.

      Then it failed with much the same pattern on the next payload too.

      The failures are mainly on tests like these:

      [sig-instrumentation][OCPFeatureGate:MetricsCollectionProfiles] The collection profiles feature-set initially, in a homogeneous default environment, should expose default metrics [Suite:openshift/conformance/parallel]
      [sig-instrumentation][OCPFeatureGate:MetricsCollectionProfiles] The collection profiles feature-set in a heterogeneous environment, should revert to default collection profile when an empty collection profile value is specified [Suite:openshift/conformance/parallel]
      [sig-instrumentation][OCPFeatureGate:MetricsCollectionProfiles] The collection profiles feature-set in a heterogeneous environment, should expose information about the applied collection profile using meta-metrics [Suite:openshift/conformance/parallel]
      [sig-instrumentation][OCPFeatureGate:MetricsCollectionProfiles] The collection profiles feature-set in a heterogeneous environment, should have at least one implementation for each collection profile [Suite:openshift/conformance/parallel]
      [sig-instrumentation][OCPFeatureGate:MetricsCollectionProfiles] The collection profiles feature-set in a homogeneous minimal environment, should hide default metrics [Suite:openshift/conformance/parallel]
      

      Each has a run where it looks like something timed out:

      fail [github.com/openshift/origin/test/extended/prometheus/collection_profiles.go:99]: Interrupted by User
      Ginkgo exit error 1: exit with code 1
      

      and a second run that fails to update the configmap cluster-monitoring-config:

      {  fail [github.com/openshift/origin/test/extended/prometheus/collection_profiles.go:197]: Expected
          <*errors.StatusError | 0xc006738280>: 
          Operation cannot be fulfilled on configmaps "cluster-monitoring-config": the object has been modified; please apply your changes to the latest version and try again
          {
              ErrStatus: 
                  code: 409
                  details:
                    kind: configmaps
                    name: cluster-monitoring-config
                  message: 'Operation cannot be fulfilled on configmaps "cluster-monitoring-config":
                    the object has been modified; please apply your changes to the latest version and
                    try again'
                  metadata: {}
                  reason: Conflict
                  status: Failure,
          }
      to be nil
      Ginkgo exit error 1: exit with code 1}
      

              Assignee: Unassigned
              Reporter: Luke Meyer (lmeyer@redhat.com)
