Uploaded image for project: 'OpenShift Specialist Platform Team'
  1. OpenShift Specialist Platform Team
  2. SPLAT-1225

[aws][local-zones][CI] Investigate jobs failing

    • Icon: Bug Bug
    • Resolution: Done
    • Icon: Critical Critical
    • 4.15
    • None
    • Hide
      2023-10-31: Bug assigned to me, I reopened the PR and reviewed the logs, initial results looks promises. Awaiting feedback/PR review
      2023-10-30: Bug filled, waiting for initial triage: https://issues.redhat.com/browse/OCPBUGS-22703
      Show
      2023-10-31: Bug assigned to me, I reopened the PR and reviewed the logs, initial results looks promises. Awaiting feedback/PR review 2023-10-30: Bug filled, waiting for initial triage: https://issues.redhat.com/browse/OCPBUGS-22703
    • False
    • OCPSTRAT-736 - Add support to AWS Wavelength Zones
    • Sprint 244

      Goal:

      • Investigate both Local Zone jobs failing

      Context:

      Local Zones jobs[1][2] are perm failing in the installer repo.

      An initial triage points to monitor tests is failing, we need to check the failure below (1 and 2) to check if it is related

       

      [1]https://prow.ci.openshift.org/job-history/gs/origin-ci-test/pr-logs/directory/pull-ci-openshift-installer-master-e2e-aws-ovn-localzones?buildId=1716457254460329984 

      [2]https://prow.ci.openshift.org/job-history/gs/origin-ci-test/pr-logs/directory/pull-ci-openshift-installer-master-e2e-aws-ovn-shared-vpc-localzones 

       

      Failure 01) Run multi-stage test e2e-aws-ovn-localzones - e2e-aws-ovn-localzones-gather-audit-logs container test

      https://prow.ci.openshift.org/view/gs/origin-ci-test/pr-logs/pull/openshift_installer/7509/pull-ci-openshift-installer-master-e2e-aws-ovn-localzones/1714631725696421888 

       

      {  ift.com:6443/api?timeout=32s": dial tcp: lookup api.ci-op-t585h9f8-9d876.vmc-ci.devcluster.openshift.com on 172.30.0.10:53: no such host
      E1018 15:36:38.772841      33 memcache.go:265] couldn't get current server API group list: Get "https://api.ci-op-t585h9f8-9d876.vmc-ci.devcluster.openshift.com:6443/api?timeout=32s": dial tcp: lookup api.ci-op-t585h9f8-9d876.vmc-ci.devcluster.openshift.com on 172.30.0.10:53: no such host
      error running backup collection: Get "https://api.ci-op-t585h9f8-9d876.vmc-ci.devcluster.openshift.com:6443/api?timeout=32s": dial tcp: lookup api.ci-op-t585h9f8-9d876.vmc-ci.devcluster.openshift.com on 172.30.0.10:53: no such host
       

       

      Failure 02)  [sig-network] can collect host-to-pod poller pod logs

      https://prow.ci.openshift.org/view/gs/origin-ci-test/pr-logs/pull/openshift_installer/7537/pull-ci-openshift-installer-master-e2e-aws-ovn-localzones/1714721421663408128

       

      :34.773866       1 disruption_backend_sampler.go:496] not finished writing all samples (1 remaining), but we're told to close
      E1018 22:06:34.774669       1 disruption_backend_sampler.go:496] not finished writing all samples (1 remaining), but we're told to close 

      https://github.com/openshift/origin/blame/880c9f8ff5d0bceb5afdfab008536ec4e738b425/pkg/monitortests/network/disruptionpodnetwork/monitortest.go#L339-L342

      https://github.com/openshift/origin/blob/880c9f8ff5d0bceb5afdfab008536ec4e738b425/pkg/monitor/backenddisruption/disruption_backend_sampler.go#L444-L446 

       

      https://issues.redhat.com/browse/OCPBUGS-18865 

            rhn-support-mrbraga Marco Braga
            rhn-support-mrbraga Marco Braga
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

              Created:
              Updated:
              Resolved: