Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-64744

Sometimes Metrics are not returning in a timely manner from the kubelet

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Minor Minor
    • None
    • 4.18, 4.19, 4.20
    • Node / Kubelet
    • None
    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      Description of problem:

      Sometimes the kubelet does not return metrics in under 30 seconds, and in some cases using the proxy interface to the kubelet the metrics are not returned within 2 minutes. These are likely two different issues.
      
      The upstream https://github.com/kubernetes/kubernetes/pull/133392 PR fixes the upstream [test|https://github.com/kubernetes/kubernetes/blob/544dfee60a99eaec9962c3853674dc1e4d7f0c8d/test/e2e/instrumentation/metrics.go#L56] by deflaking the  HTTP request call to the proxy. It's unclear why the upstream test called out why the test is flaking in this regards, so further study will be needed.
      
      Additionally, we see metrics taking over 30s to be returned from cadvisor. This is likely a separate issue that needs more investigation on why it happens.  [~abukashem] has some thoughts with improvements in this area.
      
          

      Version-Release number of selected component (if applicable):

      
          

      How reproducible:

      
          

      Steps to Reproduce:

          1.
          2.
          3.
          

      Actual results:

      
          

      Expected results:

      
          

      Additional info:

      
          

              rphillip@redhat.com Ryan Phillips
              rphillip@redhat.com Ryan Phillips
              None
              None
              Min Li Min Li
              None
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated: