OpenShift Bugs / OCPBUGS-16118

Machine API Does Not Finish Reconciling Machine After Timeout

Details

    • Moderate
    • No
    • CLOUD Sprint 249, CLOUD Sprint 250, CLOUD Sprint 251, CLOUD Sprint 252
    • 4
    • Rejected
    • False

    Description

      Description of problem:

      The Machine API does not finish reconciling a new machine after a timeout occurs.

      Version-Release number of selected component (if applicable):

      4.14

      How reproducible:

      The timing needed to trigger the issue is difficult to hit, but it may be possible to force it with fault injection in a unit test.

      Steps to Reproduce:

      1. Scale up machines beyond the number currently available.
      2. Have kube-apiserver / etcd time out during the period when MAPI attempts to update the machine info while transitioning to Provisioned.

      Actual results:

      More than 40 minutes after the issue occurs, the machine still has not moved to Provisioned, even though the machine's VM has been created.

      Expected results:

      The machine moves to the Provisioned state after cloning completes.

      Additional info:

      In most cases I would agree that the infrastructure should be improved to prevent this scenario; however, CI infrastructure will be under high load at times, and if we cannot recover from timeouts while progressing to Provisioned, we will see many unnecessary CI failures.
      
      This issue is not marked as high severity, but it would be great if we could improve the vSphere machine provisioning process so that it recovers from this scenario and eventually marks the machine as Provisioned, allowing the CI tests to complete.
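
      To illustrate the kind of recovery being asked for, here is a rough sketch of a reconcile pass that derives the target phase from observed state (the VM exists) and asks to be requeued when the status write fails, so a timed-out update is retried on a later pass. This is not the actual vSphere provider code; `machine`, `updateFn`, `reconcile`, and `flakyUpdate` are hypothetical names, and the timeout is simulated.

```go
package main

import (
	"errors"
	"fmt"
)

// machine is a hypothetical, simplified stand-in for a Machine object.
type machine struct {
	name  string
	phase string // "Provisioning" or "Provisioned" in this sketch
}

// updateFn persists a phase change; it may fail transiently.
type updateFn func(m *machine, phase string) error

// reconcile performs one pass. It decides what to do from observed state
// (current phase, whether the VM exists) rather than from a one-time event,
// so a pass that failed earlier is simply repeated. It returns requeue=true
// when another pass is needed.
func reconcile(m *machine, vmExists bool, update updateFn) (requeue bool, err error) {
	if m.phase == "Provisioned" {
		return false, nil // nothing left to do
	}
	if !vmExists {
		return true, nil // clone still in progress, check again later
	}
	if err := update(m, "Provisioned"); err != nil {
		return true, err // write failed (e.g. timeout): retry on a later pass
	}
	return false, nil
}

// flakyUpdate returns an updateFn that fails with a simulated timeout
// for the first n calls and succeeds afterwards.
func flakyUpdate(n int) updateFn {
	return func(m *machine, phase string) error {
		if n > 0 {
			n--
			return errors.New("simulated timeout")
		}
		m.phase = phase
		return nil
	}
}

func main() {
	m := &machine{name: "worker-0", phase: "Provisioning"}
	update := flakyUpdate(2)
	for pass := 1; pass <= 5; pass++ {
		requeue, err := reconcile(m, true, update)
		fmt.Printf("pass %d: phase=%s requeue=%v err=%v\n", pass, m.phase, requeue, err)
		if !requeue {
			break
		}
	}
}
```

      The point of the sketch is only that a failed status update should leave the machine eligible for another pass, rather than stranding it short of Provisioned.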

          People

            rh-ee-nbrubake Nolan Brubaker
            rhn-support-ngirard Neil Girard
            Huali Liu Huali Liu
            Votes: 2
            Watchers: 8

            Dates

              Created:
              Updated: