Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-16724

c4.* instanceType stuck in Provisioned on AWS 4.14

XMLWordPrintable

    • Important
    • Yes
    • CLOUD Sprint 240, CLOUD Sprint 241
    • 2
    • Rejected
    • False
    • Hide

      We are currently unsure as to whether this is a regression or a capacity issue. Will need more reproduction

      Show
      We are currently unsure as to whether this is a regression or a capacity issue. Will need more reproduction

      Description of problem:

      c4.* instanceType stuck in Provisioned on AWS 4.14, tired c4.8xlarge and c4.2xlarge, both stuck in Provisioned on AWS 4.14.
      But on AWS 4.13 they can get Running.

      Version-Release number of selected component (if applicable):

      4.14.0-0.nightly-2023-07-20-215234

      How reproducible:

      Always

      Steps to Reproduce:

      1.Copy a default machineset, change instanceType to c4.8xlarge, then create the machineset.
      liuhuali@Lius-MacBook-Pro huali-test % oc create -f ms2.yaml 
      machineset.machine.openshift.io/huliu-aws25a-nsmkq-worker-us-east-2bd created
      liuhuali@Lius-MacBook-Pro huali-test % oc get clusterversion
      NAME      VERSION                              AVAILABLE   PROGRESSING   SINCE   STATUS
      version   4.14.0-0.nightly-2023-07-20-215234   True        False         8m30s   Cluster version is 4.14.0-0.nightly-2023-07-20-215234
      liuhuali@Lius-MacBook-Pro huali-test % oc get machine
      NAME                                          PHASE         TYPE         REGION      ZONE         AGE
      huliu-aws25a-nsmkq-master-0                   Running       m6i.xlarge   us-east-2   us-east-2a   156m
      huliu-aws25a-nsmkq-master-1                   Running       m6i.xlarge   us-east-2   us-east-2b   156m
      huliu-aws25a-nsmkq-master-2                   Running       m6i.xlarge   us-east-2   us-east-2c   156m
      huliu-aws25a-nsmkq-worker-us-east-2a-6vzt9    Running       m6i.xlarge   us-east-2   us-east-2a   153m
      huliu-aws25a-nsmkq-worker-us-east-2ac-k8cqq   Provisioned   c4.8xlarge   us-east-2   us-east-2a   94m
      huliu-aws25a-nsmkq-worker-us-east-2ad-rq2dp   Provisioned   c4.2xlarge   us-east-2   us-east-2a   63m
      huliu-aws25a-nsmkq-worker-us-east-2b-svbbd    Running       m6i.xlarge   us-east-2   us-east-2b   153m
      huliu-aws25a-nsmkq-worker-us-east-2bd-w4rg4   Provisioned   c4.2xlarge   us-east-2   us-east-2b   39m
      huliu-aws25a-nsmkq-worker-us-east-2c-t8s4x    Running       m6i.xlarge   us-east-2   us-east-2c   153m
      liuhuali@Lius-MacBook-Pro huali-test % 
      
      
      On AWS 4.13, machine get Running
      
      liuhuali@Lius-MacBook-Pro huali-test % oc get clusterversion 
      NAME      VERSION                              AVAILABLE   PROGRESSING   SINCE   STATUS
      version   4.13.0-0.nightly-2023-07-24-145501   True        False         10m     Cluster version is 4.13.0-0.nightly-2023-07-24-145501
      liuhuali@Lius-MacBook-Pro huali-test % oc get machine
      NAME                                          PHASE     TYPE         REGION      ZONE         AGE
      huliu-aws25b-t8k62-master-0                   Running   m6i.xlarge   us-east-2   us-east-2a   162m
      huliu-aws25b-t8k62-master-1                   Running   m6i.xlarge   us-east-2   us-east-2b   161m
      huliu-aws25b-t8k62-master-2                   Running   m6i.xlarge   us-east-2   us-east-2c   161m
      huliu-aws25b-t8k62-worker-us-east-2a-l484l    Running   m6i.xlarge   us-east-2   us-east-2a   158m
      huliu-aws25b-t8k62-worker-us-east-2ab-xcmtc   Running   c4.8xlarge   us-east-2   us-east-2a   98m
      huliu-aws25b-t8k62-worker-us-east-2ac-lmpwg   Running   c4.2xlarge   us-east-2   us-east-2a   59m
      huliu-aws25b-t8k62-worker-us-east-2b-5sxlf    Running   m6i.xlarge   us-east-2   us-east-2b   158m
      huliu-aws25b-t8k62-worker-us-east-2c-wfvgf    Running   m6i.xlarge   us-east-2   us-east-2c   158m
      liuhuali@Lius-MacBook-Pro huali-test % 
      
      
      Checked on AWS console, the Provisioned machine shows "Instance reachability check failed”
      Downloaded the system log for huliu-aws25a-nsmkq-worker-us-east-2ac-k8cqq which phase is Provisioned: https://drive.google.com/file/d/1TjSY7tP-O4ofPIieEHb9vUbrQgBPEp04/view?usp=sharing
      
      The system log for huliu-aws25b-t8k62-worker-us-east-2ab-xcmtc which phase is Running: https://drive.google.com/file/d/15zB_FRfh4fceQUYLBe0Mj4GG9pwK_-XP/view?usp=sharing 

       

      Actual results:

      Machine stuck in Provisioned

      Expected results:

      Machine should get Running

      Additional info:

      Must gather: https://drive.google.com/file/d/1oneVZY3wKDCya7WzmMo2Z7LlxT7QdQl2/view?usp=sharing

              Unassigned Unassigned
              huliu@redhat.com Huali Liu
              Huijing Hei Huijing Hei
              Votes:
              0 Vote for this issue
              Watchers:
              10 Start watching this issue

                Created:
                Updated:
                Resolved: