OpenShift Bugs / OCPBUGS-50504

[enterprise-4.17] No doc on how to uncordon nodes after graceful shutdown


    • Type: Bug
    • Resolution: Done
    • Priority: Normal
    • 4.13, 4.12, 4.14, 4.15, 4.16, 4.17, 4.18
    • Documentation / etcd
    • Quality / Stability / Reliability
    • False
    • 2
    • Moderate
    • OSDOCS Sprint 269, OSDOCS Sprint 270
    • 2

      Doc at https://docs.openshift.com/container-platform/4.17/backup_and_restore/graceful-cluster-restart.html

      Description of problem:

      In the restore documentation, we do not describe how to uncordon the nodes after rebooting them. Specifically, we need to describe exactly how to log in to the API server from the CLI, potentially without access to the API VIP, because we cordon all nodes during graceful shutdown.
      
      We need to decide which credential the user should use. There is no guarantee that the OAuth token from `oc login` will work, which means we need a system:admin certificate credential.
      
      Steps in the documentation should be:
      1. Power the nodes back on.
      
      If the user has the admin.kubeconfig from the cluster installation and the API VIP is up:
      - Set the KUBECONFIG environment variable to the path of admin.kubeconfig.
      - Run `oc adm uncordon <node>` for each node in the cluster.
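
The admin.kubeconfig path could be scripted roughly as follows. This is a minimal sketch, not tested documentation text: the function name `uncordon_all_nodes` is hypothetical, and it assumes that KUBECONFIG is already exported and that the API server is reachable.

```shell
# Hypothetical helper: uncordon every node that the API server reports.
# Assumes KUBECONFIG already points at admin.kubeconfig.
uncordon_all_nodes() {
  rc=0
  # `oc get nodes -o name` prints one "node/<name>" entry per line.
  for node in $(oc get nodes -o name); do
    # Strip the "node/" prefix so only the bare node name is passed.
    oc adm uncordon "${node#node/}" || rc=1
  done
  # Non-zero if any node failed to uncordon.
  return "$rc"
}
```

After running `export KUBECONFIG=<path to admin.kubeconfig>`, calling `uncordon_all_nodes` would attempt every node and report failure if any single uncordon fails, so one unreachable node does not stop the rest.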
      
      If the user does not have access to the admin.kubeconfig credentials, they can follow steps similar to https://access.redhat.com/solutions/6987229:
      - SSH to a control plane node.
      - Copy localhost-recovery.kubeconfig to /root.
      - Use that kubeconfig to run `oc adm uncordon <node>` for each node in the cluster.
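
The recovery path could look something like the sketch below, run as root on a control plane node after connecting over SSH. The source location of localhost-recovery.kubeconfig is an assumption that is not stated in this issue and should be checked against the linked solution. The function name and the `RECOVERY_KUBECONFIG_SRC` variable are hypothetical.

```shell
# ASSUMPTION: typical location of the recovery kubeconfig on a control plane
# node. Override RECOVERY_KUBECONFIG_SRC if your cluster differs.
RECOVERY_KUBECONFIG_SRC="${RECOVERY_KUBECONFIG_SRC:-/etc/kubernetes/static-pod-resources/kube-apiserver-certs/secrets/node-kubeconfigs/localhost-recovery.kubeconfig}"

# Hypothetical helper: copy the recovery kubeconfig, then uncordon all nodes
# with it. The optional first argument is the destination of the copy.
uncordon_with_recovery_kubeconfig() {
  dest="${1:-/root/localhost-recovery.kubeconfig}"
  cp "$RECOVERY_KUBECONFIG_SRC" "$dest" || return 1
  rc=0
  # Pass the copied kubeconfig explicitly instead of relying on KUBECONFIG.
  for node in $(oc --kubeconfig "$dest" get nodes -o name); do
    oc --kubeconfig "$dest" adm uncordon "${node#node/}" || rc=1
  done
  return "$rc"
}
```

Passing `--kubeconfig` on each call keeps the recovery credential scoped to these commands rather than to the whole shell session.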
      
      

      Version-Release number of selected component (if applicable):

      At least back to the docs for 4.12, which is where this documentation starts.

      How reproducible:

      Always

              rhn-support-lahinson Laura Hinson
              jkincl@redhat.com Jason Kincl