Uploaded image for project: 'Product Technical Learning'
  1. Product Technical Learning
  2. PTL-13935

DO316: The NFS Ansible playbooks are not reliable

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Done
    • Icon: Minor Minor
    • DO316 - OCP4.14-en-3-20240619
    • DO316
    • None
    • False
    • Hide

      None

      Show
      None
    • False
    • Hide

      Attempt to start or finish the advanced-migrate GE. The NFS actions to configure or revert fail when the cluster is on heavy load, or has capacity issues.

      Show
      Attempt to start or finish the advanced-migrate GE. The NFS actions to configure or revert fail when the cluster is on heavy load, or has capacity issues.
    • en-US (English)

      Please fill in the following information:


      URL: https://rol.redhat.com/rol/app/courses/do316-4.14/pages/ch07s08 
      Reporter RHNID: ricardodacosta 
      Section Title:  Guided Exercise: Perform Virtual Machine Live Migrations 

      Slack thread

      Issue description

       The NFS ansible playbooks use a shell command to get the resource from the OpenShift cluster instead of using the kubernetes.core.k8s_info module.

      The shell command is attempted only once (there is no retry logic), and it fails when the cluster is under heavy load, or when the cluster is having capacity issues.

      Steps to reproduce:

      View the source code of the affected ansible playbooks (search for "command:" in the ansible directory:

       * https://github.com/RedHatTraining/DO316/blob/anhernan/dynolabs-update/classroom/grading/src/do316/ansible/template-create/update-nfs-storage.yml

      Workaround:

      * Wait until the cluster does not have heavy load and try again.

      • Attempt to run the playbook manually until it succeeds

      Expected result:

      The NFS configuration is applied in lab start and reverted in lab finish.

            rht-anhernan Andres Hernandez
            rht-anhernan Andres Hernandez
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

              Created:
              Updated:
              Resolved: