  1. Red Hat OpenShift Data Science
  2. RHODS-1997

RHODS downtime after marketplace registration

    • Type: Bug
    • Resolution: Done
    • Priority: Major
    • RHODS_1.1_GA
    • RHODS_1.1_GA
    • UI
    • MODH Sprint 32

      Description of problem:

      There is a short downtime of the RHODS dashboard while following the procedure to register a cluster to Marketplace. I think the procedure triggers a rolling deployment of the RHODS operator pod, so it is scaled down and up. While this happens, the dashboard is unreachable for a few minutes (see attached images).

      This may also affect the running Jupyter labs. I will check the impact on JL and update this issue.

      Prerequisites (if any, like setup, operators/versions):

      RHODS installed

      Steps to Reproduce

      1. Follow the Marketplace cluster registration process: https://marketplace.redhat.com/en-us/workspace/clusters/add/register
      2. Notice that the RHODS operator goes from status "succeeded" to "installing" in the Operators > Installed Operators section
      3. Notice that a new operator pod is instantiated and the old one is terminated
      4. Try to reach the RHODS dashboard (you may need to try a few times before hitting the error)
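
Step 4 relies on manual retries, so the outage window is easy to miss. A minimal sketch of a poller that records when the dashboard stops and starts answering might look like the following. The function names, the probe interval, and the idea of treating any non-2xx/3xx answer as "down" are assumptions made for illustration, not part of the original report; the dashboard URL must be supplied by whoever runs it.

```python
import time
import urllib.error
import urllib.request


def probe_dashboard(url, timeout=5.0):
    """Return True if the URL answers with a 2xx/3xx status, else False."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return 200 <= resp.status < 400
    except (urllib.error.URLError, OSError):
        # HTTP errors (4xx/5xx), refused connections and timeouts all count as "down".
        return False


def measure_downtime(probe, checks, interval=5.0, sleep=time.sleep, clock=time.monotonic):
    """Call probe() `checks` times and return a list of (start, end) outage windows."""
    outages = []
    down_since = None
    for i in range(checks):
        now = clock()
        if probe():
            if down_since is not None:
                # Dashboard came back: close the current outage window.
                outages.append((down_since, now))
                down_since = None
        elif down_since is None:
            # First failed probe of a new outage window.
            down_since = now
        if i < checks - 1:
            sleep(interval)
    if down_since is not None:
        # Still down when polling stopped: close the window at the current time.
        outages.append((down_since, clock()))
    return outages
```

Started before the pull secret step, a call such as `measure_downtime(lambda: probe_dashboard(dashboard_url), checks=120)` would poll for roughly ten minutes and return the observed outage windows in seconds on a monotonic clock, where `dashboard_url` is the route of the RHODS dashboard on the cluster under test.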

      Actual results:

      • RHODS operator re-deployed
      • dashboard not reachable: blank page with the text "Client sent an HTTP request to an HTTPS server"

      Expected results:

      no impact on RHODS after cluster registration

      Reproducibility (Always/Intermittent/Only Once):

      always

      Build Details:

      RHODS v1.1.1-41

      Workaround:

      Additional info:

      • The registration step which triggers the RHODS redeployment is "Add the Red Hat Marketplace pull secret to the global pull secret on the cluster"
      • There is a note in the registration guide saying that "The update pull secret script will perform a rolling update on all your cluster nodes and update the pull secret", and it suggests performing this operation during off-hours
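
For context on what that registration step changes: on OpenShift the global pull secret is stored as a `.dockerconfigjson` document with an `auths` map keyed by registry host. The sketch below only illustrates the merge of one extra registry entry into such a document. It is not the Marketplace script itself, and the function name, the registry host `registry.example.com`, and the credentials are hypothetical placeholders.

```python
import base64
import json


def merge_pull_secret(global_secret_json, registry, username, token):
    """Return a new dockerconfigjson string with an auth entry for `registry` added."""
    config = json.loads(global_secret_json)
    # Copy the existing entries so the other registries are preserved untouched.
    auths = dict(config.get("auths", {}))
    # The "auth" field is base64("username:token"), as in a Docker config file.
    encoded = base64.b64encode(f"{username}:{token}".encode()).decode()
    auths[registry] = {"auth": encoded}
    merged = dict(config)
    merged["auths"] = auths
    return json.dumps(merged, sort_keys=True)


# Hypothetical example values, for illustration only.
before = json.dumps({"auths": {"quay.io": {"auth": "abc"}}})
after = merge_pull_secret(before, "registry.example.com", "user", "tok")
```

Per the note quoted above, writing the merged document back to the cluster is the part that rolls the nodes, which would be consistent with the operator pod being rescheduled.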

        1. Screenshot_20211004_180620.png (28 kB)
        2. Screenshot_20211004_180522.png (109 kB)
        3. operator-error-state.png (93 kB)
        4. restarts.png (98 kB)
        5. rhods-install-error.png (112 kB)
        6. operator-events.png (154 kB)
        7. operator-pod-ogs.zip (936 kB)

              acorvin@redhat.com Alex Corvin
              rhn-support-bdattoma Berto D'Attoma
              Berto D'Attoma Berto D'Attoma
