Uploaded image for project: 'Data Foundation Bugs'
  1. Data Foundation Bugs
  2. DFBUGS-905

[ Tracker for 43495] Ceph daemon crash leads to warning state in Fusion Cluster

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Undefined Undefined
    • odf-4.18
    • odf-4.16
    • rook
    • None
    • False
    • Hide

      None

      Show
      None
    • False
    • ?
    • ?
    • ?
    • ?
    • None

      Description of problem - Provide a detailed description of the issue encountered, including logs/command-output snippets and screenshots if the issue is observed in the UI:

      reference ticket in github - https://github.ibm.com/ProjectAbell/abell-tracking/issues/43495

      Defect Synopsis:
      After new cluster was provisioned via this ticket, I deployed ISF via this Jenkins pipeline, specifying ODF to be also installed
      https://hyc-icps-team-jenkins.swg-devops.com/job/DevOps/job/fusion-deploy/job/fusion-deploy/3283/

      But after that, the storage is showing as degraded: block and file services are both unhealthy

      Impact:
      This will block deploying apps on local storage for Data Foundation related testing

      TestRail Testcase link
      https://ibm.testrail.io/index.php?/tests/view/18393558#

      Build and Platform details:
      ISF: docker-na-public.artifactory.swg-devops.com/hyc-abell-devops-team-mint-docker-local/sds-2.9.0/isf-operator-software-catalog:2.9.0-29133893
      FDF: v4.16.3
      Cluster: Power

      Recreate steps:

      Deploy ISF and ODF via Jenkins, or install each manually and configure local storage
      Expected Vs actual behavior:
      Expected: FDF should have local storage in healty state
      Actual: Storage system is degraded

      Probability: (Consistent/intermittent)

      Unsure
      Data collection and analysis: (Add log support bundle at- https://ibm.ent.box.com/folder/157974638139)
      Screenshots present in the github issue.

      The OCP platform infrastructure and deployment type (AWS, Bare Metal, VMware, etc. Please clarify if it is platform agnostic deployment), (IPI/UPI):

       

      The ODF deployment type (Internal, External, Internal-Attached (LSO), Multicluster, DR, Provider, etc):

       

       

      The version of all relevant components (OCP, ODF, RHCS, ACM whichever is applicable):

       

       

      Does this issue impact your ability to continue to work with the product?

       

       

      Is there any workaround available to the best of your knowledge?

       

       

      Can this issue be reproduced? If so, please provide the hit rate

       

       

      Can this issue be reproduced from the UI?

      If this is a regression, please provide more details to justify this:

      Steps to Reproduce:

      1.

      2.

      3.

      The exact date and time when the issue was observed, including timezone details:

       

      Actual results:

       

       

      Expected results:

       

      Logs collected and log location:

       

      Additional info:

       

              skrai Subham Rai
              nberry@redhat.com Neha Berry
              Votes:
              0 Vote for this issue
              Watchers:
              17 Start watching this issue

                Created:
                Updated: