Managed Service - Streams / MGDSTRM-11005

KafkaTopicPartitionReplicaSpreadMax still firing after MGDSTRM-10777

    • MK - Sprint 234

      WHAT

      Investigate the root cause of the KafkaTopicPartitionReplicaSpreadMax alert and tune it.

      WHY

      1000 KafkaTopicPartitionReplicaSpreadMax alerts fired during the past 7 days.

      https://grafana.app-sre.devshift.net/d/xcNqJY24zgdgdgd/mk-7-day-alert?orgId=1

      HOW

       

      DONE

      The alert no longer fires in the normal operating state.

        1. image-2023-03-27-20-24-02-225.png (409 kB, Luke Chen)
        2. image-2023-03-28-14-40-13-363.png (145 kB, Luke Chen)
        3. image-2023-03-28-14-42-46-999.png (142 kB, Luke Chen)
        4. image-2023-03-28-14-43-24-279.png (88 kB, Luke Chen)
        5. image-2023-03-28-14-46-25-833.png (55 kB, Luke Chen)
        6. image-2023-03-28-14-48-53-767.png (81 kB, Luke Chen)
        7. image-2023-03-28-14-51-02-883.png (83 kB, Luke Chen)
        8. screenshot-1.png... 

            lukchen@redhat.com Luke Chen
            lukchen@redhat.com Luke Chen
            Kafka Integrations
