Uploaded image for project: 'Managed Service - Streams'
  1. Managed Service - Streams
  2. MGDSTRM-9327

Blast radius and other controls for scheduling changes

XMLWordPrintable

    • Icon: Task Task
    • Resolution: Done
    • Icon: Undefined Undefined
    • kasFleetShard-0.30.0
    • None
    • None
    • None
    • MK - Sprint 223, MK - Sprint 225, MK - Sprint 226

      To prevent the rolling of all brokers pods, or the unanticipated disruption of other pods, such as ingress replicas, we need to account for how scheduling changes are applied in a migration scenario.

      For controlling broker rolling, it has been proposed to use an artificial strimzi version to control roll the instances. This will be controlled by a dynamic-scaling-scheduling flag in the strimzi configmap in the fleetshard_operands.yaml.

      For ingress replicas we need them to roll onto the default machine pool before adding the NoExecute taint. For this we'll need to also use the NoSchedule version of the taint, and update the existing tolerations to include that.

            rhn-engineering-shawkins Steven Hawkins
            rhn-engineering-shawkins Steven Hawkins
            Kafka Fleet Services
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

              Created:
              Updated:
              Resolved: