Goals

Use available pod disruption primitives to harden the LokiStack reliability during OCP cluster restarts
Keep the LokiStack ingestion path working while the OCP cluster is restarting
Keep the LokiStack query path working while the OCP cluster is restarting

Motivation

In OpenShift Container Platform 4, updates are applied based on MachineConfigPool level, requiring customers to apply PodDisruptionBudget to prevent undesired disruption when OpenShift Container Platform 4 - Nodes are being updated/rebooted.

LokiStack is missing PodDisruptionBudget configuration, which could trigger all OpenShift Container Platform 4 - Nodes, hosting such components to be updated at the same time and therefore restart the entire service at the same time, which may introcued undesired service disruption.

Acceptance Criteria

Any LokiStack deployment size supports OCP cluster restarts without human administrator attendance.
Any LokiStack path (ingestion/query) keeps operating within the available boundaries of node resources (CPU/Memory) during OCP cluster restarts.

Documentation Considerations

PodDisruptionBudget are already well documented in the official OpenShift Container Platform documentation pages. However our Logging docs should have some sort of banner that we explains how the LokiStack will behave during cluster restarts, e.g. explaning the effect of each PodDisruptionBudget we place.

clones

LOG-3839 Loki - Cluster Restart Hardening

Closed

documents

LOG-3839 Loki - Cluster Restart Hardening

Closed

links to

openshift/openshift-docs#64839: OBSDOCS-214: Documenting Loki restart behavior

openshift/openshift-docs#65030: [enterprise-4.12] OBSDOCS-214: Documenting Loki restart behavior

openshift/openshift-docs#65031: [enterprise-4.14] OBSDOCS-214: Documenting Loki restart behavior

openshift/openshift-docs#65033: [enterprise-4.13] OBSDOCS-214: Documenting Loki restart behavior

mentioned in: Page Loading...

(1 links to, 1 mentioned in)

Assignee:: Ashleigh Brennan

Reporter:: Robert Krátký (Inactive)

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Created:: 2023/05/17 1:53 PM

Updated:: 2023/09/27 8:29 AM

Resolved:: 2023/09/20 8:34 PM

Details

Description

Goals

Motivation

Acceptance Criteria

Documentation Considerations

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates