Loading...

XML

Word

Printable

Type: Bug
Resolution: Done-Errata
Priority: Major
Fix Version/s: None
Affects Version/s: 4.14
Component/s: MicroShift
Labels:
None

Regression:
No
Sprint:
uShift Sprint 241
sprint_count:
1
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Release Note Text:
Fixed a problem that prevented the etcd database used by MicroShift from shutting down cleanly in some circumstances.
Release Note Type:
Bug Fix
Target Version:

4.14.0

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Description of problem:

Running command systemctl stop microshift.service also signals microshift-etcd.scope to stop.
This means that it's not microshift controlling the etcd shutdown.

This results in etcd stopping to soon - before kube-apiserver had chance to write final items.

Depending on what KAS wanted to persist, it can result in microshift taking around 40-50 second to stop (because it waits for go context to timeout)

Version-Release number of selected component (if applicable):

main

How reproducible:

100%

Steps to Reproduce:

1. Watch `microshift.service` and microshift-etcd.scope` logs side by side
2. Run `sudo systemctl stop microshift`
3. Compare time when both processes received interrupt signal

Actual results:

Jun 07 08:35:26 localhost.localdomain microshift[10939]: I0607 08:35:26.720566   10939 run.go:135] microshift-etcd received signal terminated - stopping

Jun 07 08:35:26 localhost.localdomain microshift[10899]: ??? I0607 08:35:26.721436   10899 run.go:212] Interrupt received

microshift-etcd got signal at 08:35:26.720566
microshift got signal at 08:35:26.721436 - a little bit later

Expected results:

microshift manages microshift-etcd and shuts it down AFTER kube-apiserver (in reverse order to the start up sequence)

Additional info:

Added extra debug log to see when microshift wants to shutdown etcd just to be sure:

Jun 07 08:35:26 localhost.localdomain microshift[10899]: etcd I0607 08:35:26.723597   10899 etcd.go:120] "+++PMTK Signalling etcd"
Jun 07 08:35:26 localhost.localdomain microshift[10899]: etcd I0607 08:35:26.723620   10899 manager.go:123] etcd completed

relates to

OCPBUGS-18548 MicroShift's KAS and KCM are not shutting down

Closed

links to

openshift/microshift#2287: OCPBUGS-14678: microshift-etcd shuts down independently

RHSA-2023:5008 OpenShift Container Platform 4.14.z security update

Assignee:: Evgeny Slutsky

Reporter:: Patryk Matuszak

QA Contact:: John George

Contributors:: Allen Ray

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Created:: 2023/06/07 1:09 PM

Updated:: 2023/10/31 2:20 PM

Resolved:: 2023/10/31 2:20 PM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates