-
Spike
-
Resolution: Done
-
Normal
-
None
-
None
-
3
-
False
-
None
-
False
OTA-761 ported our Bugzilla workflow to Jira without structural changes. But the label-based workfllow isn't very familiar to many developers, and it's easy to land a fix and forget about the parallel work of mitigating an issue for the existing fleet (who maybe cannot update to pick up the fix yet) or brainstorming for ways to avoid similar regressions going forward.
This ticket is about spiking for alternative approaches which might be more understandable and easier to manage. For example, OCPBUGS could allow for sub-tasks tracking "disccuss update recommendation changes like declaring conditional update risks" or "analyze root casuses and create tickets for follow-up improvements". Or we could decide to spin out tickets in the relevant component project, instead of using sub-tasks under OCP. Or we could create new projects just for RCAs and conditional update risks. Or whatever. Lots of options.
Definition of done:
- Consensus or executive decision reached on target workflows for discussing update-risk mitigations and root cause analysis for OCPBUGS.
- Initial set of tickets created tracking the work that would need to be done to implement those target workflows.
Mutable process docs
Collecting these so they don't get lost in the comments, and we have a central place to evolve:
- Someone puts an UpgradeBlocker label on an OCPBUGS-# ticket.
- We create a Spike in the relevant component's Jira project (e.g. MCO-#). Let's use TEAM-# as a placeholder.
- Subject is: Impact ${BUG_SUBJECT}
- Assignee is the bug assignee.
- Body is our request template, but with subsitutions (if/when we formally adopt this process, we can update the enhancement template) like:
- "OCPBUGS-#" instead of "this bug".
- Use "When responding, please move this ticket to Code Review" for the "When responding" sentence (Jira native ).
- Priority is Critical, because it's hard to assess customer risk in the absence of an impact statement. If new information flows in that suggests further impact-statement work should be lower-priority for that bug, folks can reduce the priority later and explain their motivation.
- TEAM-# blocks OCPBUGS-#, because we should be able to get an initial draft at impact without waiting for the bug fix to merge, be verified, or ship.
- Label TEAM-# with UpgradeBlocker for tracking.
- Label OCPBUGS-# with ImpactStatementRequested for
While we're trialing this workflow, also link TEAM-# with this OTA-787 ticket, so it's easy for folks to poke around and see how the process is going.
- is blocked by
-
OTA-761 Port UpgradeBlocker queries to Jira
- Closed
- is triggering
-
CONSOLE-3427 Impact assessment of OCPBUGS-6488 : Console-operator is pinned at ConsoleNotificationSyncDegraded if it occurred once
- Closed
-
MCO-402 Impact: "error: No enabled repositories" on upgrade with kernelType: realtime enabled
- Closed
-
RUN-1668 Impact: 4.11 upgrade to 4.12, prometheus-operator-admission-webhook pod is failed to start up due to "error loading seccomp filter into kernel: error loading seccomp filter: errno 524"
- Closed
- split to
-
OTA-866 Enforce providing impact statements on UpgradeBlocker candidates (by blocking merges)
- To Do
- links to