-
Feature Request
-
Resolution: Unresolved
-
Critical
-
None
-
None
-
False
-
None
-
False
-
Not Selected
-
-
-
-
-
1. Proposed title of this feature request
Provide fallback or priorization for MachineSet/MachinePools to guarantee scale-up in case instance type is not available
2. What is the nature and description of the request?
In various Public Cloud Region it's common to see MachineSet scaling failures because the requested instanceType is not available at the point the scaling was triggered. This is causing critical workload to remain in pending state as simply no resources are available to host the workload.
Thus having a way to either fallback to a MachineSet/MachinePool with a different instanceType specified or go through a list of MachineSet/MachinePool based priorities would be desired to automatically recover from a faulty Machine scale-up because the instanceType is not available.
If something can be provided today in the MachineAPI that would be great. The focus though should be on the ClusterAPI to make sure this functionality is available once OpenShift Container Platform 4 is transitioning to the same.
3. Why does the customer need this? (List the business requirements here)
Being able to scale on demand is critical for customers. Having the scale-up stall because the instanceType is not available is causing potential disruption in production and also requires manual intervention from the SRE Team. Given that MachineSet/MachinePools with different instanceType can be created, it would be rather easy to assign priorites to them or simply implement a fallback in case the scaling is failing.
So key would be to have a way to tell ClusterAPI to use a different MachinePool or instanceType if the selected one is not available and thus to actually capture that event properly to act accordingly.
4. List any affected packages or components.
- MachineAPI
- ClusterAPI
- is related to
-
RFE-3931 Karpenter support for ROSA
- Accepted
-
OCPSTRAT-1527 Phase 2 : Get CAPI provider working for standalone OCP on AWS
- New
-
OCPSTRAT-1586 Phase 1 : PoC for a new upstream CAPI provider for Karpenter on AWS - (part-2)
- New
-
OCPPLAN-9645 Prioritized instance types for MachineSets
- New
-
OCPSTRAT-943 Phase 3: get CAPI provider for Karpenter working for ROSA+HCP
- Refinement
-
OCPSTRAT-1498 Phase 4: get CAPI provider for Karpenter working for ARO+HCP
- Refinement
-
OCPSTRAT-1526 [Upstream] Phase 1 : PoC for a new upstream CAPI provider for Karpenter on AWS- (part-1)
- Closed
-
OCPPLAN-9761 Prioritized instance types for MachineSets
- Closed
- relates to
-
OCPSTRAT-1101 Provide instance type fallbacks in CAPI
- New