SRIOV IRQ overspilling
Epic
Resolution: Done
Major
rhos-dfg-nfv
2024Q3
0% To Do, 0% In Progress, 100% Done
https://bugzilla.redhat.com/show_bug.cgi?id=2219830
In RHOSP 17.1, a known issue causes transient packet loss: hardware interrupt requests (IRQs) trigger non-voluntary context switches on OVS-DPDK PMD threads or in guests running DPDK applications.
This issue results from provisioning a large number of VFs during deployment. Each VF needs IRQs, and each IRQ must be bound to a physical CPU. When there are not enough housekeeping CPUs to handle all of the IRQs, `irqbalance` fails to bind all of them and the excess IRQs spill over onto isolated CPUs.
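One way to check whether a host is affected is to look for device IRQs that have fired on isolated CPUs. The following is a minimal sketch, not part of the original report: it assumes the standard `/proc/interrupts` layout (a `CPUn` header row followed by one row per IRQ) and a kernel-style CPU list such as `2-3`. The function names and the sample data, including the interface names, are hypothetical.

```python
def parse_cpu_list(spec):
    """Expand a kernel-style CPU list such as '2-5,8' into a set of ints."""
    cpus = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            lo, hi = part.split("-")
            cpus.update(range(int(lo), int(hi) + 1))
        else:
            cpus.add(int(part))
    return cpus


def irqs_on_isolated_cpus(interrupts_text, isolated):
    """Return {irq: {cpu: count}} for numbered IRQs that fired on isolated CPUs."""
    lines = interrupts_text.strip().splitlines()
    # The header row names the online CPUs: "CPU0 CPU1 ..."
    cpu_ids = [int(name[3:]) for name in lines[0].split()]
    hits = {}
    for line in lines[1:]:
        label, _, rest = line.partition(":")
        label = label.strip()
        # Skip architecture counters such as NMI or LOC; keep numbered IRQs only.
        if not label.isdigit():
            continue
        counts = {}
        for cpu, field in zip(cpu_ids, rest.split()):
            if cpu in isolated and int(field) > 0:
                counts[cpu] = int(field)
        if counts:
            hits[label] = counts
    return hits


# Hypothetical sample in the shape of /proc/interrupts on a 4-CPU host.
SAMPLE = """\
           CPU0       CPU1       CPU2       CPU3
 45:       1200          0          0          0   PCI-MSI 524288-edge      eth0-rx-0
 46:          0         37          0         12   PCI-MSI 524289-edge      ens1f0v3-0
LOC:       5000       4000       3000       2000   Local timer interrupts
"""

isolated = parse_cpu_list("2-3")
result = irqs_on_isolated_cpus(SAMPLE, isolated)
print(result)
```

On a real host, the input would come from reading `/proc/interrupts`, and the isolated set could be taken from `/sys/devices/system/cpu/isolated`. The counters are cumulative since boot, so a non-zero value only shows that the IRQ has fired on that CPU at some point; comparing two snapshots taken some time apart gives a better picture of ongoing overspill.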
Workaround: You can try one or more of these actions:
- Reduce the number of provisioned VFs to avoid unused VFs remaining bound to their default Linux driver.
- Increase the number of housekeeping CPUs to handle all IRQs.
- Force unused VF network interfaces down to avoid IRQs from interrupting isolated CPUs.
- Disable multicast and broadcast traffic on unused, down VF network interfaces to avoid IRQs from interrupting isolated CPUs.
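The two interface-level workarounds above could be scripted roughly as follows. This is a sketch under assumptions: the list of unused VF interface names is supplied by the operator (the name shown is hypothetical), the commands are the standard iproute2 `ip link set dev <name> down` and `ip link set dev <name> multicast off`, and only the multicast flag is covered; filtering broadcast traffic is not shown. The helper defaults to a dry run so that nothing changes unless explicitly requested.

```python
import subprocess


def quiesce_commands(unused_vfs):
    """Build the ip(8) commands that bring each unused VF down and clear its multicast flag."""
    cmds = []
    for ifname in unused_vfs:
        cmds.append(["ip", "link", "set", "dev", ifname, "down"])
        cmds.append(["ip", "link", "set", "dev", ifname, "multicast", "off"])
    return cmds


def apply_commands(cmds, dry_run=True):
    """Print each command; run it only when dry_run is False (requires root)."""
    for cmd in cmds:
        print(" ".join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)


# Hypothetical unused VF interface name, for illustration only.
commands = quiesce_commands(["ens1f0v3"])
apply_commands(commands)  # dry run: prints the commands without executing them
```

These settings do not persist across reboots or VF re-creation, so in a deployed environment they would need to be reapplied by whatever mechanism manages the host network configuration.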
We need a real solution that does not require workarounds.