-
Bug
-
Resolution: Done
-
Undefined
-
None
-
4.18
-
None
-
Important
-
None
-
False
-
-
Description of problem:
SriovFecNodeConfig reports Failed status with ACC200 resource
Version-Release number of selected component (if applicable):
4.18.0-rc.0 sriov-fec.v2.9.0
How reproducible:
100%
Steps to Reproduce:
1. Deploy SNO with DU profile and the following SriovFecClusterConfig apiVersion: sriovfec.intel.com/v2 kind: SriovFecClusterConfig metadata: creationTimestamp: "2024-12-03T11:44:36Z" generation: 1 name: config namespace: vran-acceleration-operators resourceVersion: "16487" uid: 818a856e-f1ac-44ee-a6c1-c2e3c5560222 spec: acceleratorSelector: pciAddress: 0000:f7:00.0 drainSkip: true nodeSelector: node-role.kubernetes.io/master: "" physicalFunction: bbDevConfig: acc200: downlink4G: aqDepthLog2: 4 numAqsPerGroups: 16 numQueueGroups: 0 downlink5G: aqDepthLog2: 4 numAqsPerGroups: 16 numQueueGroups: 4 maxQueueSize: 1024 numVfBundles: 16 pfMode: false qfft: aqDepthLog2: 4 numAqsPerGroups: 16 numQueueGroups: 4 uplink4G: aqDepthLog2: 4 numAqsPerGroups: 16 numQueueGroups: 0 uplink5G: aqDepthLog2: 4 numAqsPerGroups: 16 numQueueGroups: 4 pfDriver: vfio-pci vfAmount: 16 vfDriver: vfio-pci
Actual results:
apiVersion: sriovfec.intel.com/v2 kind: SriovFecNodeConfig metadata: creationTimestamp: "2024-12-03T11:44:37Z" generation: 2 name: sno.kni-qe-67.lab.eng.rdu2.redhat.com namespace: vran-acceleration-operators resourceVersion: "471104" uid: ca3d5369-5d99-45ef-aa85-b829a0b32b33 spec: drainSkip: true physicalFunctions: - bbDevConfig: acc200: downlink4G: aqDepthLog2: 4 numAqsPerGroups: 16 numQueueGroups: 0 downlink5G: aqDepthLog2: 4 numAqsPerGroups: 16 numQueueGroups: 4 fftLut: fftChecksum: "" fftUrl: "" maxQueueSize: 1024 numVfBundles: 16 qfft: aqDepthLog2: 4 numAqsPerGroups: 16 numQueueGroups: 4 uplink4G: aqDepthLog2: 4 numAqsPerGroups: 16 numQueueGroups: 0 uplink5G: aqDepthLog2: 4 numAqsPerGroups: 16 numQueueGroups: 4 pciAddress: 0000:f7:00.0 pfDriver: vfio-pci vfAmount: 16 vfDriver: vfio-pci status: conditions: - lastTransitionTime: "2024-12-03T20:34:14Z" message: exit status 255 observedGeneration: 2 reason: Failed status: "False" type: Configured inventory: sriovAccelerators: - deviceID: 57c0 driver: vfio-pci maxVirtualFunctions: 16 pciAddress: 0000:f7:00.0 vendorID: "8086" virtualFunctions: []
Expected results:
Successfully configured.
Additional info:
Attaching must-gather and vran-acceleration-operators logs.