Uploaded image for project: 'Red Hat OpenStack Services on OpenShift'
  1. Red Hat OpenStack Services on OpenShift
  2. OSPRH-11159

[os-net-config][nmstate] NIC partitioning error due to SR-IOV VFs not created

XMLWordPrintable

    • Critical

      When configuring NIC partitioning like this:

                network_config:
                - type: interface
                  name: nic1
                  use_dhcp: false
                - type: interface
                  name: nic2
                  use_dhcp: false
                - type: sriov_pf
                  name: nic3
                  mtu: 9000
                  numvfs: 5
                  use_dhcp: false
                  defroute: false
                  nm_controlled: true
                  hotplug: true
                - type: sriov_pf
                  name: nic4
                  mtu: 9000
                  numvfs: 5
                  use_dhcp: false
                  defroute: false
                  nm_controlled: true
                  hotplug: true
                - type: linux_bond
                  name: bond_api
                  use_dhcp: false
                  bonding_options: "mode=active-backup"
                  dns_servers: {{ ctlplane_dns_nameservers }}
                  members:
                    - type: sriov_vf
                      device: nic3
                      vfid: 0
                    - type: sriov_vf
                      device: nic4
                      vfid: 0
                  addresses:
                  - ip_netmask: {{ ctlplane_ip }}/{{ ctlplane_cidr }}
                  routes:
                  - default: true
                    next_hop: {{ ctlplane_gateway_ip }}
                - type: vlan
                  vlan_id: {{ lookup('vars', networks_lower['internalapi'] ~ '_vlan_id') }}
                  device: bond_api
                  addresses:
                  - ip_netmask: {{ lookup('vars', networks_lower['internalapi'] ~ '_ip') }}/{{ lookup('vars', networks_lower['internalapi'] ~ '_cidr') }}
                - type: linux_bond
                  name: storage_bond
                  use_dhcp: false
                  bonding_options: "mode=active-backup"
                  dns_servers: {{ ctlplane_dns_nameservers }}
                  members:
                    - type: sriov_vf
                      device: nic3
                      vfid: 1
                      vlan_id: {{ lookup('vars', networks_lower['storage'] ~ '_vlan_id') }}
                    - type: sriov_vf
                      device: nic4
                      vfid: 1
                      vlan_id: {{ lookup('vars', networks_lower['storage'] ~ '_vlan_id') }}
                  addresses:
                  - ip_netmask: {{ lookup('vars', networks_lower['storage'] ~ '_ip') }}/{{ lookup('vars', networks_lower['storage'] ~ '_cidr') }}
      ...
      

      We hit this error:

      [cloud-admin@compute-1 ~]$ grep -R ERROR /var/log/os-net-config.log 
      2024-10-31 14:32:59.181 ERROR os_net_config.impl_nmstate.nmstate_apply Error seen while applying the nmstate templates Ethernet interface enp130s0f0v0 does not exists
      2024-10-31 14:33:10.886 ERROR os_net_config.impl_nmstate.nmstate_apply **** Verification Error *****
      2024-10-31 14:33:10.886 ERROR os_net_config.impl_nmstate.nmstate_apply Error seen while applying the nmstate templates Desired route destination: 0.0.0.0/0 next-hop-interface: bond_api next-hop-address: 192.168.122.1 not found after apply
      2024-10-31 14:33:11.347 ERROR os_net_config.impl_nmstate.apply Failure(s) occurred when applying configuration
      2024-10-31 14:33:11.347 ERROR os_net_config.impl_nmstate.apply Ethernet interface enp130s0f0v0 does not exists
      2024-10-31 14:33:11.347 ERROR os_net_config.impl_nmstate.apply Desired route destination: 0.0.0.0/0 next-hop-interface: bond_api next-hop-address: 192.168.122.1 not found after apply
      2024-10-31 14:33:19.446 ERROR os_net_config.main ***Failed to configure with nmstate provider***
      2024-10-31 14:33:19.449 ERROR os_net_config.common.log_exceptions Traceback (most recent call last):

      The problem seems to be the SR-IOV VF are not properly created:

      [cloud-admin@compute-1 ~]$ sudo lshw -c network -businfo
      Bus info          Device       Class          Description
      =========================================================
      pci@0000:04:00.0  enp4s0f0np0  network        MT27800 Family [ConnectX-5]
      pci@0000:04:00.1  enp4s0f1np1  network        MT27800 Family [ConnectX-5]
      pci@0000:06:00.0               network        Ethernet Controller X710 for 10GbE SFP+
      pci@0000:06:00.1               network        Ethernet Controller X710 for 10GbE SFP+
      pci@0000:06:00.2  enp6s0f2     network        Ethernet Controller X710 for 10GbE SFP+
      pci@0000:06:00.3  enp6s0f3     network        Ethernet Controller X710 for 10GbE SFP+
      pci@0000:02:00.0  eno3         network        NetXtreme BCM5720 Gigabit Ethernet PCIe
      pci@0000:02:00.1  eno4         network        NetXtreme BCM5720 Gigabit Ethernet PCIe
      pci@0000:01:00.0  eno1         network        NetXtreme BCM5720 Gigabit Ethernet PCIe
      pci@0000:01:00.1  eno2         network        NetXtreme BCM5720 Gigabit Ethernet PCIe
      pci@0000:82:00.0  enp130s0f0   network        Ethernet Controller X710 for 10GbE SFP+
      pci@0000:82:00.1  enp130s0f1   network        Ethernet Controller X710 for 10GbE SFP+
      pci@0000:82:00.2               network        Ethernet Controller X710 for 10GbE SFP+
      pci@0000:82:00.3               network        Ethernet Controller X710 for 10GbE SFP+

      os-net-config RPM version:

      [cloud-admin@compute-1 ~]$ rpm -qi os-net-config
      Name        : os-net-config
      Version     : 18.0.1
      Release     : 18.0.20241018164709.cb46a31.el9ost
      Architecture: noarch
      Install Date: Tue 22 Oct 2024 11:45:57 AM EDT
      Group       : Unspecified
      Size        : 4392553
      License     : ASL 2.0
      Signature   : RSA/SHA256, Tue 22 Oct 2024 11:10:18 AM EDT, Key ID 199e2f91fd431d51
      Source RPM  : os-net-config-18.0.1-18.0.20241018164709.cb46a31.el9ost.src.rpm
      Build Date  : Tue 22 Oct 2024 10:03:25 AM EDT
      Build Host  : x86-64-03.build.eng.rdu2.redhat.com
      Packager    : Red Hat, Inc. <http://bugzilla.redhat.com/bugzilla>
      Vendor      : Red Hat, Inc.
      URL         : http://pypi.python.org/pypi/os-net-config
      Summary     : Host network configuration tool
      Description :
      Host network configuration tool for OpenStack.

              ksundara@redhat.com Karthik Sundaravel
              rdiazcam@redhat.com Ricardo Diaz Campos
              rhos-dfg-nfv
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated: