OpenShift Bugs / OCPBUGS-2729

should ignore enP.* NICs from node-exporter on Azure cluster

Details

    • Bug
    • Resolution: Done
    • Normal
    • None
    • 4.12.0
    • Monitoring
    • None
    • Low
    • False
    • NA

    Description

      Description of problem:

      This issue was found while verifying OCPBUGS-1321 on an Azure cluster: node-exporter reports enP.* devices.

      # oc get infrastructures/cluster -o jsonpath="{.spec.platformSpec.type}"
      Azure
      
      # oc get network cluster -o jsonpath="{.spec.networkType}"
      OpenShiftSDN
      
      # token=`oc create token prometheus-k8s -n openshift-monitoring`  
      # oc -n openshift-monitoring exec -c prometheus prometheus-k8s-0 -- curl -k -H "Authorization: Bearer $token" 'https://prometheus-k8s.openshift-monitoring.svc:9091/api/v1/query?' --data-urlencode 'query=group by(device) (node_network_info)' | jq
      {
        "status": "success",
        "data": {
          "resultType": "vector",
          "result": [
            {
              "metric": {
                "device": "lo"
              },
              "value": [
                1666319847.258,
                "1"
              ]
            },
            {
              "metric": {
                "device": "enP49203s1"
              },
              "value": [
                1666319847.258,
                "1"
              ]
            },
            {
              "metric": {
                "device": "eth0"
              },
              "value": [
                1666319847.258,
                "1"
              ]
            },
            {
              "metric": {
                "device": "enP30235s1"
              },
              "value": [
                1666319847.258,
                "1"
              ]
            },
            {
              "metric": {
                "device": "enP12948s1"
              },
              "value": [
                1666319847.258,
                "1"
              ]
            },
            {
              "metric": {
                "device": "enP51324s1"
              },
              "value": [
                1666319847.258,
                "1"
              ]
            },
            {
              "metric": {
                "device": "enP21301s1"
              },
              "value": [
                1666319847.258,
                "1"
              ]
            },
            {
              "metric": {
                "device": "enP26677s1"
              },
              "value": [
                1666319847.258,
                "1"
              ]
            },
            {
              "metric": {
                "device": "vxlan_sys_4789"
              },
              "value": [
                1666319847.258,
                "1"
              ]
            },
            {
              "metric": {
                "device": "ovs-system"
              },
              "value": [
                1666319847.258,
                "1"
              ]
            }
          ]
        }
      }
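
A response like the one above can be checked for enP.* devices with a short script. This is a minimal sketch, assuming the JSON has been saved as text; the function name `enp_devices` and the trimmed `SAMPLE_RESPONSE` are illustrative and not part of the original report.

```python
import json
import re

# Anchored form of the enP.* pattern named in this report.
ENP_PATTERN = re.compile(r"^enP.*$")


def enp_devices(response_text):
    """Return the sorted device labels in a query response that match enP.*."""
    payload = json.loads(response_text)
    if payload.get("status") != "success":
        raise ValueError("Prometheus query did not succeed")
    devices = (
        sample["metric"].get("device", "")
        for sample in payload["data"]["result"]
    )
    return sorted(name for name in devices if ENP_PATTERN.match(name))


# Trimmed sample: three of the device entries from the output above.
SAMPLE_RESPONSE = json.dumps({
    "status": "success",
    "data": {
        "resultType": "vector",
        "result": [
            {"metric": {"device": "lo"}, "value": [1666319847.258, "1"]},
            {"metric": {"device": "enP49203s1"}, "value": [1666319847.258, "1"]},
            {"metric": {"device": "eth0"}, "value": [1666319847.258, "1"]},
        ],
    },
})

if __name__ == "__main__":
    print(enp_devices(SAMPLE_RESPONSE))
```

An empty list from `enp_devices` would indicate that no enP.* device is reported.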
      

      According to network QE, the enP.* NICs (for example, enP49203s1) are virtual NICs, so node-exporter should ignore them.

      # oc debug node/**-k8wdf-master-0
      Temporary namespace openshift-debug-lg44p is created for debugging node...
      Starting pod/**-k8wdf-master-0-debug ...
      To use host binaries, run `chroot /host`
      Pod IP: 10.0.0.6
      If you don't see a command prompt, try pressing enter.
      sh-4.4# chroot /host
      sh-4.4# ifconfig -a
      enP49203s1: flags=6211<UP,BROADCAST,RUNNING,SLAVE,MULTICAST>  mtu 1500
              inet 10.0.0.6  netmask 255.255.255.0  broadcast 10.0.0.255
              inet6 fe80::20d:3aff:fe77:d5f0  prefixlen 64  scopeid 0x20<link>
              ether 00:0d:3a:77:d5:f0  txqueuelen 1000  (Ethernet)
              RX packets 10255342  bytes 7578264248 (7.0 GiB)
              RX errors 0  dropped 0  overruns 0  frame 0
              TX packets 8300084  bytes 4603637695 (4.2 GiB)
              TX errors 0  dropped 0 overruns 0  carrier 0  collisions 0
      ...
      sh-4.4# nmcli
      enP49203s1: connected to Wired Connection
              "Mellanox MT27800"
              ethernet (mlx5_core), 00:0D:3A:77:D5:F0, hw, mtu 1500
              inet4 10.0.0.6/24
              route4 10.0.0.0/24 metric 101
              route4 default via 10.0.0.1 metric 101
              route4 168.63.129.16/32 via 10.0.0.1 metric 101
              route4 169.254.169.254/32 via 10.0.0.1 metric 101
              inet6 fe80::20d:3aff:fe77:d5f0/64
              route6 fe80::/64 metric 1024 

      Version-Release number of selected component (if applicable):

      # oc get clusterversion
      NAME      VERSION                              AVAILABLE   PROGRESSING   SINCE   STATUS
      version   4.12.0-0.nightly-2022-10-20-104328   True        False         79m     Cluster version is 4.12.0-0.nightly-2022-10-20-104328
      

      How reproducible:

      On an Azure cluster.

      Steps to Reproduce:

      1. Launch a 4.12 Azure cluster.
      2. Run the following PromQL query: "group by(device) (node_network_info)"
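
To narrow step 2 to the devices in question, the same query can carry a label matcher on `device`. The sketch below builds the form-encoded `query` parameter for the Prometheus instant-query endpoint used earlier in this report. The helper name `build_query_body` is hypothetical, and the restricted query is a suggested variation, not one that was run in the original report.

```python
from urllib.parse import parse_qs, urlencode


def build_query_body(device_regex=None):
    """Build the form-encoded body for /api/v1/query, optionally limited by device name."""
    selector = "node_network_info"
    if device_regex is not None:
        # =~ is PromQL's regex label matcher; Prometheus anchors the regex itself.
        selector += '{device=~"%s"}' % device_regex
    return urlencode({"query": "group by(device) (%s)" % selector})


if __name__ == "__main__":
    body = build_query_body("enP.*")
    # Decode again to show the PromQL expression that would be sent.
    print(parse_qs(body)["query"][0])
```

Called without an argument, the helper reproduces the query from step 2 unchanged.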
      

      Actual results:

      enP.* NICs appear in the query result.

      Expected results:

      node-exporter should ignore enP.* NICs on Azure clusters, so they do not appear in the query result.
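
The expected result can be illustrated as a device-name exclusion filter. This is a sketch only, not node-exporter's code. It assumes the exclusion is expressed as an anchored regular expression, which is how node-exporter device filters such as `--collector.netclass.ignored-devices` are written. Whether that flag is the one changed for this bug is an assumption, and `IGNORED_DEVICES` and `visible_devices` are hypothetical names.

```python
import re

# Hypothetical exclusion pattern for illustration; the real node-exporter
# configuration is not shown in this report.
IGNORED_DEVICES = re.compile(r"^(enP.*)$")

# Device names copied from the query output in this report.
REPORTED_DEVICES = [
    "lo", "enP49203s1", "eth0", "enP30235s1", "enP12948s1",
    "enP51324s1", "enP21301s1", "enP26677s1", "vxlan_sys_4789", "ovs-system",
]


def visible_devices(devices, ignored=IGNORED_DEVICES):
    """Return the devices that would remain after applying the exclusion pattern."""
    return [name for name in devices if not ignored.match(name)]


if __name__ == "__main__":
    print(visible_devices(REPORTED_DEVICES))
```

The pattern is case-sensitive, so a lowercase name such as `enp0s3` would not be excluded by it.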

      Additional info:

       

          People

            hasun@redhat.com Haoyu Sun
            juzhao@redhat.com Junqi Zhao
            Tai Gao Tai Gao