Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-38468

MicroShift doesn't auto detect the MTU value at start up

XMLWordPrintable

    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • 5
    • None
    • None
    • None
    • None
    • uShift Sprint 263, uShift Sprint 264, uShift Sprint 265, uShift Sprint 266
    • 4
    • Done
    • Known Issue
    • Hide
      The maximum transmission unit (MTU) value in MicroShift OVN-K overlay networking must be 100 bytes smaller than the MTU value of the base network. MicroShift automatically configures the value using the MTU value of the default gateway of the host. If the auto-configuration does not work correctly, the MTU value must be configured manually.
      Show
      The maximum transmission unit (MTU) value in MicroShift OVN-K overlay networking must be 100 bytes smaller than the MTU value of the base network. MicroShift automatically configures the value using the MTU value of the default gateway of the host. If the auto-configuration does not work correctly, the MTU value must be configured manually.
    • None
    • None
    • None
    • None

      Description of problem:

      When starting MicroShift in a VM using OpenShift Virtualization, with ovn in the picture, it uses part of the MTU for the encapsulation, so VMs have a smaller MTU and it's not detected by MicroShift. Therefore, specifying the MTU value manually in /etc/microshift/ovn.yaml is required.

      Version-Release number of selected component (if applicable):

      4.15 & 4.16    

      How reproducible:

      Always, when starting MicroShift on a VM in OpenShift Virtualization

      Steps to Reproduce:

          1. Create a RHEL VM in OpenShift Virtualization
          2. Try to start MicroShift on it 
          3. Check pods in CrashLoopBack and Pending state.

       

      alternative Reproduction steps (can be performed on any setup):

      - change the MTU on the external facing interface
         nmcli c modify enp1s0 802-3-ethernet.mtu 1300 && nmcli c up enp1s0
      - delete the ovn-master POD so it will be recreated
         oc delete pod/ovnkube-master-kpff2 -n openshift-ovn-kubernetes
      - Check pods in CrashLoopBack and Pending state.
          

       

       

      Actual results:

      The openshift-ovn-kubernetes pod falls in CrashLoopBackOff, the ingress, kube-system, storage and serivce-ca pods shows Pending.  

      Expected results:

      MicroShift auto detects the MTU value needed in the network infrastructure and this value is updated during the start up so all the pods are Running. 

      Additional info:

      ● microshift.service - MicroShift
           Loaded: loaded (/usr/lib/systemd/system/microshift.service; enabled; prese>
           Active: active (running) since Thu 2024-04-11 04:29:04 EDT; 3min 52s ago
         Main PID: 18912 (microshift)
            Tasks: 12 (limit: 48492)
           Memory: 310.2M
              CPU: 31.698s
           CGroup: /system.slice/microshift.service
                   └─18912 microshift run
      
      Apr 11 04:35:30 edge microshift[18912]: kubelet E0411 04:35:30.603473   18912 kubelet.go:2869] "Container runtime network not ready" networkReady="NetworkReady=false reason:NetworkPluginNotReady message:Network plugin returns error: No CNI configuration file in /etc/cni/net.d/. Has your network provider started?"
      Apr 11 04:35:31 edge microshift[18912]: kubelet E0411 04:35:31.620672   18912 pod_workers.go:1300] "Error syncing pod, skipping" err="failed to \"StartContainer\" for \"ovnkube-master\" with CrashLoopBackOff: \"back-off 5m0s restarting failed container=ovnkube-master pod=ovnkube-master-ql5k7_openshift-ovn-kubernetes (97a1a2cf-e653-4cf8-bafb-b0188488ac3d)\"" pod="openshift-ovn-kubernetes/ovnkube-master-ql5k7" podUID="97a1a2cf-e653-4cf8-bafb-b0188488ac3d"
      Apr 11 04:35:33 edge microshift[18912]: kube-controller-manager I0411 04:35:33.812683   18912 node_lifecycle_controller.go:785] "Node is NotReady. Adding it to the Taint queue" node="edge" timeStamp="2024-04-11 04:35:33.812595792 -0400 EDT m=+457.202568567"
      
      ----------------------------------------------------------------------
      [cloud-user@edge ~]$ oc get pods -A
      NAMESPACE                  NAME                                       READY   STATUS             RESTARTS      AGE
      kube-system                csi-snapshot-controller-6686957bb9-vz8tv   0/1     Pending            0             17m
      kube-system                csi-snapshot-webhook-64455cd68b-47vjj      0/1     Pending            0             17m
      openshift-dns              node-resolver-247rt                        1/1     Running            0             17m
      openshift-ingress          router-default-65757846cd-4chhm            0/1     Pending            0             17m
      openshift-ovn-kubernetes   ovnkube-master-ql5k7                       3/4     CrashLoopBackOff   8 (23s ago)   17m
      openshift-ovn-kubernetes   ovnkube-node-6w4p8                         1/1     Running            1 (16m ago)   17m
      openshift-service-ca       service-ca-6dbd7c5ddc-2pqpj                0/1     Pending            0             17m
      openshift-storage          topolvm-controller-597486954b-9ffz9        0/5     Pending            0             17m

              eslutsky Evgeny Slutsky
              dialvare@redhat.com Diego Alvarez Ponce
              None
              None
              John George John George
              Shauna Diaz Shauna Diaz
              Votes:
              0 Vote for this issue
              Watchers:
              8 Start watching this issue

                Created:
                Updated:
                Resolved: