Uploaded image for project: 'Red Hat OpenStack Services on OpenShift'
  1. Red Hat OpenStack Services on OpenShift
  2. OSPRH-12949

BZ#2327781 [OSP16.2] During FFU the overcloud upgrade run failed on networkers role node/s due to Error: invalid value all for cpuset cpus

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Undefined Undefined
    • rhos-16.2.z
    • rhos-16.2.z
    • puppet-ovn
    • None
    • False
    • Hide

      None

      Show
      None
    • False
    • Not Set
    • Not Set
    • Not Set
    • None
    • Moderate

      Description of problem:

      During FFU, and After running the the overcloud upgrade run, the ovn_controller container fails to start with the below error:

      ~~~
      "ERROR: Container ovn_controller exited with code 125 when runed\nstderr: Error: invalid value all for cpuset cpus\n"]}
      ~~~

      it seems it comes from here:
      $ cat hashed-ovn_controller.json
      {
      "cpuset_cpus": "all", <======
      "depends_on": [
      "openvswitch.service"
      ],
      "environment": {
      "KOLLA_CONFIG_STRATEGY": "COPY_ALWAYS"

      Checking the Cu templates, The parameter OVNContainerCpusetCpus is defined for all Compute roles (various roles), but not defined for controllers nor Networker nodes (where the issue is happening).

      To overcome the issue (pass the error), the parameter cpuset_cpus:0 was set 'manually' to '0' to all issued nodes on the hashed ovn_controller file., and then overcloud upgrade run

      • Cu needs to know where the cpuset_cpus": "all" came from ?
      • How to fix this issue without manually change this parameter or through templates ?
      • Why did the error pass and then ovn_controller become up after defining this parameter to cpuset_cpus": "0" ?.

      Version-Release number of selected component (if applicable):
      openstack-ovn-controller:16.2.6

      How reproducible:
      NA

      Steps to Reproduce:
      1.
      2.
      3.

      Actual results:
      After running the the overcloud upgrade run, the ovn_controller container fails to start

      Expected results:
      the ovn_controller become up with the default cpuset_cpus value After running the the overcloud upgrade run step

              jbadiapa@redhat.com Juan Payno
              jira-bugzilla-migration RH Bugzilla Integration
              Archana Singh Archana Singh
              rhos-dfg-upgrades
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated: