Uploaded image for project: 'RHEL'
  1. RHEL
  2. RHEL-84182

smartd.service: Main process exited, code=dumped

Linking RHIVOS CVEs to...Migration: Automation ...RHELPRIO AssignedTeam ...SWIFT: POC ConversionSync from "Extern...XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Undefined Undefined
    • None
    • rhel-10.0
    • smartmontools
    • None
    • No
    • None
    • rhel-base-utils-core
    • ssg_core_services
    • None
    • False
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • None
    • None
    • Unspecified
    • Unspecified
    • Unspecified
    • x86_64
    • None

      What were you trying to do that didn't work?

      On my system with Fibre Channel LUNS, I installed smartmontools and attempted to start smartd:

       

      # systemctl start smartd
      Job for smartd.service failed because a fatal signal was delivered causing the control process to dump core.
      See "systemctl status smartd.service" and "journalctl -xeu smartd.service" for details.
      

      Below is from journalctl at the time of failure:

       

      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com systemd[1]: Starting smartd.service - Self Monitoring and Reporting Technology (SMART) Daemon...
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com smartd[7995]: smartd 7.4 2023-08-01 r5530 [x86_64-linux-6.12.0-55.2.1.el10_0.x86_64] (local build)
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com smartd[7995]: Opened configuration file /etc/smartmontools/smartd.conf
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com smartd[7995]: Configuration file /etc/smartmontools/smartd.conf was parsed, found DEVICESCAN, scanning devices
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com smartd[7995]: Device: /dev/sda, type changed from 'scsi' to 'sat'
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com smartd[7995]: Device: /dev/sda [SAT], opened
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com smartd[7995]: Device: /dev/sda [SAT], SSDSC2KG480G8R, S/N:PHYG002400ND480BGN, WWN:5-5cd2e4-152174ea5, FW:XCV1DL69, 480 GB
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com smartd[7995]: Device: /dev/sda [SAT], found in smartd database 7.3/5610: Dell Certified Intel S4x00/D3-S4x10 Series SSDs
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com smartd[7995]: Device: /dev/sda [SAT], is SMART capable. Adding to "monitor" list.
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com smartd[7995]: Device: /dev/sdb, type changed from 'scsi' to 'sat'
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com smartd[7995]: Device: /dev/sdb [SAT], opened
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com smartd[7995]: Device: /dev/sdb [SAT], SSDSC2KG480G8R, S/N:PHYG002400Y8480BGN, WWN:5-5cd2e4-152171c03, FW:XCV1DL69, 480 GB
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com smartd[7995]: Device: /dev/sdb [SAT], found in smartd database 7.3/5610: Dell Certified Intel S4x00/D3-S4x10 Series SSDs
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com smartd[7995]: Device: /dev/sdb [SAT], is SMART capable. Adding to "monitor" list.
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com smartd[7995]: Device: /dev/sdc, opened
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com smartd[7995]: *** buffer overflow detected ***: terminated
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com systemd-coredump[7996]: Process 7995 (smartd) of user 0 terminated abnormally with signal 6/ABRT, processing...
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com systemd[1]: Created slice system-systemd\x2dcoredump.slice - Slice /system/systemd-coredump.
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com systemd[1]: Started systemd-coredump@0-7996-0.service - Process Core Dump (PID 7996/UID 0).
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com systemd-coredump[7997]: [🡕] Process 7995 (smartd) of user 0 dumped core.
                                                                                       
                                                                                       Module libpcre2-8.so.0 from rpm pcre2-10.44-1.el10.3.x86_64
                                                                                       Module libcap.so.2 from rpm libcap-2.69-7.el10.x86_64
                                                                                       Module libselinux.so.1 from rpm libselinux-3.8-1.el10.x86_64
                                                                                       Module libsystemd.so.0 from rpm systemd-257-9.el10.x86_64
                                                                                       Module libcap-ng.so.0 from rpm libcap-ng-0.8.4-6.el10.x86_64
                                                                                       Stack trace of thread 7995:
                                                                                       #0  0x00007fb5f980fe9c __pthread_kill_implementation (libc.so.6 + 0x96e9c)
                                                                                       #1  0x00007fb5f97b9a96 raise (libc.so.6 + 0x40a96)
                                                                                       #2  0x00007fb5f97a18fa abort (libc.so.6 + 0x288fa)
                                                                                       #3  0x00007fb5f97a2956 __libc_message_impl.cold (libc.so.6 + 0x29956)
                                                                                       #4  0x00007fb5f988ae8b __fortify_fail (libc.so.6 + 0x111e8b)
                                                                                       #5  0x00007fb5f988a816 __chk_fail (libc.so.6 + 0x111816)
                                                                                       #6  0x00007fb5f988befd __snprintf_chk (libc.so.6 + 0x112efd)
                                                                                       #7  0x000055a22bde5e8c n/a (n/a + 0x0)
                                                                                       #8  0x3898a00000585050 n/a (n/a + 0x0)
                                                                                       ELF object binary architecture: AMD x86-64
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com systemd[1]: smartd.service: Main process exited, code=dumped, status=6/ABRT
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com systemd[1]: smartd.service: Failed with result 'core-dump'.
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com systemd[1]: Failed to start smartd.service - Self Monitoring and Reporting Technology (SMART) Daemon.
      Mar 19 14:45:47 storageqe-14.fast.eng.rdu2.dc.redhat.com systemd[1]: systemd-coredump@0-7996-0.service: Deactivated successfully.
      Mar 19 14:45:53 storageqe-14.fast.eng.rdu2.dc.redhat.com systemd[1]: systemd-hostnamed.service: Deactivated successfully.
      

      On the same system with RHEL-9.6, I did not encounter this issue. It is only reproducible on RHEL-10

      What is the impact of this issue to you? 

      Please provide the package NVR for which the bug is seen:

      1. rpm -qa smartmontools
        smartmontools-7.4-7.el10.x86_64
      1. uname -r
        6.12.0-55.2.1.el10_0.x86_64

        How reproducible is this bug?: Often

        Steps to reproduce

      2. see above

              mhlavink Michal Hlavinka
              mpatalan Marco Patalano
              Michal Hlavinka Michal Hlavinka
              RHEL SST CS base utils QE Bot RHEL SST CS base utils QE Bot
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated: