[Issue metadata] Bug | Resolution: Unresolved | Blocker | rhel-9.5 | Low | rhel-net-perf | ssg_core_services | _N&P-Refined_
We have a customer who reports unbound consistently eating up all available memory and triggering an OOM. The customer is running RHEL 9.5 and states that the same configuration works without issues on a CentOS 7 server. The system has a 4-core CPU and 16GB of RAM. The settings that appear to influence this behaviour are:
num-threads: 4
so-rcvbuf: 2m
so-sndbuf: 2m
outgoing-num-tcp: 1000
incoming-num-tcp: 1000
msg-cache-size: 1G
rrset-cache-size: 2G
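As a diagnostic aid (not something the customer has run yet), a minimal sketch that compares unbound's own cache accounting against the configured limits; it assumes the remote control interface is enabled (control-enable: yes) and reads the mem.cache.rrset / mem.cache.message counters that unbound-control stats_noreset reports:

#!/usr/bin/env python3
# Sketch: compare unbound's reported cache usage with the configured limits.
# Assumes unbound-control works on this host (control-enable: yes).
import subprocess

LIMITS = {
    "mem.cache.message": 1 * 1024**3,  # msg-cache-size: 1G
    "mem.cache.rrset":   2 * 1024**3,  # rrset-cache-size: 2G
}

# stats_noreset prints key=value lines without clearing the counters.
out = subprocess.run(["unbound-control", "stats_noreset"],
                     capture_output=True, text=True, check=True).stdout
stats = dict(line.split("=", 1) for line in out.splitlines() if "=" in line)

for key, limit in LIMITS.items():
    used = int(stats.get(key, 0))
    print(f"{key}: {used / 1024**2:.0f} MiB used of {limit / 1024**2:.0f} MiB configured")

If the caches stay within their limits while the process RSS keeps growing, that would point at memory outside the two caches.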
We ran some tests with lower limits and lower system resources, and eventually saw the same behaviour. Upstream provides a rough formula to estimate memory consumption based on the cache sizes, which essentially boils down to:
2.5 * (rrset-cache-size + msg-cache-size)
With the configured values (2G + 1G) this gives roughly 7.5GB, but unbound eventually eats up far more memory than that, triggering an OOM.
Additionally, according to the only relevant upstream report we could find, we can make a second memory usage estimate based on the number of TCP connections, the number of threads and the msg-buffer-size. This gives us:
(66k [msg-buffer-size] * 1000 [TCP connections per thread] * 2 [incoming + outgoing] * 4 [threads])
or roughly 512MB of additional memory. We are still well below 16GB, yet unbound eventually gets killed due to OOM.
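For clarity, the same arithmetic in a few lines of Python, using the customer's configured values; the 66k per-connection buffer is the figure quoted from the upstream report above:

# Back-of-the-envelope check of the two estimates above.
GiB = 1024**3

msg_cache   = 1 * GiB            # msg-cache-size: 1G
rrset_cache = 2 * GiB            # rrset-cache-size: 2G
cache_estimate = 2.5 * (rrset_cache + msg_cache)    # upstream rule of thumb

msg_buffer = 66_000              # "66k" buffer size from the upstream report
num_tcp    = 1000                # incoming-num-tcp / outgoing-num-tcp
threads    = 4                   # num-threads
tcp_estimate = msg_buffer * num_tcp * 2 * threads   # incoming + outgoing, all threads

total = cache_estimate + tcp_estimate
print(f"cache estimate : {cache_estimate / GiB:.2f} GiB")   # ~7.50 GiB
print(f"tcp estimate   : {tcp_estimate / GiB:.2f} GiB")     # ~0.49 GiB
print(f"total estimate : {total / GiB:.2f} GiB of 16 GiB RAM")

Even taking both terms together, the expected footprint is around 8GB, half of the available RAM.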
Disabling THP has no effect other than delaying when the OOM is triggered; it still happens sooner or later. The customer needs to restart unbound periodically (every ~3 days) to avoid the OOM.