Loading...

XML

Word

Printable

Type: Bug
Resolution: Not a Bug
Priority: Normal
Fix Version/s: None
Affects Version/s: 4.16.z
Component/s: Etcd
Labels:
- etcd

Severity:
Moderate
Regression:
None
Blocked:
False
Blocked Reason:

Hide

None

Show
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Impact Score:

Description of problem:

ETCD showing multiple warnings like this:

2024-10-25T05:44:29.451067060Z {"level":"warn","ts":"2024-10-25T05:44:29.44813Z","caller":"embed/config_logging.go:169","msg":"rejected connection","remote-addr":"[::1]:49220","server-name":"localhost","error":"EOF"}

2024-10-25T05:55:29.456816446Z {"level":"warn","ts":"2024-10-25T05:55:29.456708Z","caller":"embed/config_logging.go:169","msg":"rejected connection","remote-addr":"[::1]:40788","server-name":"localhost","error":"EOF"}

2024-10-25T06:15:49.292624944Z {"level":"warn","ts":"2024-10-25T06:15:49.290607Z","caller":"embed/config_logging.go:169","msg":"rejected connection","remote-addr":"10.102.12.7:55766","server-name":"","error":"EOF"}

2024-10-25T06:19:28.789618300Z {"level":"warn","ts":"2024-10-25T06:19:28.778419Z","caller":"embed/config_logging.go:169","msg":"rejected connection","remote-addr":"10.102.12.7:50446","server-name":"","error":"EOF"}

The port number mentioned in the error above changes between warnings time to time and is not related to a firewall blocking the connections as verified.

The port numbers, as indicated above, are also not documented[1] in our official documentation for allowing access in the firewall or infra

[1] https://docs.openshift.com/container-platform/4.16/installing/install_config/configuring-firewall.html

There are no pending CSRs so this does not appear to be a certificate issue.

The etcd pods appear to be healthy:

$ omc get po -n openshift-etcd

NAME                                READY   STATUS      RESTARTS   AGE
etcd-guard-svitmadocpro01           1/1     Running     0          98d
etcd-guard-svitmadocpro02           1/1     Running     1          98d
etcd-guard-svitmadocpro03           1/1     Running     2          98d
etcd-svitmadocpro01                 4/4     Running     4          98d
etcd-svitmadocpro02                 4/4     Running     9          98d
etcd-svitmadocpro03                 4/4     Running     12         98d
revision-pruner-12-svitmadocpro01   0/1     Completed   0          98d
revision-pruner-12-svitmadocpro02   0/1     Completed   0          98d
revision-pruner-12-svitmadocpro03   0/1     Completed   0          98d

Based on the below test, ETCD health looks to be fine:

sh-5.1# etcdctl member list -w table +------------------+---------+----------------+-----------------------------+-----------------------------+------------+ | ID | STATUS | NAME | PEER ADDRS | CLIENT ADDRS | IS LEARNER | +------------------+---------+----------------+-----------------------------+-----------------------------+------------+ | 329b4a7d0a3109a0 | started | svitmadocpro01 | https://10.102.12.6:2380 | https://10.102.12.6:2379 | false | | d75585bb6c33f85c | started | svitmadocpro03 | https://10.223.254.235:2380 | https://10.223.254.235:2379 | false | | e170ffaf8e78163e | started | svitmadocpro02 | https://10.102.12.7:2380 | https://10.102.12.7:2379 | false | +------------------+---------+----------------+-----------------------------+-----------------------------+------------+ sh-5.1# etcdctl endpoint health --cluster https://10.102.12.6:2379 is healthy: successfully committed proposal: took = 10.140173ms https://10.102.12.7:2379 is healthy: successfully committed proposal: took = 15.744742ms https://10.223.254.235:2379 is healthy: successfully committed proposal: took = 22.150999ms sh-5.1# etcdctl endpoint status -w table +-----------------------------+------------------+---------+---------+-----------+------------+-----------+------------+--------------------+--------+ | ENDPOINT | ID | VERSION | DB SIZE | IS LEADER | IS LEARNER | RAFT TERM | RAFT INDEX | RAFT APPLIED INDEX | ERRORS | +-----------------------------+------------------+---------+---------+-----------+------------+-----------+------------+--------------------+--------+ | https://10.102.12.6:2379 | 329b4a7d0a3109a0 | 3.5.14 | 233 MB | true | false | 83 | 200457558 | 200457558 | | | https://10.102.12.7:2379 | e170ffaf8e78163e | 3.5.14 | 242 MB | false | false | 83 | 200457558 | 200457558 | | | https://10.223.254.235:2379 | d75585bb6c33f85c | 3.5.14 | 233 MB | false | false | 83 | 200457558 | 200457558 | | +-----------------------------+------------------+---------+---------+-----------+------------+-----------+------------+--------------------+--------+ sh-5.1#

Version-Release number of selected component (if applicable):
OpenShift v4.16.2

How reproducible:

Steps to Reproduce:

    1.
    2.
    3.

Actual results:

ETCD showing multiple warnings and port numbers mentioned in the error messages  are also not being used inside the cluster anywhere.

Expected results:

ETCD should not show multiple warnings like this.

Additional info:

Assignee:: Dean West

Reporter:: Mridul Markandey

QA Contact:: Ge Liu

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 2024/11/20 6:58 AM

Updated:: 2024/11/25 10:37 AM

Resolved:: 2024/11/25 10:37 AM

Details

Description

Attachments

Easy Agile Planning Poker

Activity

People

Dates