Loading...

XML

Word

Printable

Type: Bug
Resolution: Done-Errata
Priority: Blocker
Fix Version/s: RH-SSO-7.6.5
Affects Version/s: RH-SSO-7.6.2
Component/s: Distribution
Labels:
- rh-sso-release-candidate
- support

Blocked:
False
Blocked Reason:
None
Ready:
False
Epic Link:
RHSSO-2549
Workaround Description:
Hide

The only "workaround" is to temporarily set the RH SSO Operator to unmanaged, remove the Probes and ask the customers to closely monitor their environment in the meantime (since the issue is with the Probes only and the RH SSO image itself works just fine). Obviously this is a very problematic approach but it's currently is the only single known solution to restore a running Production environment:

1. Patch 'Keycloak' to have the pods 'unmanaged' by the RH SSO Operator:

$ oc patch Keycloak <KEYCLOAK> --type merge -p '{"spec":{"unmanaged":true}}'

2. Edit StatefulSet/keycloak:

$ oc edit StatefulSet keycloak

3. Remove the Liveness and Readiness Probes through deleting the lines below:

... livenessProbe: exec: command: - /bin/sh - -c - /probes/liveness_probe.sh failureThreshold: 10 initialDelaySeconds: 30 periodSeconds: 30 successThreshold: 1 timeoutSeconds: 22 ... readinessProbe: exec: command: - /bin/sh - -c - /probes/readiness_probe.sh failureThreshold: 10 initialDelaySeconds: 40 periodSeconds: 30 successThreshold: 1 timeoutSeconds: 22 ...

4. Save and quit using the default vi commands :wq or :x.

5. Delete the keycloak-0 pod in order to force a restart from it:

$ oc delete pod keycloak-0

6. Customers will need to closely monitor their environment due to the unmanaged status and the Probes removed, until this issue is addressed. As mentioned, this is a last resort workaround to restore their environments.
Show
The only "workaround" is to temporarily set the RH SSO Operator to unmanaged , remove the Probes and ask the customers to closely monitor their environment in the meantime (since the issue is with the Probes only and the RH SSO image itself works just fine). Obviously this is a very problematic approach but it's currently is the only single known solution to restore a running Production environment: 1. Patch 'Keycloak' to have the pods 'unmanaged' by the RH SSO Operator: $ oc patch Keycloak <KEYCLOAK> --type merge -p '{ "spec" :{ "unmanaged" : true }}' 2. Edit StatefulSet/keycloak : $ oc edit StatefulSet keycloak 3. Remove the Liveness and Readiness Probes through deleting the lines below: ... livenessProbe: exec: command: - /bin/sh - -c - /probes/liveness_probe.sh failureThreshold: 10 initialDelaySeconds: 30 periodSeconds: 30 successThreshold: 1 timeoutSeconds: 22 ... readinessProbe: exec: command: - /bin/sh - -c - /probes/readiness_probe.sh failureThreshold: 10 initialDelaySeconds: 40 periodSeconds: 30 successThreshold: 1 timeoutSeconds: 22 ... 4. Save and quit using the default vi commands :wq or :x . 5. Delete the k eycloak-0 pod in order to force a restart from it: $ oc delete pod keycloak-0 6. Customers will need to closely monitor their environment due to the unmanaged status and the Probes removed, until this issue is addressed. As mentioned, this is a last resort workaround to restore their environments.
Steps to Reproduce:

Hide

Either perform a manual RH SSO Operator upgrade from rhsso-operator.7.6.1-opr-005 to rhsso-operator.7.6.2-opr-001 or wait for an Automatic upgrade depending on the installPlanApproval policy from the RH SSO Operator Subscription object.

Show
Either perform a manual RH SSO Operator upgrade from rhsso-operator.7.6.1-opr-005 to rhsso-operator.7.6.2-opr-001 or wait for an Automatic upgrade depending on the installPlanApproval policy from the RH SSO Operator Subscription object.
Intelligence Requested:
Market:

SFDC Cases Links:
SFDC Cases Counter:
SFDC Cases Open:

This issue affects specifically customers that have FIPS enabled in their OpenShift Cluster but disabled it for the RH SSO Operator through an Environment Variable as follows:

- apiVersion: keycloak.org/v1alpha1
  kind: Keycloak
  ...
  spec:
    keycloakDeploymentSpec:
      experimental:
        env:
        - name: JAVA_TOOL_OPTIONS
          value: -Dcom.redhat.fips=false
    ...

The Liveness and Readiness Probes (which were working normally in the latest RH SSO 7.6.1 Operator release - rhsso-operator.7.6.0-opr-001) are now failing as below:

  message: |
    Liveness probe failed: {
        "probe.eap.dmr.EapProbe": "Error sending probe request: [digital envelope routines: EVP_DigestInit_ex] disabled for FIPS",
        "probe.eap.dmr.HealthCheckProbe": "Error sending probe request: [digital envelope routines: EVP_DigestInit_ex] disabled for FIPS"
    }
    INFO Using the 'ejRKSfxZsUFwrAiqhvfSTPvUzjxfwOvx' username to authenticate the probe request against the JBoss DMR API.
    INFO Using the 'ejRKSfxZsUFwrAiqhvfSTPvUzjxfwOvx' username to authenticate the probe request against the JBoss DMR API.

...

  message: |
    (combined from similar events): Readiness probe failed: {
        "probe.eap.dmr.EapProbe": "Error sending probe request: [digital envelope routines: EVP_DigestInit_ex] disabled for FIPS",
        "probe.eap.dmr.HealthCheckProbe": "Error sending probe request: [digital envelope routines: EVP_DigestInit_ex] disabled for FIPS"
    }
    INFO Using the 'ejRKSfxZsUFwrAiqhvfSTPvUzjxfwOvx' username to authenticate the probe request against the JBoss DMR API.
    INFO Using the 'ejRKSfxZsUFwrAiqhvfSTPvUzjxfwOvx' username to authenticate the probe request against the JBoss DMR API.

Full details will be attached in the events.yaml file from the customer.

This issue has been reported by at least 2 customers, both with FIPS enabled environments (and disabling for the RH SSO Operator as mentioned above).

It's important to highlight that it only affects the Liveness and Readiness Probes and the RH SSO Operator is still able to start and run normally the OpenShift image with the Probes disabled (more information at the Workaround Section)

While we don't officially support RH SSO in FIPS, customers were able to use the RH SSO Operator (and also the Template / JDBC Base image) normally with JAVA_TOOL_OPTIONS=-Dcom.redhat.fips=false (Example Jira where we assisted customers on deploying RH SSO in their OpenShift enabled FIPS environments: SSOSUP-162) and as mentioned above this issue is limited to the Liveness and Readiness Probes and not the OpenShift image itself.

It's expected that other customers that applied the same workaround to have the RH SSO Operator working on their FIPS environment might also experience the same issue.

In addition to the Events .yaml file I will also attach other details from both Cases that will be linked to this Bug.

NOTE: I don't have an OpenShift FIPS enabled environment to reproduce this issue, however the "Workaround" has been tested and confirmed to work by at least one of the customers.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

03451467-inspect.local.zip
656 kB
2023/03/03 7:17 PM
03451467-keycloak.yaml
2 kB
2023/03/03 7:16 PM
03451467-misc.txt
6 kB
2023/03/03 7:19 PM
events.yaml
459 kB
2023/03/03 7:03 PM

is duplicated by

RHSSO-2661 RH SSO 7.6.5 in OpenShift fails to start

Closed

links to

RHBA-2023:120513 Red Hat Single Sign-On 7.6.5 for OpenShift image enhancement update

mentioned on

Merge request - [RHSS-2364] using fips approved digests for http management access

Merge request - Draft: [RHSSO-2364] Use only FIPS 140-2 approved SHA-256 & SHA-512/256 message digest algorithms for the HTTP Digest authentication of the mngmt endpoint

Solved by commit 4d8425cf072f28ce0bb84741b50e7ed0b3e1091d.

Assignee:: Steven Hawkins

Reporter:: Estevao Konecsni

Votes:: 4 Vote for this issue

Watchers:: 24 Start watching this issue

Created:: 2023/03/03 7:01 PM

Updated:: 2023/09/22 2:55 PM

Resolved:: 2023/09/20 5:25 PM

Details

Description

Attachments

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates