Loading...

XML

Word

Printable

Type: Bug
Resolution: Done
Priority: Major
Fix Version/s: 2.15.0 GA
Affects Version/s: SaaS, 2.14.0 GA, 2.14.0-mas (0.11.8)
Component/s: System
Labels:
- 2.15-to-test

Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
3Scale PT Tested upstream:
Not Started
3scale PT Docs:
Not Started
3scale PT Product Specs:
Not Started
3scale PT Product Update Ready:
Not Started
3scale PT Released In Saas:
Not Started
3scale PT Verified Product:
Not Started
Target Release:

2.15.0 GA
Intelligence Requested:
Market:

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

There is a bug in the setup of the Prometheus metrics for System, that affects specifically the Rails-related ones (not Sidekiq), and exposed at /yabeda-metrics.
The problem is that each Unicorn worker collects and reports its own metrics, so what's aggregated in Prometheus is not correct (as counter metrics jump back and forth, the resulting numbers are wrong).

We're using yabeda-prometheus-mmap based on prometheus-client-mmap which supports multi-process environments, but for that we need to set prometheus_multiproc_dir environment variable to ensure all the processes (e.g. all unicorn workers) report to aggregated metrics counters.