● Production
LIVE
00:00:00 UTC
CPU Usage: 73% (16-core cluster avg, ↑ 12%)
Memory: 41% (26.4 GB / 64 GB, ↓ 3%)
Error Rate: 2.4% (p99 threshold: 1.0%, ↑ BREACH)
Throughput: 8.3k rps (peak today: 11.2k, → stable)
Request Rate & Error Rate · Last 60 minutes
[Chart: Request Rate (k rps) and Error Rate (%) with threshold line, -60m to now]
Service Health
api-gateway        42ms      99.99%
payments-svc       2140ms    98.2%
user-service       18ms      100%
ml-inference       310ms     99.94%
data-pipeline      890ms     99.4%
notification-svc   7ms       100%
postgres-primary   3ms       99.97%
redis-cache        0.8ms     100%
Build Pipelines
api-core: checkout → test → build → deploy → verify · deploying…
ml-service: checkout → test → build → deploy → verify · tests failed
frontend: checkout → test → build → deploy → verify · 31m ago