Monitoring Stack
Risoluto exposes Prometheus metrics atGET /metrics. This recipe wires up Prometheus + Grafana for dashboards and alerting.
Available metrics
| Metric | Type | Description |
|---|---|---|
risoluto_http_requests_total | Counter | HTTP requests by method and status |
risoluto_http_request_duration_seconds | Histogram | Request latency distribution (buckets) |
risoluto_orchestrator_polls_total | Counter | Poll cycles by status (success, error, skipped) |
risoluto_agent_runs_total | Counter | Agent completions by outcome (completed, failed, oom, stalled) |
Setup
Configure Grafana
- Open Grafana at http://localhost:3001 (admin / changeme)
- Data Sources > Add Prometheus:
http://prometheus:9090 - Import or build dashboards using the queries below
Grafana dashboard queries
Agent success rate (1h window)
Active polls per minute
HTTP error rate
P95 API latency
Agent runs by outcome (stacked)
Alert rules
Add alert rules to your Prometheus config or a separaterules.yml:
Scraping from outside Docker
If Risoluto runs directly on the host (not in a container), point Prometheus athost.docker.internal:4000 or your machine’s LAN IP:
What’s Next
Observability
Full observability reference — SSE events, audit logs, and data persistence.
Troubleshooting
Common failure cases and recovery procedures.
Notifications
Configure Slack notifications for agent lifecycle events.