Skip to content

Metrics & Monitoring

Comprehensive monitoring and observability for your Mend deployment.

Mend exposes Prometheus metrics for monitoring performance, tracking usage, and alerting on issues.

GET /metrics

Public No authentication required

Terminal window
curl http://localhost:8080/metrics
MetricTypeDescription
mend_jobs_totalCounterTotal jobs processed by type and status
mend_job_duration_secondsHistogramJob processing duration
mend_jobs_in_progressGaugeCurrently processing jobs
MetricTypeDescription
mend_queue_depthGaugeCurrent queue depth by type
mend_queue_jobs_enqueued_totalCounterTotal jobs enqueued
mend_queue_jobs_dequeued_totalCounterTotal jobs dequeued
MetricTypeDescription
mend_worker_utilizationGaugeWorker utilization percentage
mend_memory_usage_bytesGaugeMemory usage
mend_disk_usage_bytesGaugeDisk usage

Import the provided Grafana dashboard for visualization:

Job Throughput

  • Jobs per second
  • Success/failure rates
  • Processing times

Queue Health

  • Queue depths
  • Wait times
  • Backlog trends

System Resources

  • CPU usage
  • Memory usage
  • Disk space

Example Prometheus alerting rules:

groups:
- name: mend
rules:
- alert: HighQueueDepth
expr: mend_queue_depth > 100
for: 5m
annotations:
summary: "Queue depth is high"
- alert: HighFailureRate
expr: rate(mend_jobs_total{status="failed"}[5m]) > 0.1
for: 5m
annotations:
summary: "Job failure rate is high"