mirror of https://github.com/vllm-project/vllm.git synced 2025-10-20 14:53:52 +08:00

Files

Harry Mellor 8c853050e7 [Docs] Enable fail_on_warning for the docs build in CI (#25580 )

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

2025-09-24 19:30:33 +00:00

1.7 KiB

Raw Blame History

Grafana Dashboards for vLLM Monitoring

This directory contains Grafana dashboard configurations (as JSON) designed to monitor vLLM performance and metrics.

Requirements

Grafana 8.0+
Prometheus data source configured in Grafana
vLLM deployment with Prometheus metrics enabled

Dashboard Descriptions

performance_statistics.json: Tracks performance metrics including latency and throughput for your vLLM service.
query_statistics.json: Tracks query performance, request volume, and key performance indicators for your vLLM service.

Deployment Options

Manual Import (Recommended)

The easiest way to use these dashboards is to manually import the JSON configurations directly into your Grafana instance:

Navigate to your Grafana instance
Click the '+' icon in the sidebar
Select 'Import'
Copy and paste the JSON content from the dashboard files, or upload the JSON files directly

Grafana Operator

If you're using the Grafana Operator in Kubernetes, you can wrap these JSON configurations in a GrafanaDashboard custom resource:

# Note: Adjust the instanceSelector to match your Grafana instance's labels
# You can check with: kubectl get grafana -o yaml
apiVersion: grafana.integreatly.org/v1beta1
kind: GrafanaDashboard
metadata:
  name: vllm-performance-dashboard
spec:
  instanceSelector:
    matchLabels:
      dashboards: grafana  # Adjust to match your Grafana instance labels
  folder: "vLLM Monitoring"
  json: |
    # Replace this comment with the complete JSON content from
    # performance_statistics.json - The JSON should start with { and end with }

Then apply to your cluster:

kubectl apply -f your-dashboard.yaml -n <namespace>

1.7 KiB Raw Blame History