mirror of
https://github.com/vllm-project/vllm.git
synced 2025-10-20 14:53:52 +08:00
1.7 KiB
1.7 KiB
Grafana Dashboards for vLLM Monitoring
This directory contains Grafana dashboard configurations (as JSON) designed to monitor vLLM performance and metrics.
Requirements
- Grafana 8.0+
- Prometheus data source configured in Grafana
- vLLM deployment with Prometheus metrics enabled
Dashboard Descriptions
- performance_statistics.json: Tracks performance metrics including latency and throughput for your vLLM service.
- query_statistics.json: Tracks query performance, request volume, and key performance indicators for your vLLM service.
Deployment Options
Manual Import (Recommended)
The easiest way to use these dashboards is to manually import the JSON configurations directly into your Grafana instance:
- Navigate to your Grafana instance
- Click the '+' icon in the sidebar
- Select 'Import'
- Copy and paste the JSON content from the dashboard files, or upload the JSON files directly
Grafana Operator
If you're using the Grafana Operator
in Kubernetes, you can wrap these JSON configurations in a GrafanaDashboard
custom
resource:
# Note: Adjust the instanceSelector to match your Grafana instance's labels
# You can check with: kubectl get grafana -o yaml
apiVersion: grafana.integreatly.org/v1beta1
kind: GrafanaDashboard
metadata:
name: vllm-performance-dashboard
spec:
instanceSelector:
matchLabels:
dashboards: grafana # Adjust to match your Grafana instance labels
folder: "vLLM Monitoring"
json: |
# Replace this comment with the complete JSON content from
# performance_statistics.json - The JSON should start with { and end with }
Then apply to your cluster:
kubectl apply -f your-dashboard.yaml -n <namespace>