Deployment #1

Merged
fuddin-bit merged 7 commits into main from deployment on Jun 10, 2026

Conversation

@fuddin-bit

Owner

No description provided.

fuddin-bit and others added 7 commits June 4, 2026 11:53
Extend gpu-dashboard.json with temperature, power, VRAM %, memory-copy,
XID, and optional profiling metrics; sync Helm ConfigMap and document
PromQL in DASHBOARD.md and GRAFANA_DEPLOYMENT.md.

Co-authored-by: Cursor <cursoragent@cursor.com>
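
As a rough illustration of the kind of PromQL this commit describes, the sketch below maps panel titles to queries. The metric names are standard dcgm-exporter fields, but the panel titles, the `PANEL_QUERIES` name, and the VRAM formula are assumptions; the actual expressions in gpu-dashboard.json and DASHBOARD.md may differ.

```python
# Hypothetical panel-title -> PromQL mapping (not taken from the PR diff).
# Metric names are dcgm-exporter fields; the real dashboard queries may differ.
PANEL_QUERIES = {
    "GPU temperature (C)": "DCGM_FI_DEV_GPU_TEMP",
    "Power usage (W)": "DCGM_FI_DEV_POWER_USAGE",
    # Assumption: VRAM % is framebuffer used over used + free.
    "VRAM used (%)": (
        "100 * DCGM_FI_DEV_FB_USED"
        " / (DCGM_FI_DEV_FB_USED + DCGM_FI_DEV_FB_FREE)"
    ),
    "Memory-copy utilization (%)": "DCGM_FI_DEV_MEM_COPY_UTIL",
    "XID errors": "DCGM_FI_DEV_XID_ERRORS",
}


def vram_percent(used_mib: float, free_mib: float) -> float:
    """Same arithmetic as the assumed VRAM query, for a single GPU sample."""
    total = used_mib + free_mib
    if total == 0:
        return 0.0
    return 100.0 * used_mib / total


if __name__ == "__main__":
    for title, expr in PANEL_QUERIES.items():
        print(f"{title}: {expr}")
```

On drivers that report reserved framebuffer memory separately, used + free can undercount the total, so the percentage here is an approximation.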
Include values for OpenShift and vanilla Kubernetes, dashboard import
script, CoreWeave ingress guide, and README Helm quick start.

Co-authored-by: Cursor <cursoragent@cursor.com>
Visualize DCGM_FI_PROF_GR_ENGINE_ACTIVE per node and document PromQL
in DASHBOARD.md and GRAFANA_DEPLOYMENT.md.

Co-authored-by: Cursor <cursoragent@cursor.com>
… states. Adjusted legend formats and added refIds for clarity in the Grafana configuration. Ensured consistency across dashboard panels for better monitoring of GPU workloads.
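
To make the refId and legend-format changes concrete, here is a minimal sketch of how Grafana Prometheus targets for a per-node `DCGM_FI_PROF_GR_ENGINE_ACTIVE` panel could be generated. The `build_targets` helper, the `avg by (Hostname)` aggregation, and the `Hostname` label are assumptions for illustration and are not taken from the PR's dashboard JSON.

```python
import json
import string


def build_targets(queries):
    """Turn (expr, legendFormat) pairs into Grafana targets with refIds A, B, C...

    Hypothetical helper: it only shows the target shape (refId, expr,
    legendFormat) and supports at most 26 queries per panel.
    """
    if len(queries) > len(string.ascii_uppercase):
        raise ValueError("too many queries for single-letter refIds")
    return [
        {"refId": ref_id, "expr": expr, "legendFormat": legend}
        for ref_id, (expr, legend) in zip(string.ascii_uppercase, queries)
    ]


# Assumed per-node query: average graphics-engine activity grouped by host.
targets = build_targets([
    ("avg by (Hostname) (DCGM_FI_PROF_GR_ENGINE_ACTIVE)", "{{Hostname}}"),
])

if __name__ == "__main__":
    print(json.dumps(targets, indent=2))
```

Giving every target an explicit refId keeps panels stable when queries are reordered, and a label-based legendFormat such as `{{Hostname}}` keeps legends readable across nodes.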
fuddin-bit merged commit 347ddc9 into main on Jun 10, 2026
3 of 4 checks passed

1 participant