Skip to content

Add nvidia-vss-vllm 3.2.2 (NVIDIA VSS Blueprint 3.1.0)#242

Open
blik616287 wants to merge 6 commits into
spectrocloud:mainfrom
blik616287:add-nvidia-vss-vllm-3.1.4
Open

Add nvidia-vss-vllm 3.2.2 (NVIDIA VSS Blueprint 3.1.0)#242
blik616287 wants to merge 6 commits into
spectrocloud:mainfrom
blik616287:add-nvidia-vss-vllm-3.1.4

Conversation

@blik616287

Copy link
Copy Markdown
Contributor

nvidia-vss-vllm — VSS 3.x, pack 3.1.4

VSS 3.x standalone vLLM LLM backend (nvidia/NVIDIA-Nemotron-Nano-9B-v2-FP8) as a Helm chart, fronted by an nginx sidecar (llm-nim-svc). gpuMemUtil: 0.4 — sized so the LLM co-resides with the Cosmos VLM on a single GB10 (119.6 GiB unified); raise it on a dedicated GPU.

Versioning: chart/pack version: 3.1.4 (our packaging) · appVersion: 3.1.0 (upstream NVIDIA VSS Blueprint 3.x). Helm chart; images pinned in values.yaml pack.content.images.

Tested on NVIDIA GB10 / DGX Spark (arm64 SBSA)

Deployed via Palette add-on cluster profile vss-dgx-spark-3x on edge cluster edge-gx10 (single GB10). Full VSS 3.x route green — all 5 packs report Pack services are ready, cluster Running:

cosmos-reason2-8b-846f9747c9-b2z44           1/1 Running
elasticsearch-0                              1/1 Running
envoy-proxy-8654778857-4lh9r                 1/1 Running
kibana-6c48bf74cb-nlrm9                      1/1 Running
llm-vllm-66f78ff858-l5ftt                    2/2 Running
phoenix-0                                    1/1 Running
postgres-0                                   1/1 Running
redis-0                                      1/1 Running
vss-agent-669d5b5fd6-tzc7d                   1/1 Running
vss-agent-ui-98789c556-zq257                 1/1 Running
vss-proxy-6f8dc9d698-4rmmt                   1/1 Running
vss-vios-ingress-56485ddc44-s7pnf            1/1 Running
vss-vios-mcp-5d579cc49b-fjs68                1/1 Running
vss-vios-sensor-57d86f8dcb-9thct             1/1 Running
vss-vios-streamprocessing-789ccd7c59-xkfrc   1/1 Running

Validation: pack.json JSON-syntax/schema/version, logo, README, and pack.content.images all pass. The content.images pull (crane) fails for the gated nvcr.io/nim/* and nvcr.io/nvidia/vss-core/* images — the CI runner has no NGC credentials (same image-pull exception as the 2.4 PRs #233–236; the cluster pulls them via ngc-pull-secret).

@bulwark-spectrocloud bulwark-spectrocloud Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

✅ CVE scan completed successfully.

Scan Summary:

  • Total images scanned: 4
  • Clean images: 2
  • Images with CVEs: 2
  • Total CVEs found: 219

🔴 Critical CVEs: 5
🟠 High CVEs: 28
🟡 Medium CVEs: 21
🟢 Low CVEs: 165

Images with CVEs:

  • docker.io/alpine/curl:8.12.1: 82 CVEs (Critical: 2, High: 7, Medium: 3, Low: 70)
    Critical CVEs: CVE-2026-31789
  • nginx:1.27-alpine: 137 CVEs (Critical: 3, High: 21, Medium: 18, Low: 95)
    Critical CVEs: CVE-2025-48174, CVE-2026-31789

⚠️ Please review the CVE findings above and address critical/high severity issues before merging.

Pack references only public 0/0 Chainguard images (wolfi-base/nginx/curl). The gated NVIDIA vLLM runtime is fetched as data at deploy (crane export, NGC creds) and run non-privileged on wolfi-base via rootfs-linker swap + gcc/g++ --sysroot wrappers + CUDA tool paths. Validated on a GB10 (Blackwell arm64): serves nemotron, HTTP 200. content.images all public 0/0; validator content check passes (no gated pull). The running container carries NVIDIA CUDA-stack CVEs as runtime data, not in the scanned pack image.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

@bulwark-spectrocloud bulwark-spectrocloud Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

✅ CVE scan completed successfully.

Scan Summary:

  • Total images scanned: 3
  • Clean images: 3
  • Images with CVEs: 0
  • Total CVEs found: 0

🔴 Critical CVEs: 0
🟠 High CVEs: 0
🟡 Medium CVEs: 0
🟢 Low CVEs: 0

✅ All scanned images are clean with no CVEs found.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

@bulwark-spectrocloud bulwark-spectrocloud Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

✅ CVE scan completed successfully.

Scan Summary:

  • Total images scanned: 3
  • Clean images: 3
  • Images with CVEs: 0
  • Total CVEs found: 0

🔴 Critical CVEs: 0
🟠 High CVEs: 0
🟡 Medium CVEs: 0
🟢 Low CVEs: 0

✅ All scanned images are clean with no CVEs found.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

@bulwark-spectrocloud bulwark-spectrocloud Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

✅ CVE scan completed successfully.

Scan Summary:

  • Total images scanned: 3
  • Clean images: 3
  • Images with CVEs: 0
  • Total CVEs found: 0

🔴 Critical CVEs: 0
🟠 High CVEs: 0
🟡 Medium CVEs: 0
🟢 Low CVEs: 0

✅ All scanned images are clean with no CVEs found.

mforde84 and others added 2 commits June 15, 2026 05:49
3.2.2 validated running 2/2 on a GB10 edge cluster. Merged upstream/main (crane-manifest validator fix).

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
@blik616287 blik616287 changed the title Add nvidia-vss-vllm 3.1.4 (NVIDIA VSS Blueprint 3.1.0) Add nvidia-vss-vllm 3.2.2 (NVIDIA VSS Blueprint 3.1.0) Jun 15, 2026

@bulwark-spectrocloud bulwark-spectrocloud Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

✅ CVE scan completed successfully.

Scan Summary:

  • Total images scanned: 3
  • Clean images: 3
  • Images with CVEs: 0
  • Total CVEs found: 0

🔴 Critical CVEs: 0
🟠 High CVEs: 0
🟡 Medium CVEs: 0
🟢 Low CVEs: 0

✅ All scanned images are clean with no CVEs found.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants