Skip to content

fix: Model readiness for namespaced models#502

Merged
yinggeh merged 1 commit into
mainfrom
yinggeh/tri-1341-fix-ci-test-l0_model_namespacing-base
Jun 10, 2026
Merged

fix: Model readiness for namespaced models#502
yinggeh merged 1 commit into
mainfrom
yinggeh/tri-1341-fix-ci-test-l0_model_namespacing-base

Conversation

@yinggeh

@yinggeh yinggeh commented Jun 6, 2026

Copy link
Copy Markdown
Contributor

What does the PR do?

Fixes the L0_model_namespacing--base and L0_metrics--base tests. Strict server readiness (InferenceServer::IsReady) and InferenceServer::ModelIsReady resolved models by name only, which is ambiguous when model namespacing is enabled and the same model name exists in multiple namespaces — the lookup fails and the server is reported not-ready. Readiness now resolves models by the full ModelIdentifier.

This regression was introduced by #473 (fix: /v2/health/ready returns 200 when Python backend stub is dead (#8604), commit 53fc26e), which added the runtime backend readiness check in the strict-readiness path using name-only model lookup.

Checklist

  • PR title reflects the change and is of format <commit_type>: <Title>
  • Changes are described in the pull request.
  • Related issues are referenced.
  • Populated github labels field
  • Added test plan and verified test passes.
  • Verified that the PR passes existing CI.
  • Verified copyright is correct on all changed files.
  • Added succinct git squash message before merging.
  • All template sections are filled out.
  • Optional: Additional screenshots for behavior/output changes with before/after.

Commit Type:

  • fix

Related PRs:

Where should the reviewer start?

  • src/server.cc / src/server.h: IsReady strict-readiness loop now resolves each model via the full ModelIdentifier (GetModel(mv.first, ...)); ModelIsReady takes an already-resolved Model&.
  • src/tritonserver.cc: TRITONSERVER_ServerModelIsReady resolves the model (name-based, matching the public C-API contract) and calls the new ModelIsReady overload.

Test plan:

  • L0_model_namespacing--base

  • L0_metrics--base

  • CI Pipeline ID: 53863861

Caveats:

The public C-API TRITONSERVER_ServerModelIsReady still resolves by name (no namespace parameter in the API contract); namespaced readiness resolution applies to the internal strict-readiness path.

Background

Related Issues:

N/A

@yinggeh yinggeh changed the title fix: Resolve readiness model lookup by ModelIdentifier (TRI-1341) fix: Resolve readiness model lookup by ModelIdentifier Jun 6, 2026
Strict server readiness and ModelIsReady resolved models by name only,
which is ambiguous when model namespacing is enabled and the same model
name exists in multiple namespaces, causing L0_model_namespacing to fail.
IsReady now resolves models via the full ModelIdentifier, and ModelIsReady
takes an already-resolved Model so each caller controls model resolution.
@yinggeh yinggeh force-pushed the yinggeh/tri-1341-fix-ci-test-l0_model_namespacing-base branch from 7de0d9f to 632ad53 Compare June 6, 2026 02:15
@yinggeh yinggeh self-assigned this Jun 6, 2026
@yinggeh yinggeh added the bug Something isn't working label Jun 6, 2026
@yinggeh yinggeh changed the title fix: Resolve readiness model lookup by ModelIdentifier fix: Model readiness for namespaced models Jun 6, 2026
@yinggeh yinggeh requested review from pskiran1 and whoisj June 6, 2026 02:43

@whoisj whoisj left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@yinggeh yinggeh merged commit 73aad7c into main Jun 10, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bug Something isn't working

Development

Successfully merging this pull request may close these issues.

2 participants