feat: Enable PyTorch2 Batching Tests by mattwittwer · Pull Request #8814 · triton-inference-server/server

mattwittwer · 2026-06-02T23:54:18Z

What does the PR do?

Makes the AOTI test models batch‑capable and adds coverage: the simple add/sub model is exported with a dynamic batch dim and max_batch_size: 8; a new sequence (implicit‑state accumulator) model + config is added to gen_qa_implicit_models.py and wired into gen_qa_model_repository; and torch_aoti_infer_test.py gains batched inference cases (batch 1/4/8 across dtypes) plus a sequence test class (single + interleaved sequences), with test.sh updated to pull and run them.

Checklist

Commit Type:

Check the conventional commit type
box here and add the label to the github PR.

Related PRs:

triton-inference-server/pytorch_backend#196

Where should the reviewer start?

Test plan:

CI Pipeline ID:

Caveats:

Background

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

closes GitHub issue: #xxx

…-inference-server/server into mwittwer/enable_pytorch2_batching

Copilot

Pull request overview

This PR expands the PyTorch AOTInductor (PT2 / torch_aoti) QA assets to support and validate batching, including dynamic batching behavior, plus adds new sequence-batching models and corresponding L0 test coverage.

Changes:

Export AOTI models (simple add/sub + torchvision) with a dynamic batch dimension and configure max_batch_size: 8 + dynamic batching.
Add new AOTI batching-coverage models (variable non-batch dim, multi-instance) and AOTI sequence-batching models (including forward-interface + initial_state + negative-load variants).
Extend L0_torch_aoti tests to cover batched inference, dynamic batching coalescing, multi-instance correctness, variable-shape batching, sequence scheduling, and negative load-failure checks.

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
`qa/L0_torch_aoti/torch_aoti_infer_test.py`	Adds batched inference cases, dynamic batching coalescing checks, variable-shape and multi-instance coverage, and new sequence-batching tests.
`qa/L0_torch_aoti/test.sh`	Adds additional models to the repo setup, pulls in sequence models, and runs a new negative load-failure phase.
`qa/common/gen_qa_models.py`	Exports AOTI models with dynamic batch dims, sets `max_batch_size`, and adds new batching-coverage model generators.
`qa/common/gen_qa_model_repository`	Wires AOTI implicit-sequence model generation into the model repository build step.
`qa/common/gen_qa_implicit_models.py`	Implements AOTI sequence model + configs (including variants and negative configs) and adds a `--torch-aoti` flag.

Comments suppressed due to low confidence (1)

qa/L0_torch_aoti/test.sh:197

The redirection operator is incorrect (&1>2). This does not redirect output to stderr as intended; use 1>&2 so the test runner properly captures failures.

if [[ ${RET} -ne 0 ]]; then
    echo -e "${COLOR_ERROR}\n***\n*** Test Suite FAILED\n***${COLOR_RESET}" &1>2
else

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

yinggeh · 2026-06-09T10:13:35Z

+    kill -s SIGINT ${SERVER_PID}
+    wait ${SERVER_PID} || true
+fi
+rm -rf ${BAD_MODELDIR}


Remove model directories at the top so that one can inspect them after test completes.

Updated and moved to the top of the tests:
https://github.com/triton-inference-server/server/pull/8814/changes#diff-97b7cf613201d3c908b9cb50d67db76ebf5bb026a737d67a8a805ba0fe653d9cR83-R85

whoisj

LGTM

whoisj · 2026-06-05T19:34:45Z

+  max_sequence_idle_microseconds: 5000000
+  control_input [
+    {{
+      name: "INPUT__2"


we should probably have a test with the other parameter naming schema used as well.

Added: test_forward_interface_sequence to cover the ARGS[...]/RESULT[...] schema

yinggeh · 2026-06-09T19:49:09Z

@whoisj Should we test functionality of dynamic and sequence batching in L0_batcher and L0_sequence_batcher?

whoisj · 2026-06-09T20:23:35Z

We should, but we should NOT block this PR because of it.

mattwittwer added 2 commits June 2, 2026 20:02

simple batching

e9ccba2

add sequence batching

cfd11ed

mattwittwer self-assigned this Jun 2, 2026

mattwittwer added 4 commits June 2, 2026 16:55

Merge branch 'main' into mwittwer/enable_pytorch2_batching

660fea3

pre-commit

0239b06

Merge branch 'mwittwer/enable_pytorch2_batching' of github.com:triton…

b48454f

…-inference-server/server into mwittwer/enable_pytorch2_batching

remove sys import

036b77b

mattwittwer mentioned this pull request Jun 3, 2026

feat: Enable PyTorch2 Batching triton-inference-server/pytorch_backend#196

Open

20 tasks

mattwittwer changed the title ~~draft: enable pytorch2 batching~~ feat: enable pytorch2 batching Jun 4, 2026

mattwittwer changed the title ~~feat: enable pytorch2 batching~~ feat: Enable PyTorch2 Batching Jun 4, 2026

mattwittwer changed the title ~~feat: Enable PyTorch2 Batching~~ feat: Enable PyTorch2 Batching Tests Jun 4, 2026

mattwittwer added 2 commits June 4, 2026 18:35

add correlation id test case

e13ca02

revert input alias mapping

05aae4b

mattwittwer requested review from pskiran1, whoisj and yinggeh June 5, 2026 18:32

Add L0_torch_aoti coverage for additional bathing scenarios

a0dc3d3

github-advanced-security AI found potential problems Jun 8, 2026

View reviewed changes

Comment thread qa/common/gen_qa_implicit_models.py Fixed

Comment thread qa/common/gen_qa_implicit_models.py Fixed

Comment thread qa/common/gen_qa_implicit_models.py Fixed

Comment thread qa/common/gen_qa_models.py Fixed

Comment thread qa/common/gen_qa_models.py Fixed

clean up test cases

1825b48

yinggeh requested a review from Copilot June 8, 2026 23:22

Copilot started reviewing on behalf of yinggeh June 8, 2026 23:22 View session

Copilot AI reviewed Jun 8, 2026

View reviewed changes

Comment thread qa/L0_torch_aoti/test.sh Outdated

Comment thread qa/L0_torch_aoti/test.sh

Comment thread qa/L0_torch_aoti/torch_aoti_infer_test.py Outdated

fix staggered test

19f8e22

yinggeh reviewed Jun 9, 2026

View reviewed changes

Merge branch 'main' into mwittwer/enable_pytorch2_batching

e1ad1aa

whoisj previously approved these changes Jun 9, 2026

View reviewed changes

clear model repro at the start of the test

aca7c29

mattwittwer dismissed whoisj’s stale review via aca7c29 June 9, 2026 20:57

whoisj approved these changes Jun 9, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Enable PyTorch2 Batching Tests#8814

feat: Enable PyTorch2 Batching Tests#8814
mattwittwer wants to merge 13 commits into
mainfrom
mwittwer/enable_pytorch2_batching

mattwittwer commented Jun 2, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

yinggeh Jun 9, 2026

Uh oh!

mattwittwer Jun 9, 2026

Uh oh!

whoisj left a comment

Uh oh!

whoisj Jun 5, 2026

Uh oh!

mattwittwer Jun 9, 2026

Uh oh!

yinggeh commented Jun 9, 2026

Uh oh!

whoisj commented Jun 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

5 participants

Conversation

mattwittwer commented Jun 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does the PR do?

Checklist

Commit Type:

Related PRs:

Where should the reviewer start?

Test plan:

Caveats:

Background

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

yinggeh Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

mattwittwer Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

whoisj left a comment

Choose a reason for hiding this comment

Uh oh!

whoisj Jun 5, 2026

Choose a reason for hiding this comment

Uh oh!

mattwittwer Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

yinggeh commented Jun 9, 2026

Uh oh!

whoisj commented Jun 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

5 participants

mattwittwer commented Jun 2, 2026 •

edited

Loading