Goal
Determine optimal number of concurrent Wan 2.2 instances per MI300X GPU.
Theoretical: 4-9 instances per GPU (32-72 total on 8-GPU node). Needs real hardware validation.
Tasks
Details
See docs/research/mi300x-benchmarking.md for full protocol.
Blocked By
Access to MI300X hardware.
Goal
Determine optimal number of concurrent Wan 2.2 instances per MI300X GPU.
Theoretical: 4-9 instances per GPU (32-72 total on 8-GPU node). Needs real hardware validation.
Tasks
max_instances_per_gpuin config based on findingsDetails
See docs/research/mi300x-benchmarking.md for full protocol.
Blocked By
Access to MI300X hardware.