
feat: add fp8 optimization support for transformer model #6

Open

eric-gitta-moore wants to merge 6 commits into brandon929:main from eric-gitta-moore:main

Conversation

@eric-gitta-moore

  • Implement fp8 quantization utilities for linear layers
  • Add fp8 optimization option to the gradio demo interface
  • Modify the worker function to handle the fp8-optimized state dict
  • Include monkey patching for the fp8 linear layer forward pass (the quantization and the patched forward are both sketched after this list)
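For orientation, here is a minimal sketch of the two pieces described above: per-tensor FP8 quantization of linear weights, and a monkey-patched forward that dequantizes on the fly. This is not the actual utility from kohya-ss's FramePack-LoRAReady (which this PR merges in); it assumes PyTorch >= 2.1 for `torch.float8_e4m3fn`, and the helper names (`quantize_linear_fp8`, `fp8_linear_forward`, `apply_fp8_monkey_patch`) are illustrative.

```python
import types

import torch
import torch.nn as nn
import torch.nn.functional as F

FP8_E4M3_MAX = 448.0  # largest finite value representable in torch.float8_e4m3fn


def quantize_linear_fp8(linear: nn.Linear) -> None:
    """Replace the weight with an FP8 tensor plus a per-tensor scale (inference-only)."""
    linear.weight.requires_grad_(False)  # FP8 weights are not trainable here
    w = linear.weight.data
    scale = w.abs().max().clamp(min=1e-12) / FP8_E4M3_MAX
    q = (w / scale).clamp(-FP8_E4M3_MAX, FP8_E4M3_MAX).to(torch.float8_e4m3fn)
    linear.weight.data = q
    linear.register_buffer("scale_weight", scale)


def fp8_linear_forward(self: nn.Linear, x: torch.Tensor) -> torch.Tensor:
    """Patched forward: dequantize the FP8 weight on the fly, then run F.linear."""
    w = self.weight.to(x.dtype) * self.scale_weight.to(x.dtype)
    return F.linear(x, w, self.bias)


def apply_fp8_monkey_patch(model: nn.Module) -> None:
    """Quantize every nn.Linear in the model and rebind its forward method."""
    for module in model.modules():
        if isinstance(module, nn.Linear):
            quantize_linear_fp8(module)
            module.forward = types.MethodType(fp8_linear_forward, module)
```

The trade-off is the usual one for weight-only quantization: the state dict shrinks to roughly half the bfloat16 size, at the cost of a dequantize step in every linear forward.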

merge https://github.com/kohya-ss/FramePack-LoRAReady/blob/main/utils/fp8_optimization_utils.py

Add an --offline flag to load models from the local cache instead of downloading from the HuggingFace Hub. This enables usage in environments with restricted internet access.
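As a hedged sketch (the exact wiring in this PR may differ), offline loading with HuggingFace libraries usually comes down to setting the hub's offline environment variables before import, or passing `local_files_only=True` to the loaders:

```python
import os

# Must be set before transformers/diffusers are imported to take effect reliably.
os.environ["HF_HUB_OFFLINE"] = "1"
os.environ["TRANSFORMERS_OFFLINE"] = "1"

from transformers import AutoTokenizer  # any HF from_pretrained loader behaves the same

# local_files_only=True raises an error instead of downloading when the model
# is not already in the local cache. "some/model" is a placeholder repo id.
tokenizer = AutoTokenizer.from_pretrained("some/model", local_files_only=True)
```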
Fix the --offline argument by removing the incorrect store_true action, and lower the minimum value of the gpu_memory_preservation slider from 6 to 0 for better flexibility in low-memory scenarios.
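A minimal Gradio sketch of the adjusted slider; everything except the new minimum (the label, upper bound, and step) is an assumption:

```python
import gradio as gr

gpu_memory_preservation = gr.Slider(
    label="GPU Memory Preservation (GB)",  # assumed label
    minimum=0,    # was 6; 0 lets low-memory setups disable preservation entirely
    maximum=128,  # assumed upper bound
    value=6,
    step=0.1,
)
```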
The FP8 optimization checkbox was disabled by default, which may lead to suboptimal performance for users who are unaware of this setting. Enabling it by default ensures better performance out of the box.
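The corresponding change is a one-line default flip, sketched here with an assumed label:

```python
import gradio as gr

fp8_optimization = gr.Checkbox(label="FP8 Optimization", value=True)  # was value=False
```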