Add Sage Attention (CORE-74) #42

Open: contentis wants to merge 10 commits into Comfy-Org:main from contentis:sage_attn

@contentis (Contributor)

This PR adds the kernel for SageAttention 2 using FP8, along with slightly updated quantization kernels for improved end-to-end performance.
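
For readers unfamiliar with what a quantization kernel does, here is a minimal pure-Python sketch of symmetric per-block quantization with a scale factor. It is an illustration only and not the kernels in this PR: the function names, the INT8 target, and the block handling are assumptions chosen for readability, and a real kernel would run on the GPU over tiles of Q, K, and V.

```python
def quantize_block_int8(values):
    """Symmetric INT8 quantization of one block of floats.

    Hypothetical helper for illustration; returns (codes, scale) such that
    code * scale approximates the original value.
    """
    amax = max((abs(v) for v in values), default=0.0)
    # Map the largest magnitude in the block to 127; avoid dividing by zero.
    scale = amax / 127.0 if amax > 0 else 1.0
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize_block(codes, scale):
    """Recover approximate floats from integer codes and the block scale."""
    return [c * scale for c in codes]


block = [1.0, -2.0, 0.5]
codes, scale = quantize_block_int8(block)
restored = dequantize_block(codes, scale)
```

Each restored value differs from the original by at most half a quantization step (`scale / 2`), which is why keeping one scale per small block rather than per tensor tends to preserve accuracy.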

@alexisrolland changed the title from "Add Sage Attention" to "Add Sage Attention (CORE-74)" on May 22, 2026
Resolve conflicts by keeping Sage Attention kernels alongside main's
SVDQuant W4A4, AWQ W4A16, and split-half RoPE APIs.
@contentis (Contributor, Author)

@rattus128 any thoughts? If you think any critical features are missing, please let me know.

@Ph0rk0z commented Jun 3, 2026

I have to use a different Sage Attention because I'm on Ampere/Turing. Will this conflict? Can I still use my own?
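
For context on the hardware side of this question: FP8 tensor cores first appear on NVIDIA GPUs with compute capability 8.9 (Ada) and 9.0 (Hopper), while Turing is 7.5 and Ampere is 8.0 to 8.7. A minimal sketch of a capability check follows; the function name is hypothetical, and whether this PR gates its kernels this way or falls back to another implementation is not stated in the thread.

```python
def supports_fp8_tensor_cores(major, minor):
    """Return True if a CUDA compute capability has FP8 tensor cores.

    Hypothetical helper. Ada (8.9), Hopper (9.0) and newer qualify;
    Turing (7.5) and Ampere (8.0, 8.6, 8.7) do not.
    """
    return (major, minor) >= (8, 9)


# With PyTorch installed, the pair could come from
# torch.cuda.get_device_capability(), which returns (major, minor).
turing_ok = supports_fp8_tensor_cores(7, 5)
ampere_ok = supports_fp8_tensor_cores(8, 6)
ada_ok = supports_fp8_tensor_cores(8, 9)
```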
