mamba-2

Here are 9 public repositories matching this topic...

keshik6 / grafting

[NeurIPS 2025 Oral] Official Code for Exploring Diffusion Transformer Designs via Grafting

image-generation post-training self-attention convolutions diffusion-models grafting linear-attention text-to-image-generation architecture-research diffusion-transformer sub-quadratic-attention model-grafting hyena-operator model-architecture-editing diffusion-transformers architecture-editing hyena-x hyena-y mamba-2

Updated Jan 9, 2026
Jupyter Notebook

Run IBM Granite 4.0 locally on Raspberry Pi 5 with Ollama.This is a privacy-first AI. Your data never leaves your device because it runs 100% locally. There are no cloud uploads and no third-party tracking.

linux open-source raspberry-pi ai ibm arm64 embedded-linux ai-project edge-ai huggingface on-device-ai llm local-ai ollama small-language-models offline-ai private-ai self-hosted-ai mamba-2

Updated Mar 27, 2026
Shell

humanjesse / Granite4-Mamba2-Mech-Interp-Suite

Star

mech-interp suite for Granite4 models that use Mamba-2 architecture

ibm mech-interp mamba-2 granite4

Updated Apr 9, 2026
Python

Pomilon-Intelligence-Lab / ALSI

Star

Early baby steps towards a long-term vision regarding Mamba-2's state interpretability.

Updated Feb 4, 2026
Python

wisnunugroho21 / nugie-jax-nemotron-3-nano

Star

A simple, minimalistic, and explainable code implementation of of Nemotron 3 Nano in JAX

transformer moe mamba jax deep-lear lllm nemotron mamba-2 nemotron-nano

Updated May 27, 2026
Python

hinanohart / recurrentlens

Star

Mechanistic interpretability for State-Space Models: SAEs, feature visualization, and a Hub registry for Mamba/Mamba-2.

pytorch neural-networks ssm mamba interpretability sparse-autoencoder state-space-models mechanistic-interpretability mamba-2

Updated May 30, 2026
Python

HarmoniqOS / ssm-aware-lora-finetuning

Star

Systematic study of LoRA fine-tuning strategies for IBM Granite 4.0-H-Micro (Mamba-2 + Transformer hybrid). Demonstrates the impact of architecture-aware target selection and SSM core parameter co-training, including analysis of PEFT serialization behavior. Reports up to 37% relative improvement over LoRA-only baselines.

lora granite mamba state-space-model peft llama-cpp gguf parameter-efficient-fine-tuning llm-finetuning hybrid-architecture ibm-granite mamba-2 granite4

Updated Mar 1, 2026
Python

wisnunugroho21 / nugie-jax-mamba

Star

A simple, minimalistic, and explainable JAX implementation of Mamba 2 & Mamba 3

deep-learning neural-network transformer mamba jax llm mamba-state-space-models mamba-2 mamba2 mamba-ssm mamba-3

Updated May 10, 2026
Python

hinanohart / circuitbench

Star

Integrated mechanistic interpretability + sparse autoencoder framework for Hybrid SSM-Attention models (Mamba-2, Hymba, RWKV-7). v0.1.2 alpha: real forward-pass intervention + mean-ablation patching shipped, CPU smoke; GPU/real adapters in v0.2.

pytorch alignment ssm mamba sae interpretability sparse-autoencoder state-space-model rwkv mechanistic-interpretability hymba mamba-2 transformer-alternatives

Updated May 30, 2026
Python

Improve this page

Add a description, image, and links to the mamba-2 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the mamba-2 topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mamba-2

Here are 9 public repositories matching this topic...

keshik6 / grafting

Jewelzufo / granitepi-4-nano

humanjesse / Granite4-Mamba2-Mech-Interp-Suite

Pomilon-Intelligence-Lab / ALSI

wisnunugroho21 / nugie-jax-nemotron-3-nano

hinanohart / recurrentlens

HarmoniqOS / ssm-aware-lora-finetuning

wisnunugroho21 / nugie-jax-mamba

hinanohart / circuitbench

Improve this page

Add this topic to your repo