Evaluate interpretability methods for localizing and disentangling concepts in LLMs.
[SIGIR 2022] Source code and datasets for "Bias Mitigation for Evidence-aware Fake News Detection by Causal Intervention".
Causal Intervention on Modality-specific Biases for Medical Visual Question Answering
Demystifying Verbatim Memorization in Large Language Models
A causal intervention framework to learn robust and interpretable character representations inside subword-based language models
A framework for evaluating auto-interp pipelines, i.e., natural language explanations of neurons.
[EMNLP 2023] A Causal View of Entity Bias in (Large) Language Models