"topic:causal-intervention" — Search | GitHunt

Repositories Developers Collections

© 2026 GitHunt · tansuasici

7 results for “topic:causal-intervention”

explanare/ravel

Evaluate interpretability methods on localizing and disentangling concepts in LLMs.

Jupyter Notebook579Updated 4 months ago

causal-interventiondisentangled-representationsinterpretabilityinterventionprobingsparse-autoencoder

CRIPAC-DIG/CF-FEND

[SIGIR 2022] Source code and datasets for "Bias Mitigation for Evidence-aware Fake News Detection by Causal Intervention".

Python111Updated 1 year ago

causal-inferencecausal-interventiondebiasingevidence-basedfake-news-detection

explanare/verbatim-memorization

Demystifying Verbatim Memorization in Large Language Models

Python94Updated 4 months ago

causal-interventionmemorizationunlearning

explanare/eval-neuron-explanation

A framework for evaluating auto-interp pipelines, i.e., natural language explanations of neurons.

Python31Updated 1 year ago

causal-interventionexplanabilityinterpretabilityneuronsprobing

explanare/char-iit

A causal intervention framework to learn robust and interpretable character representations inside subword-based language models

Jupyter Notebook30Updated 2 years ago

causal-interventioncharacter-level-language-modelinterpretabilitysubword

luka-group/Causal-View-of-Entity-Bias

[EMNLP 2023] A Causal View of Entity Bias in (Large) Language Models

Python20Updated 1 year ago

causal-interventiondebiasingfaithfulnessknowledge-conflictslarge-language-models

ahsanashfa/verbatim-flow

Capture macOS dictation accurately without rewriting your words, keeping your input true to what you speak and avoiding common app issues.

Swift00Updated just now

agentic-aiagentic-workflowagentscausal-interventionchatgptepub-readerjavascriptlangchainlarge-language-modelslatent-diffusionmemorizationnextjsno-codereactstable-diffusionworkflow-automation