GitHunt

BigsnarfDude

bigsnarfdude

Standing on the shoulders of giants - ML, Deep Learning, and DFIR. Kaggle Expert. https://www.Kaggle.com/vincento. Python, Scala, Spaces, and VIM

Languages

Python 83%, Go 4%, Ruby 4%, Jupyter Notebook 4%, HTML 4%

Repos

139

Stars

146

Forks

75

Top Language

Python

Top Repositories

bigsnarfdude/researchRalph

Autonomous research using multi-agent swarm for experiments

Python | Stars: 0 | Forks: 0 | Updated 18 hours ago

bigsnarfdude/A3 (fork)

No description provided.

Python | Stars: 0 | Forks: 0 | Updated 4 days ago

bigsnarfdude/tellthetruth

Probe-based hallucination detection: what replicates, what doesn't, and why

Python | Stars: 0 | Forks: 0 | Updated 3 weeks ago

bigsnarfdude/agenthub (fork)

GitHub is for humans. AgentHub is for agents. First use case is for autoresearch but it's a lot more general than that. Exploratory project.

Go | Stars: 0 | Forks: 0 | Updated 1 week ago

bigsnarfdude/autoresearch (fork)

AI agents running research on single-GPU nanochat training automatically

Python | Stars: 0 | Forks: 0 | Updated 1 week ago

bigsnarfdude/persona-af-elicitation

No description provided.

Python | Stars: 0 | Forks: 0 | Updated 2 weeks ago

bigsnarfdude/bigsnarfdude.github.io

Personal website on GitHub: http://bigsnarfdude.github.io

Ruby | Stars: 0 | Forks: 0 | Updated 2 weeks ago

bigsnarfdude/violentPythonForHackers

Collection of Python tools

Python | Stars: 76 | Forks: 40 | Updated 7 years ago

bigsnarfdude/af-generate-diverse-2026-01

Generate diverse alignment faking samples using 10-pattern reasoning system

Python | Stars: 0 | Forks: 0 | Updated 3 weeks ago

bigsnarfdude/volclaw

Claude Memory forensics investigation toolkit with Volatility 3 automation, IOC extraction, and timeline building

Python | Stars: 0 | Forks: 0 | Updated 3 weeks ago

bigsnarfdude/autoaudit

Automated alignment faking audit pipeline (Ralph loop + organism server)

Python | Stars: 0 | Forks: 0 | Updated 1 month ago

bigsnarfdude/autointerp

No description provided.

Python | Stars: 0 | Forks: 0 | Updated 3 weeks ago

bigsnarfdude/af-research-writeup

Alignment Faking Detection: A Two-Month Research Journey - comprehensive documentation of AF detection experiments

Stars: 0 | Forks: 0 | Updated 1 month ago

bigsnarfdude/constitutional-classifier-pp

Two-stage jailbreak defense system for LLMs with linear activation probe and ensemble classifier

Python | Stars: 2 | Forks: 1 | Updated 2 months ago

bigsnarfdude/SELU_Keras_Tutorial

Keras-based tutorials and implementations for "Self-normalizing networks" - activation function SELU

Jupyter Notebook | Stars: 48 | Forks: 15 | Updated 4 years ago

bigsnarfdude/introspective-interp (fork)

Repository for "Training Language Models To Explain Their Own Computations"

Stars: 0 | Forks: 0 | Updated 2 months ago

bigsnarfdude/lightbright

Sparse autoencoder experiments for detecting deceptive reasoning in LLM chain-of-thought

Python | Stars: 0 | Forks: 0 | Updated 1 month ago

bigsnarfdude/SAE_Cluster_Probe_Detector

CoT reasoning monitor using SAE cluster probes for alignment faking detection (0.884 AUROC)

Python | Stars: 0 | Forks: 0 | Updated 1 month ago

bigsnarfdude/Design-Patterns-In-Python (fork)

Common GoF (Gang of Four) design patterns implemented in Python

Python | Stars: 0 | Forks: 0 | Updated 1 month ago

bigsnarfdude/sae-cluster-probe

SAE Cluster Probe for Alignment Faking Detection - 0.884 AUROC (83.9% gap closed)

Python | Stars: 0 | Forks: 0 | Updated 1 month ago

bigsnarfdude/model-organisms

No description provided.

Python | Stars: 0 | Forks: 0 | Updated 1 month ago

bigsnarfdude/ralph (fork)

Ralph is an autonomous AI agent loop that runs repeatedly until all PRD items are complete.

Stars: 0 | Forks: 0 | Updated 2 months ago

bigsnarfdude/detector

SAE linear probe for alignment faking detection - 72% AUROC on gold_106

Python | Stars: 0 | Forks: 0 | Updated 2 months ago

bigsnarfdude/bench-af (fork)

Alignment Faking Model Organism Finetuning and Evaluation Utils

Stars: 0 | Forks: 0 | Updated 4 months ago

bigsnarfdude/mindreader

Fine-tuned classifiers for chain-of-thought deception detection - training code and weights

Python | Stars: 0 | Forks: 0 | Updated 2 months ago

bigsnarfdude/global-cot-analysis (fork)

Global CoT Analysis: Initial attempts to uncover patterns across many chains of thought

Stars: 0 | Forks: 0 | Updated 2 months ago

bigsnarfdude/af-signatures

No description provided.

HTML | Stars: 0 | Forks: 0 | Updated 2 months ago

bigsnarfdude/af-detection-benchmark

Evaluation dataset for chain-of-thought monitoring research (2330 labeled samples)

Python | Stars: 0 | Forks: 0 | Updated 2 months ago

bigsnarfdude/guide-to-data-mining

IPython Notebook of the Guide to Data Mining

Stars: 20 | Forks: 19 | Updated 12 years ago

bigsnarfdude/petri (fork)

An alignment auditing agent capable of quickly exploring alignment hypotheses

Stars: 0 | Forks: 0 | Updated 2 months ago
