BigsnarfDude

bigsnarfdude

Standing on the shoulders of giants - ML, Deep Learning, and DFIR. Kaggle Expert. https://www.Kaggle.com/vincento. Python, Scala, Spaces, and VIM

Canada

bigsnarfdude.github.io

Organizations

Languages

Python83%Go4%Ruby4%Jupyter Notebook4%HTML4%

Repos

139

Stars

146

Forks

Top Language

Python

Loading contributions...

Top Repositories

violentPythonForHackers

collection of python tools

76Python

SELU_Keras_Tutorial

Keras based Tutorials and implementations for "Self-normalizing networks" - activation function SELU

48Jupyter Notebook

guide-to-data-mining

iPython Notebook of the Guide to Data Mining

constitutional-classifier-pp

Two-stage jailbreak defense system for LLMs with linear activation probe and ensemble classifier

2Python

researchRalph

Autonomous research using multi-agent swarm for experiments

0Python

Repositories

139

bigsnarfdude/researchRalph

Autonomous research using multi-agent swarm for experiments

Python00Updated 18 hours ago

bigsnarfdude/A3Fork

No description provided.

Python00Updated 4 days ago

bigsnarfdude/tellthetruth

Probe-based hallucination detection: what replicates, what doesn't, and why

Python00Updated 3 weeks ago

bigsnarfdude/agenthubFork

GitHub is for humans. AgentHub is for agents. First use case is for autoresearch but it's a lot more general than that. Exploratory project.

Go00Updated 1 week ago

bigsnarfdude/autoresearchFork

AI agents running research on single-GPU nanochat training automatically

Python00Updated 1 week ago

bigsnarfdude/persona-af-elicitation

No description provided.

Python00Updated 2 weeks ago

bigsnarfdude/bigsnarfdude.github.io

personal website on github http://bigsnarfdude.github.io

Ruby00Updated 2 weeks ago

bigsnarfdude/violentPythonForHackers

collection of python tools

Python7640Updated 7 years ago

bigsnarfdude/af-generate-diverse-2026-01

Generate diverse alignment faking samples using 10-pattern reasoning system

Python00Updated 3 weeks ago

bigsnarfdude/volclaw

Claude Memory forensics investigation toolkit with Volatility 3 automation, IOC extraction, and timeline building

Python00Updated 3 weeks ago

bigsnarfdude/autoaudit

Automated alignment faking audit pipeline (Ralph loop + organism server)

Python00Updated 1 month ago

bigsnarfdude/autointerp

No description provided.

Python00Updated 3 weeks ago

bigsnarfdude/af-research-writeup

Alignment Faking Detection: A Two-Month Research Journey - comprehensive documentation of AF detection experiments

00Updated 1 month ago

bigsnarfdude/constitutional-classifier-pp

Two-stage jailbreak defense system for LLMs with linear activation probe and ensemble classifier

Python21Updated 2 months ago

bigsnarfdude/SELU_Keras_Tutorial

Keras based Tutorials and implementations for "Self-normalizing networks" - activation function SELU

Jupyter Notebook4815Updated 4 years ago

bigsnarfdude/introspective-interpFork

Repository for "Training Language Models To Explain Their Own Computations"

00Updated 2 months ago

bigsnarfdude/lightbright

Sparse autoencoder experiments for detecting deceptive reasoning in LLM chain-of-thought

Python00Updated 1 month ago

bigsnarfdude/SAE_Cluster_Probe_Detector

CoT reasoning monitor using SAE cluster probes for alignment faking detection (0.884 AUROC)

Python00Updated 1 month ago

bigsnarfdude/Design-Patterns-In-PythonFork

Common GOF Patterns implemented in Python

Python00Updated 1 month ago

bigsnarfdude/sae-cluster-probe

SAE Cluster Probe for Alignment Faking Detection - 0.884 AUROC (83.9% gap closed)

Python00Updated 1 month ago

bigsnarfdude/model-organisms

No description provided.

Python00Updated 1 month ago

bigsnarfdude/ralphFork

Ralph is an autonomous AI agent loop that runs repeatedly until all PRD items are complete.

00Updated 2 months ago

bigsnarfdude/detector

SAE linear probe for alignment faking detection - 72% AUROC on gold_106

Python00Updated 2 months ago

bigsnarfdude/bench-afFork

Alignment Faking Model Organism Finetuning and Evaluation Utils

00Updated 4 months ago

bigsnarfdude/mindreader

Fine-tuned classifiers for chain-of-thought deception detection - training code and weights

Python00Updated 2 months ago

bigsnarfdude/global-cot-analysisFork

Global CoT Analysis: Initial attempts to uncover patterns across many chains of thought

00Updated 2 months ago

bigsnarfdude/af-signatures

No description provided.

HTML00Updated 2 months ago

bigsnarfdude/af-detection-benchmark

Evaluation dataset for chain-of-thought monitoring research (2330 labeled samples)

Python00Updated 2 months ago

bigsnarfdude/guide-to-data-mining

iPython Notebook of the Guide to Data Mining

2019Updated 12 years ago

bigsnarfdude/petriFork

An alignment auditing agent capable of quickly exploring alignment hypothesis

00Updated 2 months ago

BigsnarfDude

Organizations

Languages

Loading contributions...

Top Repositories

Repositories

Gists

Recent Activity