"topic:plugin-testing" — Search

4 results for “topic:plugin-testing”

4-stage evaluation framework for testing Claude Code plugin component triggering. Validates skills, agents, and commands activate correctly via programmatic detection and LLM judgment.

TypeScript · 140 · Updated 3 days ago

ai-testing, anthropic, claude, claude-agent-sdk, claude-code, cli, developer-tools, evaluation-framework, llm, plugin-testing, test-automation, testing-framework, typescript

zircote/oolong-pairs

Benchmark harness for A/B testing Claude Code plugins against OOLONG long-context reasoning tasks. Compare truncation vs RLM-RS recursive chunking strategies. Features Claude Code hooks integration, SQLite persistence, and comprehensive scoring aligned with the OOLONG paper methodology.

Python · 30 · Updated 1 week ago

ai-evaluation, anthropic, benchmark, chunking, claude, claude-code, cli, context-window, developer-tools, llm, long-context, nlp, oolong, plugin-testing, python, recursive-language-model, rlm

codifycli/codify-plugin-test

Testing framework for Codify plugins with complete lifecycle testing, IPC communication, and cross-platform support

TypeScript · 00 · Updated 5 days ago

codify, framework, integration-testing, ipc, plugin, plugin-testing, test, testing

imbflool/cc-plugin-eval

🚀 Automate the evaluation of Claude Code plugin components to ensure accurate triggering of skills, agents, commands, and hooks.

TypeScript · 00 · Updated just now

ai-testing, anthropic, claude, claude-agent-sdk, claude-code, cli, developer-tools, evaluation-framework, llm, plugin-testing, test-automation, typescript