GitHunt — Discover GitHub Repositories

506 results for “topic:llm-security”

Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.

Jupyter Notebook56.2k1.4kUpdated just now

chatbothugging-facellmllm-localllm-prompting+10

NVIDIA/garak

the LLM vulnerability scanner

HTML7.2k813Updated 2 hours ago

aillm-evaluationllm-securitysecurity-scannersvulnerability-assessment

NVIDIA-NeMo/Guardrails

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Python5.7k611Updated 3 hours ago

agentsgenerative-aiguardrailsllm-safetyllm-security+4

Giskard-AI/giskard-oss

🐢 Open-Source Evaluation & Testing library for LLM Agents

Python5.1k401Updated 2 days ago

agent-evaluationai-red-teamai-securityai-testingfairness-ai+12

verazuo/jailbreak_llms

[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

Jupyter Notebook3.6k319Updated 9 hours ago

chatgptjailbreakjailbreakinglarge-language-modelllm+2

Tencent/AI-Infra-Guard

A full-stack AI Red Teaming platform securing AI ecosystems via AI Infra scan, MCP scan, Agent skills scan, and LLM jailbreak evaluation.

Python3.1k310Updated 1 hour ago

agentagent-scanagentskillsai-infraai-red-team+13

protectai/llm-guard

The Security Toolkit for LLM Interactions

Python2.6k347Updated 3 hours ago

adversarial-machine-learningchatgptlarge-language-modelsllmllm-security+5

mariocandela/beelzebub

A secure low code honeypot framework, leveraging AI for System Virtualization.

Go1.9k178Updated 1 hour ago

acisagentic-ai-securitycloudnativecloudsecuritycybersecurity+15

msoedov/agentic_security

Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪

Python1.8k241Updated 5 hours ago

agent-frameworkagent-securityai-red-teamllm-evaluationllm-evaluation-framework+9

cyberark/FuzzyAI

A powerful tool for automated LLM fuzzing. It is designed to help developers and security researchers identify and mitigate potential jailbreaks in their LLM APIs.

Jupyter Notebook1.2k173Updated 9 hours ago

aiai-red-teamfuzzingjailbreakjailbreaking+5

OWASP/www-project-top-10-for-large-language-model-applications

OWASP Top 10 for Large Language Model Apps (Part of the GenAI Security Project)

Python1.1k295Updated 5 hours ago

aiappsecllmllm-security

splx-ai/agentic-radar

A security scanner for your LLM agentic workflows

Python923121Updated 18 hours ago

agentic-aiagentic-frameworkagentic-workflowaiai-red-teaming+11

EasyJailbreak/EasyJailbreak

An easy-to-use Python framework to generate adversarial jailbreak prompts.

Python82179Updated 1 day ago

discrete-optimizationjailbreakjailbreak-frameworklarge-language-modelllm-safety-benchmark+1

chawins/llm-sp

Papers and resources related to the security and privacy of LLMs 🤖

Python56943Updated 1 day ago

adversarial-machine-learningawesome-listllmllm-privacyllm-security+2

deadbits/vigil-llm

⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs

Python46253Updated 3 days ago

adversarial-attacksadversarial-machine-learninglarge-language-modelsllm-securityllmops+3

liu00222/Open-Prompt-Injection

This repository provides a benchmark for prompt injection attacks and defenses in LLMs

Python40062Updated 1 day ago

llmllm-securityllmsprompt-injectionprompt-injection-tool+1

mensfeld/code-on-incus

Run coding agents in hardened Incus containers with real-time network threat detection, automatic threat response (pause/kill), credential isolation, protected paths, session persistence, and multi-slot support.

Python30318Updated 4 hours ago

agentic-aiai-toolsanthropicclaudeclaude-code+15

TrustAI-laboratory/Learn-Prompt-Hacking

This is The most comprehensive prompt hacking course available, which record our progress on a prompt engineering and prompt hacking course.

Jupyter Notebook27033Updated 1 day ago

jailbreakllmllm-learningllm-securityllm-tutorials+5

adversa-ai/secureclaw

SecureClaw - Security Plugin and Skill for OpenClaw OWASP-Aligned

TypeScript22534Updated just now

agentic-aiai-agentsai-securityllm-securityopenclaw+5

R3DRUN3/sploitcraft

🏴‍☠️ Hacking Guides, Demos and Proof-of-Concepts 🥷

Jupyter Notebook21927Updated 1 week ago

aiawscloudcontainer-securitycybersecurity+13

sshh12/llm_backdoor

Experimental tools to backdoor large language models by re-writing their system prompts at a raw parameter level. This allows you to potentially execute offline remote code execution without running any actual code on the victim's machine or thwart LLM-based fraud/moderation systems.

Python20325Updated 4 days ago

backdoor-attacksllm-securityqwen2-5

LLAMATOR-Core/llamator

Red Teaming python-framework for testing chatbots and GenAI systems.

Python20219Updated just now

agentaiai-securityattackhallucinations+15

edwinkys/phantasm

Toolkits to create a human-in-the-loop approval layer to monitor and guide AI agents workflow in real-time.

Svelte1897Updated 1 week ago

ai-agentsai-safetyai-securityapproval-workflowautomation-tools+10

luckyPipewrench/pipelock

Firewall for AI agents. DLP scanning, SSRF protection, bidirectional MCP scanning, tool poisoning detection, and workspace integrity monitoring.

Go18410Updated 1 hour ago

ai-agentsai-securitydlpegress-proxyfetch-proxy+6

Pantheon-Security/medusa

AI-first security scanner with 76 analyzers, 4,000+ detection rules, 508 FP filters (96.8% reduction), and 133 CVE detections for AI/ML, LLM agents, and MCP servers

Python17231Updated 1 week ago

agent-securityai-securitycode-analysiscve-detectiondevsecops+12

lakeraai/pint-benchmark

A benchmark for prompt injection detection systems.

Jupyter Notebook16621Updated 1 day ago

benchmarkllmllm-benchmarkingllm-securityprompt-injection

yevh/TaaC-AI

AI-driven Threat modeling-as-a-Code (TaaC-AI)

HTML16123Updated 1 day ago

aiapplication-securityclaude-3devsecopsgpt+11

ZenGuard-AI/fast-llm-security-guardrails

The fastest Trust Layer for AI Agents

Python15220Updated 1 week ago

agentic-aiai-agentai-agentsai-runtimecx-agent+6

Repello-AI/whistleblower

Whistleblower is a offensive security tool for testing against system prompt leakage and capability discovery of an AI application exposed through API. Built for AI engineers, security researchers and folks who want to know what's going on inside the LLM-based app they use daily

Python14927Updated 3 days ago

ai-red-teamingai-securityhacktoberfesthacktoberfest2025jailbreaks+3

edward-playground/aidefense-framework

An open-source knowledge base of defensive countermeasures to protect AI/ML systems. Features interactive views and maps defenses to known threats from frameworks like MITRE ATLAS, MAESTRO, and OWASP.

JavaScript13728Updated 12 hours ago

ai-securityaidefendatlascybersecuritydefensive-security+8

Page 1 of 17