Open Source · Agent Security · OWASP Top 10

BREAK.
BEFORE
THEY DO

Crucible stress-tests AI agents against every known attack vector — prompt injection, goal hijacking, tool misuse — before your system reaches production.

View on GitHub Explore Modules →

Scroll to explore

Prompt Injection

Goal Hijacking

Tool Misuse

OWASP Agentic 2026

MCP Security

Multi-Agent Hardening

Red Teaming

CI/CD Integration

LangChain · CrewAI · AutoGen

Apache 2.0

Open Source

Prompt Injection

Goal Hijacking

Tool Misuse

OWASP Agentic 2026

MCP Security

Multi-Agent Hardening

Red Teaming

CI/CD Integration

LangChain · CrewAI · AutoGen

Apache 2.0

Open Source

Prompt Injection

Goal Hijacking

Tool Misuse

OWASP Agentic 2026

MCP Security

Multi-Agent Hardening

Red Teaming

CI/CD Integration

LangChain · CrewAI · AutoGen

Apache 2.0

Open Source

Prompt Injection

Goal Hijacking

Tool Misuse

OWASP Agentic 2026

MCP Security

Multi-Agent Hardening

Red Teaming

CI/CD Integration

LangChain · CrewAI · AutoGen

Apache 2.0

Open Source

The Problem

AI Agents
Ship
Untested

83% of organizations are deploying AI agents in 2026. Only 29% feel ready to do it securely. The gap between deployment speed and security maturity is where breaches happen.

The Attack Surface

1000+
Known
Vectors

Every AI agent exposes a new attack surface. OWASP published the Agentic Top 10 in December 2025 — the first formal taxonomy of autonomous AI risks. Crucible covers all of them.

The Solution

Test
Before
Deploy

Crucible runs 1,000+ adversarial attack scenarios against your agent in under 60 seconds. Automated, continuous, and integrated into your CI/CD pipeline.

What Crucible Tests

Eight Modules.
Zero Blind Spots.

Every attack surface an AI agent exposes — covered, tested, and scored before you ship.

01 / 08

Reliability Testing

Runs your agent 10× on identical inputs. Scores variance and detects non-deterministic failure modes.

Live v0.1

02 / 08

OWASP Top 10

Full coverage of all 10 OWASP Agentic AI risks. Goal hijacking, tool misuse, identity abuse, memory poisoning.

Live v0.1

03 / 08

Hallucination Detection

Benchmarks responses against ground truth. Scores factual accuracy and identifies hallucination patterns.

Live v0.1

04 / 08

Goal Completion

Did the agent actually finish the task? Evaluates outcomes, not just output format or response length.

v0.2

05 / 08

Tool Use Accuracy

Verifies agents call correct tools with valid parameters. Detects tool abuse and unauthorized API access.

v0.2

06 / 08

Multi-Agent Handoffs

Tests context integrity across agent-to-agent comms. Detects context poisoning in multi-step pipelines.

v0.3

07 / 08

Cost Efficiency

Token usage, latency, cost-per-task scoring. Production readiness report with optimization guidance.

v0.3

08 / 08

CI/CD Integration

GitHub Actions plugin. Fails your build automatically if agent security score drops below threshold.

v0.4

Live Simulation

The Crucible
In Action.

Witness a real-time adversarial scan. 1,247 vectors deployed. Zero blind spots remaining.

CRUCIBLE_ENGINE_V1.0.4 - ACTIVE_SCAN

● RECORDING

[ 16:34:53 ] CRUCIBLE CORE INITIALIZED

[ 16:34:54 ] CONNECTING TO TARGET: https://agent-v4.nexus-ai.io

[ 16:34:55 ] READY FOR DEPLOYMENT...

--------------------------------------------------

SCANNING...

PROGRESS: 0%

Three Steps

Install.
Configure.
Run.

Step 01

Install in One Line

One pip install, zero dependencies hell. Works on Python 3.10+. No Docker, no cloud signup, no API keys to start.

$ pip install crucible-security

✓ Installed crucible-security 0.1.0

Step 02

Point at Your Agent

Define your agent endpoint and attack scope. Choose modules, set thresholds, configure reporting — or use smart defaults.

# crucible.yaml

target: https://my-agent.io

modules: [security, reliability]

threshold: 80

Step 03

Get Your Score

Crucible hammers your agent with 1,000+ adversarial scenarios. You get a detailed report with severity, reproduction steps, and fix suggestions.

$ crucible scan

⠸ Running 1,247 attack vectors...

COMPLETE — Score: 74/100

3 critical · 7 medium · 12 low

"Every AI agent is a
new attack surface.
We exist to find the cracks
before production does."

— The Crucible Team · Open Source · Built in 2026

Apache 2.0 License

Fully
Open Source.
Forever.

Crucible's core is and always will be free. No paywalls, no telemetry, no lock-in. Built by the community, for the community building AI.

Star on GitHub Read Docs →

crucible — terminal

$ pip install crucible-security

✓ Installed crucible-security 0.1.0

$ crucible scan --target https://my-agent.com

⠸ Loading attack modules...

✓ Security: 1,247 vectors loaded

✓ Reliability module ready

✓ Hallucination module ready

⠸ Running adversarial scan...

SCAN COMPLETE — Score: 74/100

3 critical · 7 medium · 12 low

→ ./crucible-report.json saved

The Team

Built by Builders.
Open to Builders.

We recruit through contribution. Your code speaks louder than your resume. Contribute — and we'll reach out.

Founder & Core

Khalid Saifullah Sayyed

AI & backend engineer. LLM security specialist. IIT Patna hackathon winner. Contributor to YC W20 10k★ repos. Building the future of AI agent security.

LLM SecurityMulti-AgentFastAPIRed TeamingPython

Open Position

AI / LLM Engineer

Deep understanding of language model internals. Can build adversarial prompt systems that find vulnerabilities others miss. Transformer architecture knowledge essential.

TransformersAdversarial MLLangChainPrompt Eng

Open Position

Security Engineer

Traditional security background meets AI. Deep OWASP knowledge, penetration testing experience, now applying it to agentic systems.

PentestingOWASPCVE ResearchThreat Modeling

Open Position

Backend / Infra

Builds systems running thousands of concurrent attack simulations. Async Python, job queues, cloud-native. Performance matters.

FastAPIRedisDockerCeleryGo

Open Position

Frontend / DX

Builds the dashboard, CLI UX, and docs. Believes developer experience is a product decision. Obsessed with making complex things feel simple.

ReactTypeScriptCLI DesignDocs

Could Be You

Contribute to the repo. If your work stands out, we find you.

Contribute Now

BREAK. BEFORE THEY DO

AI AgentsShipUntested

1000+KnownVectors

TestBeforeDeploy

Eight Modules.Zero Blind Spots.

The CrucibleIn Action.

Install.Configure.Run.

FullyOpen Source.Forever.

Built by Builders.Open to Builders.

YOURAGENTIS NOTSAFE. YET.

BREAK.
BEFORE
THEY DO

AI Agents
Ship
Untested

1000+
Known
Vectors

Test
Before
Deploy

Eight Modules.
Zero Blind Spots.

The Crucible
In Action.

Install.
Configure.
Run.

Fully
Open Source.
Forever.

Built by Builders.
Open to Builders.

YOUR
AGENT
IS NOT
SAFE. YET.