RAPTOR uses a multi-agent architecture where specialized agents handle specific security testing tasks. Agents are coordinated through the Claude Code integration and leverage reusable skills.Documentation Index
Fetch the complete documentation index at: https://mintlify.com/gadievron/raptor/llms.txt
Use this file to discover all available pages before exploring further.
Agent Architecture
Agents are defined in.claude/agents/ and are invoked by the main orchestrators (raptor.py, raptor_agentic.py, raptor_codeql.py, raptor_fuzzing.py).
Agent Definition Format
Each agent is defined with YAML frontmatter:The 17 Specialized Agents
Crash Analysis Agents
crash-analysis-agent
Location:.claude/agents/crash-analysis-agent.md:1
Purpose: Main orchestrator for analyzing security bugs from C/C++ projects
Workflow:
- Fetch bug report from tracker URL
- Clone repository to
./repo-<project-name> - Create working directory
./crash-analysis-<timestamp>/ - Understand build system (autotools, CMake, Makefile, meson)
- Rebuild with instrumentation (AddressSanitizer, debug symbols)
- Reproduce the crash
- Generate execution trace (function-level)
- Generate coverage data (gcov)
- Create RR recording for deterministic replay
- Invoke crash-analyzer agent for root-cause analysis
- Validate analysis with crash-analyzer-checker agent
- Write confirmed hypothesis
rr-debugger, function-tracing, gcov-coverage
crash-analyzer-agent
Purpose: Deep root-cause analysis using rr traces Approach:- Analyze rr deterministic replay traces
- Examine function execution traces
- Review coverage data
- Form hypotheses about root cause
- Write hypothesis to
root-cause-hypothesis-YYY.md
crash-analyzer-checker-agent
Purpose: Validates crash analysis rigorously Approach:- Review hypothesis against evidence
- Check for logical inconsistencies
- Verify claims against actual code
- Write rebuttal file if hypothesis rejected
- Iterate until validated (max 3 iterations)
function-trace-generator-agent
Purpose: Creates function-level execution traces Method:- Instruments code with
-finstrument-functions - Captures function entry/exit events
- Generates trace files in
<working-dir>/traces/
function-tracing
coverage-analysis-generator-agent
Purpose: Generates gcov coverage data Method:- Compiles with
--coverageflags - Runs program to generate
.gcdafiles - Produces coverage reports in
<working-dir>/gcov/
gcov-coverage
OSS Forensics Agents
oss-investigator-gh-archive-agent
Location:.claude/agents/oss-investigator-gh-archive-agent.md:1
Purpose: Query GH Archive via BigQuery for tamper-proof forensic evidence
Responsibilities:
- Construct BigQuery queries for GitHub events
- Execute queries for PushEvent, PullRequestEvent, IssuesEvent, etc.
- Create evidence using
GHArchiveCollector - Track which table each event came from
- Store evidence in
evidence.json
- Force push recovery (deleted commits)
- Workflow vs Direct API attribution
- Deleted tags/branches
github-archive, github-evidence-kit
oss-investigator-github-agent
Purpose: Collect evidence from live GitHub API Collects:- Commits, issues, pull requests
- Files, branches, tags, releases
- Forks and repository metadata
github-evidence-kit
oss-investigator-local-git-agent
Purpose: Analyze cloned repositories for forensic evidence Key capability:- Find dangling commits (not reachable from any ref)
- Reveal force-pushed or deleted commits
- Analyze local git history
github-evidence-kit
oss-investigator-wayback-agent
Purpose: Recover deleted content from Wayback Machine Collects:- Archived snapshots of GitHub pages
- Historical content with date filtering
- Snapshot content retrieval
github-wayback-recovery, github-evidence-kit
oss-investigator-ioc-extractor-agent
Purpose: Extract Indicators of Compromise from vendor reports IOC types:- COMMIT_SHA, FILE_PATH, FILE_HASH
- CODE_SNIPPET, EMAIL, USERNAME
- REPOSITORY, TAG_NAME, BRANCH_NAME
- WORKFLOW_NAME, IP_ADDRESS, DOMAIN
- URL, API_KEY, SECRET
github-evidence-kit
oss-hypothesis-former-agent
Purpose: Form evidence-backed hypotheses Approach:- Analyze collected evidence
- Identify patterns and anomalies
- Form testable hypotheses
- Document predictions
oss-evidence-verifier-agent
Purpose: Verify evidence against original sources Method:- Run
store.verify_all()on evidence - Check for tampering or inconsistencies
- Validate against GitHub API, GH Archive, Wayback
- Report verification status
github-evidence-kit
oss-hypothesis-checker-agent
Purpose: Validate claims against verified evidence Approach:- Review hypotheses
- Check against verified evidence
- Accept or reject based on evidence
- Document reasoning
oss-report-generator-agent
Purpose: Produce final forensic report Generates:- Executive summary
- Evidence timeline
- Hypothesis validation results
- Forensic conclusions
- IOCs and recommendations
.out/oss-forensics-<timestamp>/forensic-report.md
Exploitability Validation Agent
exploitability-validator-agent
Location:.claude/agents/exploitability-validator-agent.md:1
Purpose: Multi-stage pipeline to validate vulnerability findings are real, reachable, and exploitable
Workflow:
Phase 0: Initialize working directory
Phase 1 - Stage 0 (Inventory):
- Enumerate all files in target path
- Exclude test/mock files
- Extract functions per file
- Write
checklist.json
- Assess each function for vulnerability type
- Attempt PoC for candidates
- Write
findings.json - Route based on findings
- Build attack trees
- Form and test hypotheses
- Track PROXIMITY
- Attempt multiple attack paths
- Update working documents
- Verify files exist
- Verify code matches verbatim
- Verify flow is real
- Verify code is reachable
- Check for test/mock/example code
- Check for unrealistic preconditions
- Check for hedging language
- Write CONFIRMED findings
- Applies to memory corruption only
- Run
analyze_binary()from exploit_feasibility package - Save context with
save_exploit_context() - Update finding with feasibility verdict
exploitability-validation
Offensive Security Specialist
offsec-specialist
Location:.claude/agents/offsec-specialist.md:1
Purpose: General offensive security expertise
Capabilities:
- Penetration testing methodology
- Exploit development guidance
- Attack surface analysis
- Security research techniques
Skills System
Skills are reusable capabilities defined in.claude/skills/ that agents can leverage.
Crash Analysis Skills
rr-debugger
Location:.claude/skills/crash-analysis/rr-debugger/SKILL.md:1
Purpose: Deterministic debugging with rr record-replay
Core workflow:
reverse-next/rn- Step back over function callsreverse-step/rs- Step back into functionsreverse-continue/rc- Continue backward to previous breakpointreverse-stepi/rsi- Step back one instruction
scripts/crash_trace.py automatically extracts execution trace before crash
function-tracing
Location:.claude/skills/crash-analysis/function-tracing/SKILL.md:1
Purpose: Function instrumentation with -finstrument-functions
Files:
trace_instrument.c- Instrumentation callbackstrace_to_perfetto.cpp- Convert traces to Perfetto format
gcov-coverage
Location:.claude/skills/crash-analysis/gcov-coverage/SKILL.md:1
Purpose: Code coverage collection
Usage:
line-execution-checker
Location:.claude/skills/crash-analysis/line-execution-checker/SKILL.md:1
Purpose: Fast line execution queries
File: line_checker.cpp - Query if specific lines executed
OSS Forensics Skills
github-evidence-kit
Location:.claude/skills/oss-forensics/github-evidence-kit/SKILL.md:1
Purpose: Generate, export, load, and verify forensic evidence from GitHub sources
Collectors:
- Events: PushEvent, PullRequestEvent, IssueEvent, etc.
- Observations: CommitObservation, IssueObservation, FileObservation, etc.
- IOCs: Indicators of Compromise with source verification
github-archive
Location:.claude/skills/oss-forensics/github-archive/SKILL.md:1
Purpose: Query GH Archive via BigQuery
Requires: GOOGLE_APPLICATION_CREDENTIALS for BigQuery
Event types: All 12 GitHub event types (PushEvent, PullRequestEvent, CreateEvent, DeleteEvent, etc.)
github-commit-recovery
Location:.claude/skills/oss-forensics/github-commit-recovery/SKILL.md:1
Purpose: Recover deleted commits from GH Archive
Method:
- Query GH Archive for force push events
- Extract deleted commit SHAs from
payload.before - Reconstruct commit metadata
github-wayback-recovery
Location:.claude/skills/oss-forensics/github-wayback-recovery/SKILL.md:1
Purpose: Recover content from Wayback Machine
Method:
- Query Wayback CDX API for snapshots
- Retrieve archived content
- Extract historical state
Exploitability Validation Skill
exploitability-validation
Location:.claude/skills/exploitability-validation/SKILL.md:1
Purpose: Multi-stage pipeline for validating vulnerability findings
Configuration:
- GATE-1 [ASSUME-EXPLOIT]: Assume exploitable until proven otherwise
- GATE-2 [STRICT-SEQUENCE]: Strictly follow instructions
- GATE-3 [CHECKLIST]: Check pipeline, update checklist, collect evidence
- GATE-4 [NO-HEDGING]: Verify all uncertain claims immediately
- GATE-5 [FULL-COVERAGE]: Test entire codebase against checklist.json
- GATE-6 [PROOF]: Always provide proof and show vulnerable code
| Stage | File | Purpose |
|---|---|---|
| 0 | stage-0-inventory.md | Build ground truth checklist |
| A | stage-a-oneshot.md | Quick exploitability + PoC |
| B | stage-b-process.md | Systematic analysis, attack trees |
| C | stage-c-sanity.md | Validate against actual code |
| D | stage-d-ruling.md | Filter preconditions/hedging |
| E | stage-e-feasibility.md | Binary constraint analysis |
attack-tree.json- Knowledge graph, source of truthhypotheses.json- Active hypotheses with statusdisproven.json- Failed hypotheses and whyattack-paths.json- Paths attempted, PoC results, PROXIMITY, blockersattack-surface.json- Sources, sinks, trust boundaries
Exploit Development Skill
exploit-dev
Location:.claude/skills/exploit-dev/instructions.md:1
Purpose: Exploit development guidance and templates
Coverage:
- Exploit code templates by vulnerability type
- Constraint checking (ASLR, DEP, stack canaries, etc.)
- Technique alternatives when standard approaches blocked
- Environment recommendations (Docker, older glibc)
Agent Orchestration Patterns
Sequential Orchestration
Used byraptor_agentic.py and raptor_codeql.py:
Parallel Agent Invocation
Used byoss-forensics:
Iterative Refinement
Used bycrash-analysis-agent:
Agent Usage Examples
Crash Analysis
OSS Forensics
Exploitability Validation
Benefits of Multi-Agent Architecture
- Specialization: Each agent focuses on one specific task
- Reusability: Skills can be shared across multiple agents
- Parallelization: Independent agents can run in parallel
- Testability: Each agent can be tested in isolation
- Extensibility: New agents can be added without modifying existing ones
- Clarity: Clear separation of concerns and responsibilities