The Exploitability Validator agent orchestrates a multi-stage pipeline that validates vulnerability findings before exploit development, preventing wasted effort on false positives and theoretical vulnerabilities.Documentation Index
Fetch the complete documentation index at: https://mintlify.com/gadievron/raptor/llms.txt
Use this file to discover all available pages before exploring further.
Purpose
Validate that findings:- Actually exist (not hallucinated)
- Are reachable (not dead code)
- Have working exploitation paths (not just theoretical)
Invocation
target_path: Directory or file to analyze--vuln-type: Optional focus (e.g.,command_injection,sql_injection,xss)--findings: Optional pre-existing findings to validate (skips Stage 0/A)
Pipeline Stages
Shared Context (MUST-GATEs)
Before executing ANY stage, load:.claude/skills/exploitability-validation/SKILL.md
This contains:
- [CONFIG]: Configuration settings
- [EXEC]: Execution rules
- [GATES]: MUST-GATEs 1-6 that apply to ALL stages
- [REMIND]: Critical reminders
MUST-GATEs Overview
GATE-1: Assume Exploitable
GATE-1: Assume Exploitable
Treat all findings as exploitable until proven otherwise. Burden of proof is on disproving, not proving.
GATE-2: No Hedging
GATE-2: No Hedging
Verify all uncertain claims. No “likely”, “probably”, “appears to” without verification.
GATE-3: Document Everything
GATE-3: Document Everything
Update working documents after every action. Maintain audit trail.
GATE-4: Verify Claims
GATE-4: Verify Claims
Every claim needs evidence. No assumptions without verification.
GATE-5: No Sampling
GATE-5: No Sampling
Check ALL code per checklist.json. No random sampling or incomplete coverage.
GATE-6: Proof Required
GATE-6: Proof Required
Working PoC or concrete disproof required. No theoretical assessments.
Stage Details
Stage 0: Inventory
Load:.claude/skills/exploitability-validation/stage-0-inventory.md
Execution:
- Enumerate all files in target_path
- Exclude test/mock files
- Extract functions per file
- Write
checklist.json
checklist.json with complete function inventory
Stage A: One-Shot
Load:.claude/skills/exploitability-validation/stage-a-oneshot.md
Execution:
- Assess each function for vuln_type
- Attempt PoC for candidates
- Write
findings.json
- All PoCs succeed → Skip to Stage C
- Some “not_disproven” → Continue to Stage B
- All disproven → Report “no exploitable findings” and exit
Stage B: Systematic Process
Load:.claude/skills/exploitability-validation/stage-b-process.md
Execution:
- Build attack trees for “not_disproven” findings
- Form and test hypotheses
- Track PROXIMITY metrics
- Attempt multiple attack paths
- Update working documents
findings.json(updated)attack-tree.jsonhypotheses.jsondisproven.jsonattack-paths.jsonattack-surface.json
Stage C: Sanity Check
Load:.claude/skills/exploitability-validation/stage-c-sanity.md
Execution:
- Verify files exist
- Verify code matches verbatim
- Verify flow is real
- Verify code is reachable
findings.json with sanity_check results
Stage D: Ruling
Load:.claude/skills/exploitability-validation/stage-d-ruling.md
Execution:
- Check for test/mock/example code
- Check for unrealistic preconditions
- Check for hedging language
findings.json with CONFIRMED findings only
Stage E: Feasibility (Memory Corruption Only)
Load:.claude/skills/exploitability-validation/stage-e-feasibility.md
Applies to:
- buffer_overflow
- heap_overflow
- format_string
- use_after_free
- double_free
- integer_overflow
- out_of_bounds_read/write
- command_injection
- sql_injection
- xss
- path_traversal
- ssrf
- deserialization
Working Directory Structure
Final Report Format
Example Executions
- Web Vulnerability
- Memory Corruption
Error Handling
- File not found: Stop, report which file, ask user for correct path
- Stage fails: Report which stage, what failed, offer to retry or skip
- No findings: Report “no exploitable vulnerabilities found” (valid outcome)
- Sanity check failures: Report as potential hallucinations, continue with valid findings
Integration with /agentic
The
/agentic command now automatically runs exploitability validation (Phase 2) between scanning and analysis.Use --skip-validation to bypass.Related Agents
OffSec Specialist
Offensive security operations and vulnerability discovery
Crash Analysis
Analyze crashes from fuzzing campaigns
Related Personas
Exploit Developer
Generate working exploit proof-of-concepts
Binary Exploitation Specialist
Binary exploit generation methodology