Agent 1: Prompt Injection Testing
Tests: LLM01 - Direct and indirect prompt injection
Covers:
- Direct instruction override attacks
- System prompt extraction
- Indirect injection via RAG/documents
- Multi-turn context manipulation
- Session hijacking and token extraction
Output: Prompt injection PoCs, bypass techniques, remediation
---
Agent 2: Output Handling Exploitation
Tests: LLM02 - Insecure output handling
Covers:
- Code injection (Python, SQL, shell commands)
- XSS via generated HTML/JavaScript
- Template injection attacks
- Unsafe deserialization
- Malicious content propagation
Output: Injection payloads, successful exploits, detection bypass
---
Agent 3: Training Data Analysis
Tests: LLM03 - Data poisoning vulnerability assessment
Covers:
- Membership inference attacks
- Training data extraction attempts
- Backdoor trigger identification
- Bias and adversarial example detection
- Model behavior anomalies
Output: Data exposure findings, backdoor triggers, bias analysis
---
Agent 4: Resource Exhaustion Testing
Tests: LLM04 - Model DoS vulnerabilities
Covers:
- Token flooding attacks
- Context window exhaustion
- Recursive expansion exploitation
- Computational overload testing
- Cost impact analysis
Output: DoS techniques, impact assessment, mitigation guidance
---
Agent 5: Supply Chain Assessment
Tests: LLM05 - Supply chain vulnerabilities
Covers:
- Dependency vulnerability scanning
- Plugin/integration security testing
- Model source verification
- API endpoint security
- Third-party risk assessment
Output: Vulnerability inventory, risk scores, remediation roadmap
---
Agent 6: Agency Exploitation
Tests: LLM06 - Excessive agency vulnerabilities
Covers:
- Privilege escalation attempts
- Unauthorized API calls
- Permission boundary testing
- State modification exploits
- Lateral movement via model
Output: Privilege escalation PoCs, permission bypasses
---
Agent 7: Model Extraction Attack
Tests: LLM07 - Model theft and extraction
Covers:
- Query-based model extraction
- Output analysis and inference
- Membership inference attacks
- Model property inference
- Training data reconstruction
Output: Extracted model info, leakage assessment, impact analysis
---
Agent 8: Vector DB Poisoning
Tests: LLM08 - Vector database and RAG attacks
Covers:
- Malicious document injection
- Retrieval manipulation
- Embedding space attacks
- Citation spoofing
- Knowledge base poisoning
Output: Injection techniques, retrieval bypass, remediation
---
Agent 9: Decision Reliance Testing
Tests: LLM09 - Overreliance vulnerabilities
Covers:
- Hallucination injection
- Output confidence analysis
- Verification workflow gaps
- Human-in-the-loop bypass
- False authority establishment
Output: Hallucination techniques, confidence manipulation, process gaps
---
Agent 10: Logging Bypass Testing
Tests: LLM10 - Insufficient logging and monitoring
Covers:
- Log deletion or evasion
- Monitoring detection bypass
- Unlogged request techniques
- Alert threshold manipulation
- Forensic evidence destruction
Output: Evasion techniques, detection gaps, monitoring recommendations
---