evaluation

30 results for tag "evaluation"

🔌

Plugins

promptfoo-evaluationdaymade/claude-code-skills4880

A professional Claude Code skills marketplace featuring 37 production-ready skills for enhanced development workflows.

quant-researchterrylica/cc-skills★ 14 llm-evalsvanman2024/ai-dev-marketplace★ 3 eval-1337yzavyas/claude-1337★ 2 agent-toolslhohan/agent-chisels★ 1 evaluation-toolsAgentient/vibekit skill-evaluatorlhohan/claude-code-plugins

🏪

Marketplaces

fr33d3m0n-threat-modelingfr33d3m0n/threat-modeling2301

AI-native, LLM-driven threat-modeling skill (fr33d3m0n/threat-modeling v3.1.0) for automated software risk analysis, security audit, and penetration testing — covers the full OWASP MCP Top 10 (2025), the SAO (Subject-Action-Object) agent threat model, 13 pre-built MITRE ATT&CK agent attack chains, and the `SKILL.MD = UNTRUSTED` trust-inversion paradigm for agent security.

🎯

Skills

evaluationsickn33/antigravity-awesome-skills1710

Systematically evaluates agent system performance through multi-dimensional rubrics, tracking improvements, and validating context engineering choices.

🔌

Plugins

promptfoo-evaluationdaymade/claude-code-skills4880

A professional Claude Code skills marketplace featuring 37 production-ready skills for enhanced development workflows.

developer-tools promptfoo evaluation+6

🏪

Marketplaces

fr33d3m0n-threat-modelingfr33d3m0n/threat-modeling2301

🎯

Skills

evaluationsickn33/antigravity-awesome-skills1710

Systematically evaluates agent system performance through multi-dimensional rubrics, tracking improvements, and validating context engineering choices.

fr33d3m0n-skill-threat-modelingfr33d3m0n/skill-threat-modeling1301

Code-First Deep Threat Modeling - LLM-native security analysis framework with automated 8-phase workflow, dual-track knowledge architecture (Security Controls + Threat Patterns), and comprehensive verification capabilities. Transform any codebase into structured threat models without design documents.

community 8phase alignment+8

ancoleman-ai-design-componentsancoleman/ai-design-components10519

Comprehensive full-stack development skills for AI-assisted development covering UI/UX, backend, DevOps, infrastructure, security, and AI/ML.

community across analyzing+112

alonw0-llm-docs-optimizeralonw0/llm-docs-optimizer521

A Claude Code plugin that optimizes documentation for AI coding assistants like Claude, GitHub Copilot, and other LLMs. Makes your docs more effective through c7score optimization, llms.txt generation, question-driven restructuring, and automated quality scoring.

community assistants automated+8

rafaelcalleja-claude-market-placerafaelcalleja/claude-market-place221

Comprehensive Claude Code plugin marketplace featuring productivity commands (/quick-test, /analyze-deps, /project-stats), specialized code analysis agents (security-auditor, performance-optimizer, architecture-reviewer), and automatic code formatting hooks. Perfect for learning plugin development or extending your Claude Code workflow with production-ready examples.

productivity development framework+251