🎯

agent-evaluation

🎯 Skill

from glennguilloux/context-engineering-kit

What it does

Evaluates AI agent performance through comprehensive metrics, benchmarking, and systematic analysis of response quality, accuracy, and contextual understanding.

📦

Same repository

glennguilloux/context-engineering-kit (55 items)

agent-evaluation

Installation

Quick Install

Install with npx
npx skills add https://github.com/glennguilloux/context-engineering-kit --skill agent-evaluation

Need more details? View full documentation on GitHub →

Installs: 1
Added: Feb 13, 2026

More from this repository (10)

🎯
test-driven-development 🎯 Skill

Guides developers through writing automated tests before code, ensuring robust, modular software design and preventing regressions.

🎯
create-ideas 🎯 Skill

Rapidly brainstorm and generate innovative concepts across domains by leveraging structured ideation techniques and creative prompting strategies.

🎯
software-architecture 🎯 Skill

Designs scalable system architectures, evaluates design patterns, and provides architectural recommendations for complex software projects.

🎯
why 🎯 Skill

Provides deep contextual reasoning and explanation generation for complex problems, breaking down root causes and logical connections.

🎯
write-tests 🎯 Skill

Generates comprehensive unit, integration, and end-to-end test suites with best practices, covering edge cases and ensuring robust code quality across different frameworks.

🎯
thought-based-reasoning 🎯 Skill

Enables systematic cognitive decomposition of complex problems through structured reasoning frameworks, logical inference chaining, and meta-cognitive analysis techniques.

🎯
reflect 🎯 Skill

Generates comprehensive self-analysis and introspective insights about Claude's current context, reasoning, and potential biases during conversation.

🎯
plan 🎯 Skill

Generates comprehensive project roadmaps with detailed milestones, resource allocation, risk assessment, and actionable implementation strategies.

🎯
judge 🎯 Skill

Evaluates code quality, complexity, and potential issues by analyzing syntax, design patterns, and best practices across multiple programming languages.

🎯
attach-review-to-pr 🎯 Skill

Automatically links code reviews to pull requests, streamlining collaboration and tracking feedback across GitHub repositories.