agent-evaluation
Skill from glennguilloux/context-engineering-kit
Evaluates AI agent performance through comprehensive metrics, benchmarking, and systematic analysis of response quality, accuracy, and contextual understanding.
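As a rough illustration of the metric-and-benchmark loop that description refers to, here is a minimal Python sketch. Every name in it (the TestCase schema, score_response, run_benchmark) is hypothetical, invented for illustration, and not part of the skill's actual interface:

```python
# Hypothetical sketch of a minimal agent-evaluation harness.
# None of these names come from the skill itself; they only
# illustrate scoring agent responses against benchmark cases.
from dataclasses import dataclass


@dataclass
class TestCase:
    prompt: str
    expected_keywords: list[str]  # crude proxy for accuracy checks


def score_response(response: str, case: TestCase) -> float:
    """Fraction of expected keywords found in the response (0.0 to 1.0)."""
    if not case.expected_keywords:
        return 0.0
    hits = sum(1 for kw in case.expected_keywords if kw.lower() in response.lower())
    return hits / len(case.expected_keywords)


def run_benchmark(agent, cases: list[TestCase]) -> dict:
    """Run each case through the agent callable and aggregate a mean score."""
    scores = [score_response(agent(case.prompt), case) for case in cases]
    return {"mean_score": sum(scores) / len(scores), "per_case": scores}


if __name__ == "__main__":
    # Stand-in "agent": returns a canned answer so the harness runs end to end.
    demo_agent = lambda prompt: "Paris is the capital of France."
    cases = [TestCase("What is the capital of France?", ["Paris", "France"])]
    print(run_benchmark(demo_agent, cases))
```

A real harness would replace the keyword check with richer metrics (semantic similarity, rubric-based grading), but the structure stays the same: cases in, per-case scores out, aggregated into a benchmark summary.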
Same repository: glennguilloux/context-engineering-kit (55 items)
Installation
npx skills add https://github.com/glennguilloux/context-engineering-kit --skill agent-evaluation

Need more details? View the full documentation on GitHub.
More from this repository (10)
- Guides developers through writing automated tests before code, ensuring robust, modular software design and preventing regressions.
- Rapidly brainstorms and generates innovative concepts across domains by leveraging structured ideation techniques and creative prompting strategies.
- Designs scalable system architectures, evaluates design patterns, and provides architectural recommendations for complex software projects.
- Provides deep contextual reasoning and explanation generation for complex problems, breaking down root causes and logical connections.
- Generates comprehensive unit, integration, and end-to-end test suites with best practices, covering edge cases and ensuring robust code quality across different frameworks.
- Enables systematic cognitive decomposition of complex problems through structured reasoning frameworks, logical inference chaining, and meta-cognitive analysis techniques.
- Generates comprehensive self-analysis and introspective insights about Claude's current context, reasoning, and potential biases during conversation.
- Generates comprehensive project roadmaps with detailed milestones, resource allocation, risk assessment, and actionable implementation strategies.
- Evaluates code quality, complexity, and potential issues by analyzing syntax, design patterns, and best practices across multiple programming languages.
- Automatically links code reviews to pull requests, streamlining collaboration and tracking feedback across GitHub repositories.