eval-harness
Skill from affaan-m/everything-claude-code
Evaluates and grades Claude AI performance across different tasks using configurable verification loops and pass@k metrics.
Installation
npx skills add https://github.com/affaan-m/everything-claude-code --skill eval-harness
Skill Details
Overview
# Everything Claude Code

The complete collection of Claude Code configs from an Anthropic hackathon winner.
Production-ready agents, skills, hooks, commands, rules, and MCP configurations evolved over 10+ months of intensive daily use building real products.
---
The Guides
This repo is the raw code only. The guides explain everything.
Shorthand Guide
Setup, foundations, philosophy. Read this first.
Longform Guide
Token optimization, memory persistence, evals, parallelization.
| Topic | What You'll Learn |
|-------|-------------------|
| Token Optimization | Model selection, system prompt slimming, background processes |
| Memory Persistence | Hooks that save/load context across sessions automatically |
| Continuous Learning | Auto-extract patterns from sessions into reusable skills |
| Verification Loops | Checkpoint vs continuous evals, grader types, pass@k metrics |
| Parallelization | Git worktrees, cascade method, when to scale instances |
| Subagent Orchestration | The context problem, iterative retrieval pattern |
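The pass@k metric named in the Verification Loops row can be illustrated with a short calculation. The guide's own definition is not reproduced here, so this is a minimal sketch assuming the widely used unbiased estimator: given `n` sampled attempts of which `c` passed, pass@k is the probability that at least one of `k` attempts drawn without replacement passes, i.e. `1 - C(n - c, k) / C(n, k)`. The function name `passAtK` is hypothetical and is not part of this repository.

```javascript
// Hypothetical helper (not from the repo): unbiased pass@k estimate.
// n = total attempts sampled, c = attempts that passed, k = attempt budget.
function passAtK(n, c, k) {
  if (![n, c, k].every(Number.isInteger)) {
    throw new TypeError('n, c and k must be integers');
  }
  if (k < 1 || k > n || c < 0 || c > n) {
    throw new RangeError('require 1 <= k <= n and 0 <= c <= n');
  }
  // Fewer than k failures exist, so every size-k draw contains a pass.
  if (n - c < k) return 1;

  // C(n - c, k) / C(n, k) rewritten as a running product so that no large
  // factorials are ever computed.
  let allFail = 1;
  for (let i = n - c + 1; i <= n; i++) {
    allFail *= 1 - k / i;
  }
  return 1 - allFail;
}

// Usage: 10 attempts, 3 passed, budget of 1 attempt.
console.log(passAtK(10, 3, 1).toFixed(2)); // prints "0.30"
```

With `k = 1` the estimate reduces to the plain pass rate `c / n`; larger `k` values reward tasks that pass at least occasionally.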
---
Cross-Platform Support
This plugin now fully supports Windows, macOS, and Linux. All hooks and scripts have been rewritten in Node.js for maximum compatibility.
Package Manager Detection
The plugin automatically detects your preferred package manager (npm, pnpm, yarn, or bun) with the following priority:
- Environment variable: `CLAUDE_PACKAGE_MANAGER`
- Project config: `.claude/package-manager.json`
- package.json: `packageManager` field
- Lock file: Detection from package-lock.json, yarn.lock, pnpm-lock.yaml, or bun.lockb
- Global config: `~/.claude/package-manager.json`
- Fallback: First available package manager
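The priority order above can be sketched as a chain of probes that stops at the first hit. This is an illustration only, not the plugin's actual implementation: the function names, the assumed config shape (`{ "packageManager": "pnpm" }`), the fallback order, and the `--version` availability probe are all assumptions.

```javascript
const fs = require('node:fs');
const os = require('node:os');
const path = require('node:path');
const { spawnSync } = require('node:child_process');

const KNOWN = ['npm', 'pnpm', 'yarn', 'bun'];
const LOCK_FILES = [
  ['package-lock.json', 'npm'],
  ['yarn.lock', 'yarn'],
  ['pnpm-lock.yaml', 'pnpm'],
  ['bun.lockb', 'bun'],
];

// Return parsed JSON, or null when the file is missing or malformed.
function readJson(file) {
  try {
    return JSON.parse(fs.readFileSync(file, 'utf8'));
  } catch {
    return null;
  }
}

// Assumed config shape: { "packageManager": "pnpm" }.
function fromConfig(file) {
  const name = readJson(file)?.packageManager;
  return KNOWN.includes(name) ? name : null;
}

// Assumed availability probe: does `<name> --version` exit with status 0?
function isOnPath(name) {
  return spawnSync(`${name} --version`, { shell: true, stdio: 'ignore' }).status === 0;
}

// Hypothetical detector: walks the six sources in priority order.
function detectPackageManager({
  projectDir = process.cwd(),
  env = process.env,
  homeDir = os.homedir(),
  isInstalled = isOnPath,
} = {}) {
  const steps = [
    ['environment variable', () =>
      (KNOWN.includes(env.CLAUDE_PACKAGE_MANAGER) ? env.CLAUDE_PACKAGE_MANAGER : null)],
    ['project config', () =>
      fromConfig(path.join(projectDir, '.claude', 'package-manager.json'))],
    ['package.json', () => {
      // The field looks like "pnpm@9.1.0"; keep only the name part.
      const field = readJson(path.join(projectDir, 'package.json'))?.packageManager;
      const name = typeof field === 'string' ? field.split('@')[0] : null;
      return KNOWN.includes(name) ? name : null;
    }],
    ['lock file', () =>
      LOCK_FILES.find(([file]) => fs.existsSync(path.join(projectDir, file)))?.[1] ?? null],
    ['global config', () =>
      fromConfig(path.join(homeDir, '.claude', 'package-manager.json'))],
    ['fallback', () => KNOWN.find((name) => isInstalled(name)) ?? null],
  ];
  for (const [source, probe] of steps) {
    const name = probe();
    if (name) return { name, source };
  }
  return null; // nothing configured and nothing installed
}
```

Calling `detectPackageManager()` with no arguments inspects the current working directory and returns an object such as `{ name, source }`, where `source` records which rule matched. Passing `env`, `homeDir`, and `isInstalled` explicitly keeps the sketch testable without touching the real environment.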
To set your preferred package manager:
```bash
# Via environment variable
export CLAUDE_PACKAGE_MANAGER=pnpm
# Via global config
node scripts/setup-package-manager.js --global pnpm
# Via project config
node scripts/setup-package-manager.js --project bun
# Detect current setting
node scripts/setup-package-manager.js --detect
```
Or use the /setup-pm command in Claude Code.
---
What's Inside
This repo is a Claude Code plugin - install it directly or copy components manually.
```
everything-claude-code/
|-- .claude-plugin/ # Plugin and marketplace manifests
| |-- plugin.json # Plugin metadata and component paths
| |-- marketplace.json # Marketplace catalog for /plugin marketplace add
|
|-- agents/ # Specialized subagents for delegation
| |-- planner.md # Feature implementation planning
| |-- architect.md # System design decisions
| |-- tdd-guide.md # Test-driven development
| |-- code-reviewer.md # Quality and security review
| |-- security-reviewer.md # Vulnerability analysis
| |-- build-error-resolver.md
| |-- e2e-runner.md # Playwright E2E testing
| |-- refactor-cleaner
```