advanced-evaluation

Skill

from flora131/atomic

What it does

An Atomic SDK skill for advanced evaluation techniques, including pairwise comparison, position-bias mitigation, and evaluation-pipeline construction.

Same repository

flora131/atomic (28 items)

Installation

Vibe Index Install (installs to .claude/skills/)

npx vibeindex add flora131/atomic --skill advanced-evaluation

skills.sh Install (⚠ installs to .agents/skills/)

npx skills add flora131/atomic --skill advanced-evaluation

Manual Install: copy the SKILL.md content and save it to the path below

~/.claude/skills/advanced-evaluation/SKILL.md

163 installs

Added Apr 13, 2026

More from this repository (10)

research-codebase (Skill)

Part of the Atomic agent framework, this skill enables deep codebase research by dispatching specialized sub-agents: a codebase-locator finds relevant files, a codebase-analyzer reads implementations, and an online-researcher queries external docs.

explain-code (Skill)

Part of the Atomic agent framework, this skill explains code functionality in detail using DeepWiki to provide comprehensive code understanding and documentation.

prompt-engineer (Skill)

Part of the Atomic agent framework, this skill helps create, improve, and optimize prompts using best practices, within Atomic's multi-agent architecture that dispatches specialized sub-agents for focused task execution.

context-compression (Skill)

An Atomic SDK skill for summarizing transcripts at session boundaries while preserving actionable information to manage long-running agent sessions.

skill-creator (Skill)

A meta skill from the Atomic multi-agent harness that enables creating, modifying, evaluating, and benchmarking custom agent skills. Auto-invoked when building or iterating on SKILL.md files.

context-optimization (Skill)

An Atomic SDK skill for optimizing LLM context usage through KV-cache optimization, observation masking, and context budgeting techniques.

find-skills (Skill)

A built-in Atomic skill that discovers and installs agent skills from the community. Part of the Atomic multi-agent harness that orchestrates Claude Code, OpenCode, and GitHub Copilot CLI.

tool-design (Skill)

An Atomic SDK skill for designing clear tool contracts and reducing agent-tool friction in AI coding agent workflows.

project-development (Skill)

An Atomic SDK skill for validating task-model fit before building and estimating costs for AI agent development projects.

context-fundamentals (Skill)

An Atomic SDK skill covering how LLM context windows work, including attention mechanics and progressive disclosure techniques for effective context management.