๐ŸŽฏ

advanced-evaluation

๐ŸŽฏSkill

from flora131/atomic

VibeIndex|
What it does
|

An Atomic SDK skill for advanced evaluation techniques including pairwise comparison, position-bias mitigation, and building evaluation pipelines.

๐Ÿ“ฆ

Same repository

flora131/atomic(28 items)

advanced-evaluation

Installation

Vibe Index InstallInstalls to .claude/skills/ - auto-recognized by Claude Code
npx vibeindex add flora131/atomic --skill advanced-evaluation
skills.sh Installโš  Installs to .agents/skills/ - may not be auto-recognized by Claude Code
npx skills add flora131/atomic --skill advanced-evaluation
Manual InstallCopy SKILL.md content and save to the path below
~/.claude/skills/advanced-evaluation/SKILL.md

SKILL.md

100Installs
-
AddedApr 13, 2026

More from this repository10

๐Ÿช
flora131-atomic๐ŸชMarketplace

Opinionated workflows, Ralph Loops, and memory for AI coding agents.

๐ŸŽฏ
explain-code๐ŸŽฏSkill

Part of the Atomic agent framework, this skill explains code functionality in detail using DeepWiki to provide comprehensive code understanding and documentation.

๐ŸŽฏ
research-codebase๐ŸŽฏSkill

Part of the Atomic agent framework, this skill enables deep codebase research by dispatching specialized sub-agents โ€” a codebase-locator finds relevant files, a codebase-analyzer reads implementations, and an online-researcher queries external docs.

๐ŸŽฏ
prompt-engineer๐ŸŽฏSkill

Part of the Atomic agent framework, this skill helps create, improve, and optimize prompts using best practices, within Atomic's multi-agent architecture that dispatches specialized sub-agents for focused task execution.

๐ŸŽฏ
context-optimization๐ŸŽฏSkill

An Atomic SDK skill for optimizing LLM context usage through KV-cache optimization, observation masking, and context budgeting techniques.

๐ŸŽฏ
find-skills๐ŸŽฏSkill

A built-in Atomic skill that discovers and installs agent skills from the community. Part of the Atomic multi-agent harness that orchestrates Claude Code, OpenCode, and GitHub Copilot CLI.

๐ŸŽฏ
skill-creator๐ŸŽฏSkill

A meta skill from the Atomic multi-agent harness that enables creating, modifying, evaluating, and benchmarking custom agent skills. Auto-invoked when building or iterating on SKILL.md files.

๐ŸŽฏ
context-fundamentals๐ŸŽฏSkill

An Atomic SDK skill covering how LLM context windows work, including attention mechanics and progressive disclosure techniques for effective context management.

๐ŸŽฏ
overdrive๐ŸŽฏSkill

An Atomic SDK design skill that pushes UI designs to their creative limits, maximizing visual impact and boldness beyond conventional boundaries.

๐ŸŽฏ
filesystem-context๐ŸŽฏSkill

An Atomic SDK skill for offloading context to the filesystem and enabling file-based agent coordination to work within LLM context limits.