Vibe IndexVibe Index
PicksNewSkillsMCP ServersMarketplacesPlugins
⚑
Vibe Index Skill
Project recommendations
πŸ”Œ
Vibe Index MCP
Search from AI tools
πŸ”‘
Vibe Index API
Developer interface
Ranking
Vibe IndexVibe Index

Everything you need for vibe coding. Real-time updates on skills, plugins, MCP servers, and marketplaces.

Resources

  • Skills
  • MCP Servers
  • Marketplaces
  • Plugins

Support

  • About Us
  • Contact Us
  • Feedback
  • Sync Activity

Legal

  • Privacy Policy
  • Terms of Service

Β© 2026 Vibe Index. All rights reserved. Operated by JoLab

πŸ›‘οΈ Security scanning active on all resources

Vibe Index is an independent, community-driven directory. Not affiliated with, endorsed by, or sponsored by Anthropic, Vercel, Microsoft, or any other company whose tools are listed here. All product names and trademarks are the property of their respective owners.

agent-evaluation

22 results for tag "agent-evaluation"

Loading...
🎯

Skills

22
agent-evaluationsupercent-io/skills-template10.1K0

Evaluates AI agent performance, capabilities, and effectiveness through systematic assessment and scoring methodologies.

agent-evaluation
agent-evaluationsickn33/antigravity-awesome-skills3130

A collection of 255+ universal agentic skills for AI coding assistants including Claude Code, Gemini CLI, Codex CLI, Antigravity IDE, GitHub Copilot, and Cursor.

agent-evaluation
agent-evaluationdavila7/claude-code-templates3120

Evaluates AI agent performance by systematically testing and scoring their capabilities across multiple predefined metrics and scenarios.

agent-evaluation
agent-evaluationmlflow/skills630

Agent evaluation skill using MLflow for systematically evaluating and improving LLM agent output quality. Covers tool selection accuracy, answer quality, cost reduction, and end-to-end evaluation with datasets, scorers, and tracing.

agent-evaluation
agent-evaluationeyadsibai/ltk350

Agent evaluation skill from ltk, a personal development toolkit for Claude Code with 35 skills, 16 commands, 7 agents, 4 hooks, and 3 MCP servers. Provides extensible, per-project tooling with auto-loading domain knowledge.

agent-evaluation
agent-evaluationneolabhq/context-engineering-kit200

Evaluates AI agent performance with structured assessment frameworks, benchmarks, and improvement tracking for context engineering workflows.

agent-evaluation
agent-evaluationoimiragieo/agent-studio90

Evaluates AI agent performance across multiple dimensions, generating comprehensive metrics and insights for benchmarking and improvement strategies.

agent-evaluation
agent-evaluationakillness/skills-templateβ˜… 18agent-evaluationguia-matthieu/clawfu-skillsβ˜… 17agent-evaluationomer-metin/skills-for-antigravityβ˜… 11agent-evaluationb-step62/skillsβ˜… 8agent-evaluationzpankz/mcp-skillsetβ˜… 7agent-evaluationxfstudio/skills
β˜… 6
agent-evaluationjarmen423/skillsβ˜… 4
agent-evaluationsebas-aikon-intelligence/antigravity-awesome-skillsβ˜… 4
agent-evaluationglennguilloux/context-engineering-kitβ˜… 3
agent-evaluationautomindtechnologie-jpg/ultimate-skill.mdβ˜… 3
agent-evaluationhainamchung/agent-assistantβ˜… 2
agent-evaluationtavi-agency/antigravity-awesome-skillsβ˜… 2
agent-evaluationdokhacgiakhoa/antigravity-ideβ˜… 2
agent-evaluationschoi80/antigravity-awesome-skillsβ˜… 2
agent-evaluationclaude-code-community-ireland/claude-code-resourcesβ˜… 1