Vibe IndexVibe Index
SkillsMCP ServersMarketplacesPluginsPicks
πŸ’»
Vix Code Beta
Integrated AI Coding Tool Β· Beta
⚑
Vibe Index Skill
Project recommendations
πŸ”Œ
Vibe Index MCP
Search from AI tools
πŸ”‘
Vibe Index API
Developer interface
Ranking
Vibe IndexVibe Index

Everything you need for vibe coding. Real-time updates on skills, plugins, MCP servers, and marketplaces.

Resources

  • Skills
  • MCP Servers
  • Marketplaces
  • Plugins

Support

  • About Us
  • Intro Seminar
  • Contact Us
  • Sync Activity

Legal

  • Privacy Policy
  • Terms of Service

Β© 2026 Vibe Index. All rights reserved. Operated by JoLab

πŸ›‘οΈ Security scanning active on all resources

Vibe Index is an independent, community-driven directory. Not affiliated with, endorsed by, or sponsored by Anthropic, Vercel, Microsoft, or any other company whose tools are listed here. All product names and trademarks are the property of their respective owners.

agent-evaluation

23 results for tag "agent-evaluation"

Loading...
🎯

Skills

23
agent-evaluationsupercent-io/skills-template10.1K0
agent-evaluation
agent-evaluationsickn33/antigravity-awesome-skills5560

A collection of 255+ universal agentic skills for AI coding assistants including Claude Code, Gemini CLI, Codex CLI, Antigravity IDE, GitHub Copilot, and Cursor.

agent-evaluation
agent-evaluationdavila7/claude-code-templates4510

Evaluates AI agent performance by systematically testing and scoring their capabilities across multiple predefined metrics and scenarios.

agent-evaluation
agent-evaluationmlflow/skills2220

Agent evaluation skill using MLflow for systematically evaluating and improving LLM agent output quality. Covers tool selection accuracy, answer quality, cost reduction, and end-to-end evaluation with datasets, scorers, and tracing.

agent-evaluation
agent-evaluationguia-matthieu/clawfu-skills800

An agent evaluation skill from the ClawFu collection of 175 expert marketing methodologies, providing structured frameworks for assessing AI agent quality, performance, and outputs using named expert methodologies.

agent-evaluation
agent-evaluationneolabhq/context-engineering-kit690

Evaluates AI agent performance with structured assessment frameworks, benchmarks, and improvement tracking for context engineering workflows.

agent-evaluation
agent-evaluationeyadsibai/ltk470

Agent evaluation skill from ltk, a personal development toolkit for Claude Code with 35 skills, 16 commands, 7 agents, 4 hooks, and 3 MCP servers. Provides extensible, per-project tooling with auto-loading domain knowledge.

agent-evaluation
agent-evaluationoimiragieo/agent-studio280

Evaluates AI agent performance across multiple dimensions, generating comprehensive metrics and insights for benchmarking and improvement strategies.

agent-evaluation
agent-evaluationakillness/skills-templateβ˜… 18agent-evaluationomer-metin/skills-for-antigravityβ˜… 15agent-evaluationakillness/oh-my-godsβ˜… 12agent-evaluationb-step62/skillsβ˜… 9agent-evaluationzpankz/mcp-skillsetβ˜… 7agent-evaluationhainamchung/agent-assistant
β˜… 6
agent-evaluationnotque/claude-code-toolkitβ˜… 6
agent-evaluationxfstudio/skillsβ˜… 6
agent-evaluationjarmen423/skillsβ˜… 5
agent-evaluationsebas-aikon-intelligence/antigravity-awesome-skillsβ˜… 4
agent-evaluationglennguilloux/context-engineering-kitβ˜… 3
agent-evaluationschoi80/antigravity-awesome-skillsβ˜… 2
agent-evaluationtavi-agency/antigravity-awesome-skillsβ˜… 2
agent-evaluationdokhacgiakhoa/antigravity-ideβ˜… 2
agent-evaluationclaude-code-community-ireland/claude-code-resourcesβ˜… 1