Vibe Index

Everything you need for vibe coding. Real-time updates on skills, plugins, MCP servers, and marketplaces.

© 2026 Vibe Index. All rights reserved. Operated by JoLab

πŸ›‘οΈ Security scanning active on all resources

Vibe Index is an independent, community-driven directory. Not affiliated with, endorsed by, or sponsored by Anthropic, Vercel, Microsoft, or any other company whose tools are listed here. All product names and trademarks are the property of their respective owners.

llm-evaluation

12 results for tag "llm-evaluation"


Skills

12
llm-evaluation · wshobson/agents · ★ 2.8K

A production-ready plugin system with 112 AI agents, 146 skills, 16 workflow orchestrators, and 79 development tools organized into 73 focused plugins for Claude Code.

llm-evaluation · sickn33/antigravity-awesome-skills · ★ 85

Evaluates LLM applications systematically using automated metrics, human feedback, and comparative techniques to measure performance and quality.

llm-evaluation · ovachiever/droid-tings · ★ 24

Evaluates LLM performance systematically using automated metrics, human feedback, and benchmarking techniques across various dimensions.

llm-evaluation · phrazzld/claude-config · ★ 23

A skill for LLM prompt testing, evaluation, and CI/CD quality gates using Promptfoo. Covers prompt regression testing, security testing (red teaming, jailbreaks), model performance comparison, and building evaluation suites for RAG, factuality, or safety.

llm-evaluation · rmyndharis/antigravity-skills · ★ 16
llm-evaluation · yonatangross/orchestkit · ★ 10
llm-evaluation · hermeticormus/libreuiux-claude-code · ★ 6
llm-evaluation · microck/ordinary-claude-skills · ★ 5
llm-evaluation · yonatangross/skillforge-claude-plugin · ★ 4
llm-evaluation · ckorhonen/claude-skills · ★ 3
llm-evaluation · ravinani02/opencode-agent-skills · ★ 2
llm-evaluation · lifangda/claude-plugins · ★ 2