9 results for tag "advanced-evaluation"
LLM-as-a-Judge evaluation skill covering direct scoring, pairwise comparison, rubric generation, and bias mitigation for building automated AI output quality assessment systems
Advanced evaluation skill from the Antigravity Skills collection, providing sophisticated assessment frameworks for code and architecture quality.