phoenix-evals
🎯Skillfrom github/awesome-copilot
Builds and validates code-first and LLM-as-judge evaluators for AI/LLM applications using Phoenix, with reference workflows for error analysis, axial coding, RAG faithfulness, batch DataFrame evaluation, and experiment runs. Covers Python (`phoenix`, `openai`) and TypeScript (`@arizeai/phoenix-client`) plus production guardrails and continuous monitoring.
Same repository
github/awesome-copilot(363 items)
Installation
npx vibeindex add github/awesome-copilot --skill phoenix-evalsnpx skills add github/awesome-copilot --skill phoenix-evals~/.claude/skills/phoenix-evals/SKILL.mdSKILL.md
More from this repository10
A community-curated collection of custom agents, prompts, instructions, and skills to enhance GitHub Copilot across different domains, languages, and use cases.
Generates Excalidraw diagrams from natural language descriptions, creating visual representations of processes, systems, relationships, and ideas.
A community-curated collection of custom agents, prompts, instructions, and skills to enhance GitHub Copilot across different domains, languages, and use cases.
Expert technical writer guided by the Diataxis framework, creating tutorials, how-to guides, reference docs, and explanation pages for software documentation.
Generates comprehensive Product Requirements Documents (PRDs) by systematically analyzing requirements and crafting detailed technical specifications
A community-curated collection of custom agents, prompts, instructions, and skills to enhance GitHub Copilot across different domains, languages, and use cases.
Provides Spring Boot best practices covering project setup, dependency injection, REST API design, data access with JPA, security configuration, and testing strategies.
Creates optimized multi-stage Dockerfiles for any language or framework, separating builder and runtime stages to produce smaller, more secure container images.
Creates README.md files for projects.
Generates Playwright end-to-end tests from user-described scenarios using Playwright MCP for browser automation.