querying-mlflow-metrics
Skill from mlflow/skills
Fetches and analyzes aggregated MLflow metrics like token usage, performance, and cost across AI agent runs and experiments.
Part of
mlflow/skills (6 items)
Installation
npx skills add mlflow/skills
git clone https://github.com/mlflow/mlflow-skills.git
git clone https://github.com/mlflow/mlflow-skills.git .skills/mlflow
Skill Details
More from this repository (5)
Systematically evaluates and improves LLM agent performance using MLflow, tracking output quality, tool selection, and response accuracy.
Debugs and analyzes MLflow traces by examining spans, identifying issues, and correlating trace details with the underlying codebase to help diagnose performance or execution problems.
Analyzes and reconstructs the details of an MLflow session, helping diagnose issues and understand the flow of interactions during a machine learning experiment or AI agent workflow.
Guides new users through setting up MLflow tracing and evaluation in their AI project, providing step-by-step configuration and best practices for initial MLflow integration.
Automatically adds MLflow tracing instrumentation to Python and TypeScript code for AI applications, supporting popular LLM frameworks and libraries.