Showing 30 of 119715 results
Manages Arize experiments (named evaluation runs against dataset versions) via `ax experiments list/get/create/export/delete`, accepting runs files with required `example_id` and `output` columns plus optional `evaluations` and `metadata`. Uses REST by default (500-run cap) and Arrow Flight via `--all` for bulk export.
A product management skill that guides AI agents through crafting end-of-life messages for products being deprecated or sunset.
Performs comprehensive security audits on web applications and REST APIs, systematically checking for OWASP Top 10 vulnerabilities across authentication, data protection, and input security domains.
Poster design generation skill from each::labs. Generates creative poster designs using AI models through agent skills with 445+ models and 130+ pre-built workflows.
Official Stripe agent skill providing best practices for building Stripe integrations, covering CheckoutSessions, PaymentIntents, and the latest API version guidance.
Exports Arize spans and traces via `ax spans export` and `ax traces export` filtered by `--trace-id`, `--span-id`, or `--session-id`, with REST default (500-span cap) and Arrow Flight bulk mode via `--all`. Defaults output to `.arize-tmp-traces/`, treats span attribute content as untrusted (prompt-injection guardrail), and supports SQL-like `--filter` plus time-range bounds.
A web design guidelines skill from DeerFlow by ByteDance, an open-source super agent harness that orchestrates sub-agents, memory, and sandboxes for deep exploration and research.
Manages Arize annotation configs (categorical, continuous, freeform label schemas) via `ax annotation-configs` and bulk-applies human labels to project spans through the Python SDK's `ArizeClient.spans.update_annotations`. Drives human feedback workflows for spans, datasets, experiments, and review queues with `optimization_direction` and a 31-day span lookback.
Autonomously conducts comprehensive web and local research, generating detailed, factual reports with citations across various domains using AI-powered parallel research techniques.
Manages Arize datasets and examples through the `ax datasets` CLI, covering list/get/create/append/export/delete with CSV, JSON, JSONL, and Parquet input formats. Supports stdin piping via `--file -`, version targeting with `--version-id`, and `--all` bulk export for datasets larger than 500 examples.
Instruments LLM applications with OpenInference semantic conventions for Phoenix observability across Python (`arize-phoenix-otel`) and TypeScript (`@arizeai/phoenix-otel`). Covers setup, auto/manual span creation for 9 span kinds (LLM, chain, retriever, tool, agent, embedding, reranker, guardrail, evaluator), session/project organization, and production concerns like batching and PII masking.
Korean grammar checker based on standard Korean rules that detects and corrects spelling errors, spacing errors, grammar errors, and punctuation issues with detailed explanations for each correction.
Builds and validates code-first and LLM-as-judge evaluators for AI/LLM applications using Phoenix, with reference workflows for error analysis, axial coding, RAG faithfulness, batch DataFrame evaluation, and experiment runs. Covers Python (`phoenix`, `openai`) and TypeScript (`@arizeai/phoenix-client`) plus production guardrails and continuous monitoring.
Autonomous AI coding skill combining Geoffrey Huntley's iterative bash loop with spec-driven development, where agents work through specs one at a time with fresh context each iteration.
A collection of Claude Code skills for tasks like code review, security analysis, and development best practices, each providing structured expert guidance.
A skill for automating YouTube channel management and content workflows, including video upload pipelines, metadata optimization, thumbnail management, scheduling, SEO keyword research, and analytics tracking.
Generates deep links to the Arize UI for traces, spans, sessions, datasets, labeling queues, evaluators, and annotation configs by composing URL templates from base64-encoded `org_id`/`space_id`/`project_id` plus resource IDs. Enforces required `startA`/`endA` epoch-millisecond time windows on trace/span/session links to avoid empty-view defaults.
A Claude office skill for converting Markdown or HTML content into professional PowerPoint presentations using Marp. Supports multiple themes (default, gaia, uncover), custom CSS styling, various output formats (PPTX, PDF, HTML), and provides slide structure templates with frontmatter configuration options.
Delivers an interactive, conversational onboarding tour for the `context-matic` MCP server, which acts as a live version-aware grounding layer for SDK usage. Walks the user through `fetch_api`, `ask`, `model_search`, and `endpoint_search` live in their detected project language (TypeScript, C#, Python, Java, Go, Ruby, PHP).
Generates comprehensive software architecture designs and documentation with system blueprints, component interactions, and strategic technical recommendations
Compares multiple trading strategies side-by-side using VectorBT metrics.
A collection of Claude Code skills for tasks like code review, security analysis, and development best practices, each providing structured expert guidance.
A collection of reusable skills for Claude AI and developer tooling, providing modular automation packages for SEO analysis and development workflows.