🔌

hugging-face-datasets

🔌Plugin

huggingface/skills

VibeIndex|
What it does
|

Official Hugging Face skills defining AI/ML tasks like dataset creation, model training, and evaluation. Interoperable with Claude Code, OpenAI Codex, Gemini CLI, and Cursor using the standardized Agent Skill format.

Overview

Hugging-face-datasets is a skill from the Hugging Face Skills collection that provides AI coding agents with structured guidance for creating and managing datasets on the Hugging Face Hub. It follows the standardized Agent Skill format and is compatible with Claude Code, OpenAI Codex, Google Gemini CLI, and Cursor.

Key Features

  • Dataset Creation Guidance - Structured instructions for creating, formatting, and publishing datasets to the Hugging Face Hub
  • Cross-Agent Compatibility - Works with Claude Code (via plugin marketplace), OpenAI Codex (via AGENTS.md), and Gemini CLI (via gemini-extension.json)
  • Plugin-Based Installation - Install via /plugin marketplace add huggingface/skills then /plugin install hugging-face-datasets@huggingface/skills
  • Standardized Format - Follows the Agent Skill open format with SKILL.md files containing YAML frontmatter and agent guidance

Who is this for?

This skill is ideal for AI/ML practitioners and data engineers who need to create, curate, and publish datasets on the Hugging Face Hub using AI coding agents. It is particularly useful for teams automating their data pipeline workflows or standardizing dataset creation processes within their AI development tools.

🏪

Part of

huggingface-skills

Installation

Add marketplace in Claude Code:
/plugin marketplace add huggingface/skills
Step 2. Install plugin:
/plugin install hugging-face-datasets@huggingface-skills
1,195
-
Last UpdatedFeb 6, 2026

More from this repository10

🏪
huggingface-skills🏪Marketplace

Agent Skills for AI/ML tasks including dataset creation, model training, evaluation, and research paper publishing on Hugging Face Hub

🔌
hugging-face-jobs🔌Plugin

Official Hugging Face skills defining AI/ML tasks like dataset creation, model training, and evaluation. Interoperable with Claude Code, OpenAI Codex, Gemini CLI, and Cursor using the standardized Agent Skill format.

🔌
hugging-face-evaluation🔌Plugin

Official Hugging Face skills defining AI/ML tasks like dataset creation, model training, and evaluation. Interoperable with Claude Code, OpenAI Codex, Gemini CLI, and Cursor using the standardized Agent Skill format.

🎯
hf-cli🎯Skill

A skill for the Hugging Face Hub CLI (hf), enabling downloading, uploading, and managing models, datasets, spaces, buckets, repos, papers, and jobs on the Hugging Face Hub. Replaces the deprecated huggingface-cli.

🔌
hugging-face-tool-builder🔌Plugin

Official Hugging Face skills defining AI/ML tasks like dataset creation, model training, and evaluation. Interoperable with Claude Code, OpenAI Codex, Gemini CLI, and Cursor using the standardized Agent Skill format.

🎯
huggingface-gradio🎯Skill

Official Hugging Face Gradio skill for building Python web UIs and ML demos — covers the `Interface` high-level wrapper, `Blocks` low-level layout with explicit event wiring, `ChatInterface` for chatbots, core component signatures (Textbox, etc.), streaming inputs/outputs, custom CSS/JS, and sharing your app.

🎯
huggingface-datasets🎯Skill

Explores, queries, and extracts data from any Hugging Face dataset via the Dataset Viewer REST API and npx tooling with zero Python dependencies — split/config discovery, row pagination, text search, filtering, SQL via parquetlens, and CLI upload.

🎯
huggingface-llm-trainer🎯Skill

Train or fine-tune language models using TRL on Hugging Face Jobs infrastructure. Covers SFT, DPO, GRPO and reward modeling training methods, plus GGUF conversion for local deployment.

🎯
huggingface-papers🎯Skill

Hugging Face agent skills for AI/ML tasks like dataset creation, model training, and evaluation, interoperable with major coding agent tools.

🎯
transformers-js🎯Skill

Part of the official Hugging Face Skills collection, this skill teaches AI coding agents how to work with Transformers.js for running machine learning models directly in the browser and Node.js environments.