🎯

simpo-training

🎯Skill

from zechenzhangagi/ai-research-skills

What it does

Part of the AI Research Skills library by Orchestra Research, providing guidance for SimPO (Simple Preference Optimization) training workflows for aligning language models without a reference model.

📦

Same repository

zechenzhangagi/ai-research-skills(97 items)

simpo-training

Installation

Vibe Index InstallInstalls to .claude/skills/

npx vibeindex add zechenzhangagi/ai-research-skills --skill simpo-training

skills.sh Install⚠ Installs to .agents/skills/

npx skills add zechenzhangagi/ai-research-skills --skill simpo-training

Manual InstallCopy SKILL.md content and save to the path below

~/.claude/skills/simpo-training/SKILL.md

SKILL.md

89Installs

AddedFeb 4, 2026

View on GitHub Back to Skills

More from this repository10

🎯

ml-paper-writing🎯Skill

Comprehensive open-source library of 77 AI research engineering skills across 20 categories including model architecture, fine-tuning, distributed training, RAG, multimodal, and ML paper writing.

🎯

crewai-multi-agent🎯Skill

A skill from the AI Research Engineering Skills library, a comprehensive open-source collection of 82 skills across 20 categories that provide engineering capabilities for AI agents to conduct research experiments, including training, evaluation, deployment, and agent building.

🎯

brainstorming-research-ideas🎯Skill

A Claude Code skill providing structured ideation frameworks with 10 complementary lenses for discovering high-impact AI research directions, part of the Orchestra Research AI research engineering skills library with 85+ skills across 21 categories.

🎯

qdrant-vector-search🎯Skill

An AI research engineering skill for Qdrant, a high-performance Rust-powered vector search engine with hybrid search and filtering capabilities, part of the 83-skill AI Research Skills library covering RAG and retrieval workflows.

🎯

peft-fine-tuning🎯Skill

A comprehensive open-source library of 82 AI research engineering skills across 20 categories (fine-tuning, distributed training, RAG, multimodal, safety, and more) that enable coding agents to conduct AI research experiments.

🎯

huggingface-tokenizers🎯Skill

A skill from the AI Research Skills library, a comprehensive open-source collection of 87 skills across 22 categories that enable AI agents to autonomously conduct AI research from idea generation through experiment execution to paper writing.

🎯

optimizing-attention-flash🎯Skill

AI research engineering skill for optimizing attention mechanisms with Flash Attention techniques, part of a comprehensive 83-skill library covering model architecture, training, and deployment.

🎯

speculative-decoding🎯Skill

A speculative decoding skill from the AI Research Engineering Skills library, part of a collection of 82 research skills covering emerging AI techniques. Provides expert-level guidance on speculative decoding for accelerating LLM inference.

🎯

knowledge-distillation🎯Skill

Part of the AI Research Engineering Skills Library, a comprehensive collection of 83 skills across 20 categories covering the full AI research lifecycle. Provides expert-level guidance for knowledge distillation with real code examples and production-ready workflows for frameworks like Megatron-LM, vLLM, and TRL.

🎯

long-context🎯Skill

AI research skill for long-context model techniques and optimizations, part of an 87-skill library enabling autonomous AI research across 22 categories from idea to paper.