🎯

slime-rl-training

🎯Skill

from orchestra-research/ai-research-skills

VibeIndex|
What it does

Trains reinforcement learning agents using slime-based simulation environments with advanced exploration and policy optimization techniques

📦

Part of

orchestra-research/ai-research-skills(84 items)

slime-rl-training

Installation

npxRun with npx
npx @orchestra-research/ai-research-skills
npxRun with npx
npx @orchestra-research/ai-research-skills list # View installed skills
npxRun with npx
npx @orchestra-research/ai-research-skills update # Update installed skills
Add MarketplaceAdd marketplace to Claude Code
/plugin marketplace add orchestra-research/AI-research-SKILLs
Install PluginInstall plugin from marketplace
/plugin install fine-tuning@ai-research-skills # Axolotl, LLaMA-Factory, PEFT, Unsloth

+ 4 more commands

📖 Extracted from docs: orchestra-research/ai-research-skills
1Installs
-
AddedFeb 7, 2026

More from this repository10

🏪
orchestra-research-ai-research-skills🏪Marketplace

Streamlines AI research workflows by providing curated Claude skills for data analysis, literature review, experiment design, and research paper generation.

🎯
ml-paper-writing🎯Skill

Assists AI researchers in drafting, structuring, and generating machine learning research papers with academic writing best practices and technical precision.

🎯
ray-data🎯Skill

Streamlines distributed data processing and machine learning workflows using Ray's scalable data loading and transformation capabilities.

🎯
ray-train🎯Skill

Streamlines distributed machine learning training using Ray, optimizing hyperparameter tuning and parallel model execution across compute clusters.

🎯
speculative-decoding🎯Skill

Accelerates AI model inference by predicting and parallel processing multiple token candidates to reduce latency and improve generation speed.

🎯
outlines🎯Skill

Generates structured document outlines and hierarchical content maps with customizable depth and formatting for research and writing workflows

🎯
nemo-curator🎯Skill

Automates scientific literature curation by extracting, summarizing, and organizing research papers from marine biology and oceanography domains

🎯
lambda-labs-gpu-cloud🎯Skill

Provision and manage GPU cloud instances on Lambda Labs for machine learning and AI research workloads with automated setup and configuration.

🎯
openrlhf-training🎯Skill

Trains large language models using open-source reinforcement learning from human feedback (RLHF) techniques with advanced alignment and reward modeling

🎯
sparse-autoencoder-training🎯Skill

Trains sparse autoencoders on neural network activations to discover interpretable features and understand internal representations