
slime-rl-training

Skill

from zechenzhangagi/ai-research-skills

What it does

Provides guidance for post-training large language models with reinforcement learning using slime, a framework that pairs Megatron-LM training with SGLang rollout generation.


Part of

zechenzhangagi/ai-research-skills (83 items)


Installation

Run with npx:
npx @orchestra-research/ai-research-skills
npx @orchestra-research/ai-research-skills list # View installed skills
npx @orchestra-research/ai-research-skills update # Update installed skills
Add marketplace to Claude Code:
/plugin marketplace add orchestra-research/AI-research-SKILLs
Install plugin from marketplace:
/plugin install fine-tuning@ai-research-skills # Axolotl, LLaMA-Factory, PEFT, Unsloth


Extracted from docs: zechenzhangagi/ai-research-skills
Installs: 1
Added: Feb 6, 2026

More from this repository (10)

ml-paper-writing (Skill)

ml-paper-writing skill from zechenzhangagi/ai-research-skills

qdrant-vector-search (Skill)

Performs efficient semantic vector search and similarity matching using Qdrant's vector database for AI-powered information retrieval and recommendation systems.

skypilot-multi-cloud-orchestration (Skill)

Deploys and manages AI workloads across multiple cloud providers using SkyPilot's infrastructure automation.

langchain (Skill)

Enables AI agents to use the LangChain framework to build language model workflows that chain together models, components, and tools.

crewai-multi-agent (Skill)

Orchestrates multi-agent collaboration using CrewAI for complex research tasks, enabling specialized AI agents to work together systematically.

blip-2-vision-language (Skill)

Enables AI agents to analyze and understand images by leveraging the BLIP-2 vision-language model for multimodal perception and reasoning tasks.

llama-factory (Skill)

Generates and manages fine-tuning configurations and workflows with LLaMA-Factory, which supports Llama and many other open language models.

weights-and-biases (Skill)

Logs and tracks machine learning experiments, model performance, and hyperparameters using the Weights & Biases platform, with dashboards for visualizing results.

tensorboard (Skill)

Visualizes and analyzes machine learning model performance, training metrics, and computational graphs using TensorBoard's interactive dashboard.

speculative-decoding (Skill)

Accelerates large language model inference by having a small draft model propose several tokens ahead, which the target model then verifies, reducing latency.
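
The speculative-decoding entry above describes a draft-then-verify loop. A minimal sketch of the greedy variant follows, assuming toy deterministic "models": `target_next` and `draft_next` are hypothetical stand-ins invented for illustration, not part of the skill or any library. It shows only that accepted draft tokens always match what the target would have produced on its own; production implementations verify all proposed positions in one batched forward pass and usually sample rather than decode greedily.

```python
from typing import Callable, List

NextToken = Callable[[List[int]], int]  # greedy next-token function


def target_next(seq: List[int]) -> int:
    # Hypothetical "large" model: a deterministic toy rule over token ids 0..6.
    return (seq[-1] * 3 + len(seq)) % 7


def draft_next(seq: List[int]) -> int:
    # Hypothetical "small" model: follows a different rule at some lengths.
    if len(seq) % 4 == 0:
        return (seq[-1] + 1) % 7
    return target_next(seq)


def greedy_decode(model: NextToken, prompt: List[int], n_new: int) -> List[int]:
    # Baseline: one model call per generated token.
    seq = list(prompt)
    for _ in range(n_new):
        seq.append(model(seq))
    return seq


def speculative_decode(target: NextToken, draft: NextToken,
                       prompt: List[int], n_new: int, k: int = 4) -> List[int]:
    seq = list(prompt)
    goal = len(prompt) + n_new
    while len(seq) < goal:
        # 1. The draft model proposes up to k tokens autoregressively.
        proposal: List[int] = []
        for _ in range(min(k, goal - len(seq))):
            proposal.append(draft(seq + proposal))
        # 2. The target checks each proposed position
        #    (a real system does this in a single batched pass).
        for i, tok in enumerate(proposal):
            expected = target(seq + proposal[:i])
            if tok != expected:
                # 3. On the first mismatch, keep the agreed prefix plus the
                #    target's own token and discard the rest of the proposal.
                seq.extend(proposal[:i])
                seq.append(expected)
                break
        else:
            # Every proposed token matched the target.
            seq.extend(proposal)
    return seq


if __name__ == "__main__":
    prompt = [1, 2]
    fast = speculative_decode(target_next, draft_next, prompt, n_new=10)
    slow = greedy_decode(target_next, prompt, n_new=10)
    print(fast == slow)
```

Because every kept token is either confirmed or supplied by the target, the output is identical to plain greedy decoding with the target alone; the speedup in practice comes from verifying several draft tokens per target pass.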