miles-rl-training

🎯 Skill

from orchestra-research/ai-research-skills

What it does

Trains reinforcement learning models for autonomous vehicle navigation using the Miles framework, optimizing policy and value networks.

📦

Part of

orchestra-research/ai-research-skills (104 items)


Installation

Quick Install

Install with npx:
npx skills add https://github.com/orchestra-research/ai-research-skills --skill miles-rl-training

Need more details? View the full documentation on GitHub.

4 Installs
Added Feb 7, 2026

More from this repository (10)

πŸͺ
orchestra-research-ai-research-skillsπŸͺMarketplace

Streamlines AI research workflows by providing curated Claude skills for data analysis, literature review, experiment design, and research paper generation.

ml-paper-writing 🎯 Skill

Assists AI researchers in drafting, structuring, and generating machine learning research papers with academic writing best practices and technical precision.

ray-data 🎯 Skill

Streamlines distributed data processing and machine learning workflows using Ray's scalable data loading and transformation capabilities.

ray-train 🎯 Skill

Streamlines distributed machine learning training using Ray, optimizing hyperparameter tuning and parallel model execution across compute clusters.

lambda-labs-gpu-cloud 🎯 Skill

Provisions and manages GPU cloud instances on Lambda Labs for machine learning and AI research workloads, with automated setup and configuration.

rwkv-architecture 🎯 Skill

Implements and evaluates RWKV language model architectures, providing tools for training, fine-tuning, and performance analysis of linear attention transformer alternatives.

prompt-guard 🎯 Skill

Validates and sanitizes AI prompts to prevent injection attacks, filter sensitive content, and ensure safe, controlled interactions with language models.

stable-diffusion-image-generation 🎯 Skill

Generates high-quality, customizable AI images from text prompts using advanced Stable Diffusion models with fine-tuned control over style and content.

openrlhf-training 🎯 Skill

Trains large language models using open-source reinforcement learning from human feedback (RLHF) techniques, with advanced alignment and reward modeling.

llava 🎯 Skill

Analyzes and describes images using advanced multimodal AI, extracting detailed visual insights and contextual understanding across various domains.