miles-rl-training
π―Skillfrom orchestra-research/ai-research-skills
Trains reinforcement learning models for autonomous vehicle navigation using Miles framework, optimizing policy and value networks
Part of
orchestra-research/ai-research-skills(104 items)
Installation
npx skills add https://github.com/orchestra-research/ai-research-skills --skill miles-rl-trainingNeed more details? View full documentation on GitHub β
More from this repository10
Streamlines AI research workflows by providing curated Claude skills for data analysis, literature review, experiment design, and research paper generation.
Assists AI researchers in drafting, structuring, and generating machine learning research papers with academic writing best practices and technical precision.
Streamlines distributed data processing and machine learning workflows using Ray's scalable data loading and transformation capabilities.
Streamlines distributed machine learning training using Ray, optimizing hyperparameter tuning and parallel model execution across compute clusters.
Provision and manage GPU cloud instances on Lambda Labs for machine learning and AI research workloads with automated setup and configuration.
Implements and evaluates RWKV language model architectures, providing tools for training, fine-tuning, and performance analysis of linear attention transformer alternatives.
Validates and sanitizes AI prompts to prevent injection attacks, filter sensitive content, and ensure safe, controlled interactions with language models.
Generates high-quality, customizable AI images from text prompts using advanced Stable Diffusion models with fine-tuned control over style and content.
Trains large language models using open-source reinforcement learning from human feedback (RLHF) techniques with advanced alignment and reward modeling
Analyze and describe images using advanced multimodal AI, extracting detailed visual insights and contextual understanding across various domains.