evaluating-code-models
Skill from orchestra-research/ai-research-skills
Systematically assess machine learning code generation models by benchmarking performance, identifying strengths/weaknesses, and generating comparative metrics.
Part of orchestra-research/ai-research-skills (84 items)
Installation
npx @orchestra-research/ai-research-skills
npx @orchestra-research/ai-research-skills list # View installed skills
npx @orchestra-research/ai-research-skills update # Update installed skills
/plugin marketplace add orchestra-research/AI-research-SKILLs
/plugin install fine-tuning@ai-research-skills # Axolotl, LLaMA-Factory, PEFT, Unsloth
+ 4 more commands
More from this repository (10)
Streamlines AI research workflows by providing curated Claude skills for data analysis, literature review, experiment design, and research paper generation.
Assists AI researchers in drafting, structuring, and generating machine learning research papers with academic writing best practices and technical precision.
Streamlines distributed data processing and machine learning workflows using Ray's scalable data loading and transformation capabilities.
Streamlines distributed machine learning training using Ray, optimizing hyperparameter tuning and parallel model execution across compute clusters.
Trains compact language models with minimal compute, enabling efficient text generation and fine-tuning on small datasets using PyTorch and transformer architectures.
Automatically segments and extracts precise object masks from images using Meta AI's advanced computer vision model with high accuracy and flexibility.
Implements and evaluates RWKV language model architectures, providing tools for training, fine-tuning, and performance analysis of linear attention transformer alternatives.
Generates high-quality, customizable audio and music using Meta AI's advanced generative models with fine-tuned control over style and composition.
Compresses and transfers complex machine learning model knowledge into smaller, more efficient neural networks with minimal performance loss.
Validates and sanitizes AI prompts to prevent injection attacks, filter sensitive content, and ensure safe, controlled interactions with language models.