llama-cpp
π―Skillfrom orchestra-research/ai-research-skills
Enables high-performance local inference and quantization for large language models using efficient C/C++ implementations and GGML formats.
Part of
orchestra-research/ai-research-skills(84 items)
Installation
npx @orchestra-research/ai-research-skillsnpx @orchestra-research/ai-research-skills list # View installed skillsnpx @orchestra-research/ai-research-skills update # Update installed skills/plugin marketplace add orchestra-research/AI-research-SKILLs/plugin install fine-tuning@ai-research-skills # Axolotl, LLaMA-Factory, PEFT, Unsloth+ 4 more commands
More from this repository10
Streamlines AI research workflows by providing curated Claude skills for data analysis, literature review, experiment design, and research paper generation.
Assists AI researchers in drafting, structuring, and generating machine learning research papers with academic writing best practices and technical precision.
Streamlines distributed data processing and machine learning workflows using Ray's scalable data loading and transformation capabilities.
Streamlines distributed machine learning training using Ray, optimizing hyperparameter tuning and parallel model execution across compute clusters.
Streamline machine learning experiment tracking, model versioning, and deployment management with comprehensive MLflow integration and best practices.
Efficiently fine-tune large language models using Parameter-Efficient Fine-Tuning (PEFT) techniques with minimal computational resources and memory overhead.
Quantizes large language models using Activation-aware Weight Quantization (AWQ) to reduce model size and improve inference efficiency.
Orchestrates collaborative AI agents using CrewAI to solve complex tasks through dynamic role assignment, task delegation, and intelligent workflow management.
Detect and filter potentially harmful or inappropriate content in AI conversations using advanced safety classification models.
Enables remote neural network interpretation and analysis through advanced visualization, layer probing, and activation tracking techniques.