🎯

llama-cpp

🎯Skill

from orchestra-research/ai-research-skills

What it does

Enables high-performance local inference and quantization for large language models using efficient C/C++ implementations and GGML formats.

📦

Part of

orchestra-research/ai-research-skills(84 items)

llama-cpp

Installation

npxRun with npx

npx @orchestra-research/ai-research-skills

npxRun with npx

npx @orchestra-research/ai-research-skills list # View installed skills

npxRun with npx

npx @orchestra-research/ai-research-skills update # Update installed skills

Add MarketplaceAdd marketplace to Claude Code

/plugin marketplace add orchestra-research/AI-research-SKILLs

Install PluginInstall plugin from marketplace

/plugin install fine-tuning@ai-research-skills # Axolotl, LLaMA-Factory, PEFT, Unsloth

+ 4 more commands

📖 Extracted from docs: orchestra-research/ai-research-skills

Need more details? View full documentation on GitHub →

1Installs

AddedFeb 7, 2026

View on GitHub Back to Skills

More from this repository10

🏪

orchestra-research-ai-research-skills🏪Marketplace

Streamlines AI research workflows by providing curated Claude skills for data analysis, literature review, experiment design, and research paper generation.

🎯

ml-paper-writing🎯Skill

Assists AI researchers in drafting, structuring, and generating machine learning research papers with academic writing best practices and technical precision.

🎯

ray-data🎯Skill

Streamlines distributed data processing and machine learning workflows using Ray's scalable data loading and transformation capabilities.

🎯

ray-train🎯Skill

Streamlines distributed machine learning training using Ray, optimizing hyperparameter tuning and parallel model execution across compute clusters.

🎯

mlflow🎯Skill

Streamline machine learning experiment tracking, model versioning, and deployment management with comprehensive MLflow integration and best practices.

🎯

peft-fine-tuning🎯Skill

Efficiently fine-tune large language models using Parameter-Efficient Fine-Tuning (PEFT) techniques with minimal computational resources and memory overhead.

🎯

awq-quantization🎯Skill

Quantizes large language models using Activation-aware Weight Quantization (AWQ) to reduce model size and improve inference efficiency.

🎯

crewai-multi-agent🎯Skill

Orchestrates collaborative AI agents using CrewAI to solve complex tasks through dynamic role assignment, task delegation, and intelligent workflow management.

🎯

llamaguard🎯Skill

Detect and filter potentially harmful or inappropriate content in AI conversations using advanced safety classification models.

🎯

nnsight-remote-interpretability🎯Skill

Enables remote neural network interpretation and analysis through advanced visualization, layer probing, and activation tracking techniques.