quantizing-models-bitsandbytes
๐ฏSkillfrom zechenzhangagi/ai-research-skills
An AI research engineering skill for quantizing models using the bitsandbytes library. Part of the Orchestra Research AI Research Skills library, which provides 82 skills across 20 categories to enable coding agents to execute AI research engineering tasks such as model optimization and inference.
Same repository
zechenzhangagi/ai-research-skills(97 items)
Installation
npx vibeindex add zechenzhangagi/ai-research-skills --skill quantizing-models-bitsandbytesnpx skills add zechenzhangagi/ai-research-skills --skill quantizing-models-bitsandbytes~/.claude/skills/quantizing-models-bitsandbytes/SKILL.mdSKILL.md
More from this repository10
Comprehensive open-source library of 77 AI research engineering skills across 20 categories including model architecture, fine-tuning, distributed training, RAG, multimodal, and ML paper writing.
A skill from the AI Research Engineering Skills library, a comprehensive open-source collection of 82 skills across 20 categories that provide engineering capabilities for AI agents to conduct research experiments, including training, evaluation, deployment, and agent building.
A Claude Code skill providing structured ideation frameworks with 10 complementary lenses for discovering high-impact AI research directions, part of the Orchestra Research AI research engineering skills library with 85+ skills across 21 categories.
An AI research engineering skill for Qdrant, a high-performance Rust-powered vector search engine with hybrid search and filtering capabilities, part of the 83-skill AI Research Skills library covering RAG and retrieval workflows.
A comprehensive open-source library of 82 AI research engineering skills across 20 categories (fine-tuning, distributed training, RAG, multimodal, safety, and more) that enable coding agents to conduct AI research experiments.
A skill from the AI Research Skills library, a comprehensive open-source collection of 87 skills across 22 categories that enable AI agents to autonomously conduct AI research from idea generation through experiment execution to paper writing.
AI research engineering skill for optimizing attention mechanisms with Flash Attention techniques, part of a comprehensive 83-skill library covering model architecture, training, and deployment.
A speculative decoding skill from the AI Research Engineering Skills library, part of a collection of 82 research skills covering emerging AI techniques. Provides expert-level guidance on speculative decoding for accelerating LLM inference.
Part of the AI Research Engineering Skills Library, a comprehensive collection of 83 skills across 20 categories covering the full AI research lifecycle. Provides expert-level guidance for knowledge distillation with real code examples and production-ready workflows for frameworks like Megatron-LM, vLLM, and TRL.
AI research skill for long-context model techniques and optimizations, part of an 87-skill library enabling autonomous AI research across 22 categories from idea to paper.