4 results for tag "tensorrt-llm"
A large collection of Claude Code skill templates sponsored by Z.AI, providing ready-to-use development skill configurations across various domains.
TensorRT-LLM skill from Orchestra Research for optimizing and deploying large language models with NVIDIA TensorRT acceleration.
Accelerates LLM inference on NVIDIA GPUs, with claimed speedups of 10-100x, and supports quantization, in-flight batching, and multi-GPU scaling.