sglang
π―Skillfrom ovachiever/droid-tings
Enables fast, structured LLM generation with RadixAttention prefix caching for JSON, regex, and agentic workflows with 5Γ faster inference.
Part of
ovachiever/droid-tings(370 items)
Installation
pip install "sglang[all]"pip install sglang[all] flashinfer -i https://flashinfer.ai/whl/cu121/torch2.4/git clone https://github.com/sgl-project/sglang.gitpip install -e "python[all]"python -m sglang.launch_server \Skill Details
Fast structured generation and serving for LLMs with RadixAttention prefix caching. Use for JSON/regex outputs, constrained decoding, agentic workflows with tool calls, or when you need 5Γ faster inference than vLLM with prefix sharing. Powers 300,000+ GPUs at xAI, AMD, NVIDIA, and LinkedIn.
More from this repository10
nextjs-shadcn-builder skill from ovachiever/droid-tings
security-auditor skill from ovachiever/droid-tings
threejs-graphics-optimizer skill from ovachiever/droid-tings
api-documenter skill from ovachiever/droid-tings
secret-scanner skill from ovachiever/droid-tings
readme-updater skill from ovachiever/droid-tings
applying-brand-guidelines skill from ovachiever/droid-tings
Configures Tailwind v4 with shadcn/ui, automating CSS variable setup, dark mode, and preventing common initialization errors.
deep-reading-analyst skill from ovachiever/droid-tings
dependency-auditor skill from ovachiever/droid-tings