1 result for tag "nanochat-llm-training"
Karpathy's minimal, hackable end-to-end harness for training LLMs on a single GPU node: tokenization, pretraining, SFT/RL finetuning, DCLM CORE evaluation, KV-cache inference, and a ChatGPT-like web UI. A single `--depth` dial auto-configures width, heads, LR, and training horizon, letting you reproduce GPT-2 for ~$48 on 8×H100 in roughly 2 hours.
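The "single dial" idea can be sketched as follows — one depth value deterministically fixes the rest of the config. This is an illustrative assumption, not nanochat's actual derivation rules: the multipliers (64-dim heads, inverse-sqrt LR scaling, ~20 tokens per parameter a la Chinchilla) are placeholder heuristics.

```python
# Hypothetical sketch of a depth-driven config, NOT nanochat's real formulas.
from dataclasses import dataclass

@dataclass
class Config:
    depth: int     # number of transformer layers (the one dial)
    width: int     # model/embedding dimension
    n_heads: int   # attention heads
    lr: float      # peak learning rate
    tokens: int    # training horizon in tokens

def config_from_depth(depth: int, head_dim: int = 64) -> Config:
    width = depth * head_dim                 # width grows linearly with depth
    n_heads = width // head_dim              # keep per-head dimension fixed
    lr = 0.02 / width ** 0.5                 # LR shrinks with width (heuristic)
    params = 12 * depth * width ** 2         # rough transformer param count
    tokens = 20 * params                     # ~Chinchilla-optimal horizon
    return Config(depth, width, n_heads, lr, tokens)

cfg = config_from_depth(12)  # a GPT-2-small-like shape: width 768, 12 heads
```

The appeal of this pattern is that scaling an experiment up or down is a one-flag change, with every coupled hyperparameter moving in lockstep.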