Training LLMs end to end is hard. Very excited to share our new blog (book?) that cover the full pipeline: pre-training, post-training and infra. 200+ pages of what worked, what didn’t, and how to make it run reliably huggingface.co/spaces/Huggin…

Oct 30, 2025 · 4:13 PM UTC

pretraining thread by @LoubnaBenAllal1 (who lead this project 🫶)
After ~4 years building SOTA models & datasets, we're sharing everything we learned in ⚡The Smol Training Playbook We cover the full LLM cycle: designing ablations, choosing an architecture, curating data, post-training, and building solid infrastructure. We'll help you navigate the messy training reality that LLM papers don't cover. Chapter highlights in the 🧵
1
4
88
post training thread by @_lewtun
We've just published the Smol Training Playbook: a distillation of hard earned knowledge to share exactly what it takes to train SOTA LLMs ⚡️ Featuring our protagonist SmolLM3, we cover: 🧭 Strategy on whether to train your own LLM and burn all your VC money 🪨 Pretraining, aka turning a mountain of text into a fancy auto-completer 🗿How to sculpt base models with post-training alchemy 🛠️ The underlying infra and how to debug your way out of NCCL purgatory Highlights from the post-training chapter in the thread 👇
1
2
56
infra thread by @Nouamanetazi
We're releasing The Smol Training Playbook 📖 Training SmolLM3 on 384 H100s for nearly a month taught us: infrastructure is the unsung hero of LLM training. Most care about architecture and data, yet few understand the hardware layer. This playbook changes that 🧵
2
49
Replying to @eliebakouch
you drop this elie??? on my birthday????? that’s so kind man thank you
13
40
mannn didn't know it was your birthday! happy birthday!!!
2
14
Replying to @eliebakouch
Your page is broken:
3
11
It’s working for me even on my phone (but I don’t have the new link feature so can’t reproduce exactly)
3
Replying to @eliebakouch
awesome work Elie!!
1
4
thanks!! :)
2
Replying to @eliebakouch
team hf giving treats on halloween 🙌🏻
1
3
enjoy 🎃
3
Replying to @eliebakouch
thanks elie! goated work.
1
1
Thanks!!
1
Replying to @eliebakouch
huge for the opensource community. congrats, elie!
1
1
thanks a lot!
Replying to @eliebakouch
is there a course where someone could go something like this on the cheap?
1
4
Replying to @eliebakouch
thanks for sharing! Excited to read this one!!
1
1
Replying to @eliebakouch
This was what I needed !!!
1
1
🫡🫡
2
Replying to @eliebakouch
where can I buy the paperback?
1
1
not yet release, we might do something like the ultra scale playbook if there is interest (but we need time!)
4
Replying to @eliebakouch
I get: "Job failed with exit code: 1. Reason: cache miss"
1
1
Can you try again? Should be good now
Replying to @eliebakouch
I wonder why this plot didn't make it into your summary
1
8
Replying to @eliebakouch
looks like i know what im doing for the next 2-4 days
7
Replying to @eliebakouch
Elie what have you done to me oh no
4
Replying to @eliebakouch
Care to share a PDF for poor individuals? :)
4
Replying to @eliebakouch
Is this something that can be done on anything a normal person has access to? I want to play with it, not just read the theory.
2