Training LLMs end to end is hard. Very excited to share our new blog (book?) that covers the full pipeline: pre-training, post-training, and infra. 200+ pages of what worked, what didn’t, and how to make it run reliably.
huggingface.co/spaces/Huggin…
Oct 30, 2025 · 4:13 PM UTC
pretraining thread by @LoubnaBenAllal1 (who led this project 🫶)
After ~4 years building SOTA models & datasets, we're sharing everything we learned in ⚡The Smol Training Playbook
We cover the full LLM cycle: designing ablations, choosing an architecture, curating data, post-training, and building solid infrastructure.
We'll help you navigate the messy training reality that LLM papers don't cover. Chapter highlights in the 🧵
post training thread by @_lewtun
We've just published the Smol Training Playbook: a distillation of hard-earned knowledge to share exactly what it takes to train SOTA LLMs ⚡️
Featuring our protagonist SmolLM3, we cover:
🧭 Strategy on whether to train your own LLM and burn all your VC money
🪨 Pretraining, aka turning a mountain of text into a fancy auto-completer
🗿 How to sculpt base models with post-training alchemy
🛠️ The underlying infra and how to debug your way out of NCCL purgatory (a taste of that below)
Highlights from the post-training chapter in the thread 👇
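To give a flavor of the NCCL debugging the infra chapter deals with, here is a minimal sketch (not from the playbook itself): it turns on NCCL's built-in debug logging before running a tiny collective, which is typically the first move when a multi-GPU job hangs. The chosen env-var values and the all_reduce smoke test are illustrative assumptions, not the book's code.

```python
# Minimal sketch, assuming a PyTorch + NCCL setup launched via torchrun.
import os

# Real NCCL environment variables; the values here are illustrative.
# Must be set before NCCL initializes (i.e., before init_process_group).
os.environ["NCCL_DEBUG"] = "INFO"               # log NCCL's internal decisions
os.environ["NCCL_DEBUG_SUBSYS"] = "INIT,COLL"   # limit noise to init + collectives

import torch
import torch.distributed as dist


def main():
    # torchrun sets RANK, WORLD_SIZE, and LOCAL_RANK for each process.
    dist.init_process_group(backend="nccl")
    torch.cuda.set_device(int(os.environ["LOCAL_RANK"]))

    # Tiny smoke test: if this hangs, the NCCL logs usually reveal
    # which rank or communicator stalled.
    x = torch.ones(1, device="cuda")
    dist.all_reduce(x)
    print(f"rank {dist.get_rank()}: all_reduce ok, x={x.item()}")

    dist.destroy_process_group()


if __name__ == "__main__":
    main()
```

Run it with e.g. `torchrun --nproc_per_node=2 smoke_test.py` (the filename is hypothetical); a clean run prints one line per rank, while a hang plus the NCCL INFO logs is your starting point for debugging.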
infra thread by @Nouamanetazi