Bryce Adelstein Lelbach · May 10, 2020 · 2:47 AM UTC

Bryce Adelstein Lelbach

Pinned Tweet

@blelbach

10 May 2020

The latest revision of @INCITS/@isostandards COBOL comes out this year The goals of COBOL sound normal today: - Portable - Freely available - Designed by the community In 1959 it was radical & unprecedented It was also conceived of & led by women This is the story of COBOL

105

341

ADSP: The Podcast · Nov 7, 2025 · 4:48 PM UTC

Bryce Adelstein Lelbach retweeted

ADSP: The Podcast @adspthepodcast

Nov 7

📢 Episode 259 is out! 📢 In this episode, @blelbach and @code_report record live from NDC TechTown in Norway 🇳🇴! We interview Vittorio Romeo and @jfbastien about C++, training, their talks and more! adspthepodcast.com/2025/11/0…

Bryce Adelstein Lelbach · Nov 2, 2025 · 4:22 AM UTC

Bryce Adelstein Lelbach

@blelbach

Nov 2

I got her a replacement. Don't worry, it's by a window that does not open!

8,726

Bryce Adelstein Lelbach · Nov 2, 2025 · 2:56 AM UTC

Bryce Adelstein Lelbach

@blelbach

Nov 2

This was from the 37th floor on Halloween. No remains have been found...

159

13,713

Bryce Adelstein Lelbach · Nov 1, 2025 · 4:28 PM UTC

Bryce Adelstein Lelbach

@blelbach

Nov 1

Why is @TripIt email processing still so bad? How is this not just an LLM? I bet you could handle 95% of unknown formats this way.

Bryce Adelstein Lelbach · Nov 1, 2025 · 4:07 AM UTC

Bryce Adelstein Lelbach

@blelbach

Nov 1

Our Lego orchid decided to exit the building.

276

2,815

406

125,700

Bryce Adelstein Lelbach · Nov 1, 2025 · 12:29 AM UTC

Bryce Adelstein Lelbach

@blelbach

Nov 1

I ate the taco dog.

Bryce Adelstein Lelbach · Oct 31, 2025 · 10:27 PM UTC

Bryce Adelstein Lelbach

@blelbach

Oct 31

This is one of the most aggravating "features" of GPT-5 - it won't launch a deep research task without asking some clarifying question first, even if you explained everything in your initial prompt.

ADSP: The Podcast · Oct 31, 2025 · 12:28 PM UTC

Bryce Adelstein Lelbach retweeted

ADSP: The Podcast @adspthepodcast

Oct 31

📢 Episode 258 is out! 📢 In this episode, @blelbach and @code_report record live from Norway the day before NDC TechTown! Bryce explains a taxonomy of algorithms: serial, parallel and cooperative! 🥳 adspthepodcast.com/2025/10/3…

ADSP: The Podcast · Oct 24, 2025 · 11:49 AM UTC

Bryce Adelstein Lelbach retweeted

ADSP: The Podcast @adspthepodcast

Oct 24

📢 Episode 257 is out! 📢 In this episode, @blelbach and @code_report record live from Norway 🇳🇴! They continue their chat about the replicate, scatter, gather and run length decode algorithms! They recap their train troubles 🚂 as well! adspthepodcast.com/2025/10/2…

Baxate · Oct 20, 2025 · 8:01 PM UTC

Bryce Adelstein Lelbach retweeted

Baxate

@Baxate_carter

Oct 20

Now anyone can pre-train your own model in 4 hours. Incredible work by @karpathy to open source and democratize some of the most important educational resources in the AI Era. The “Eurekas Per Second” in nanochat is something you must experience. The NVIDIA Brev team has created a launchable (see below) for you to try! Simply click “Deploy Launchable” and a GPU will be provisioned in the cloud, and nanochat will begin training! The first 10 developers to deploy will be able to train their own GPT 2 style model completely for free. Happy hacking!

Andrej Karpathy

@karpathy

Oct 13

Excited to release new repo: nanochat! (it's among the most unhinged I've written). Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single, dependency-minimal codebase. You boot up a cloud GPU box, run a single script and in as little as 4 hours later you can talk to your own LLM in a ChatGPT-like web UI. It weighs ~8,000 lines of imo quite clean code to: - Train the tokenizer using a new Rust implementation - Pretrain a Transformer LLM on FineWeb, evaluate CORE score across a number of metrics - Midtrain on user-assistant conversations from SmolTalk, multiple choice questions, tool use. - SFT, evaluate the chat model on world knowledge multiple choice (ARC-E/C, MMLU), math (GSM8K), code (HumanEval) - RL the model optionally on GSM8K with "GRPO" - Efficient inference the model in an Engine with KV cache, simple prefill/decode, tool use (Python interpreter in a lightweight sandbox), talk to it over CLI or ChatGPT-like WebUI. - Write a single markdown report card, summarizing and gamifying the whole thing. Even for as low as ~$100 in cost (~4 hours on an 8XH100 node), you can train a little ChatGPT clone that you can kind of talk to, and which can write stories/poems, answer simple questions. About ~12 hours surpasses GPT-2 CORE metric. As you further scale up towards ~$1000 (~41.6 hours of training), it quickly becomes a lot more coherent and can solve simple math/code problems and take multiple choice tests. E.g. a depth 30 model trained for 24 hours (this is about equal to FLOPs of GPT-3 Small 125M and 1/1000th of GPT-3) gets into 40s on MMLU and 70s on ARC-Easy, 20s on GSM8K, etc. My goal is to get the full "strong baseline" stack into one cohesive, minimal, readable, hackable, maximally forkable repo. nanochat will be the capstone project of LLM101n (which is still being developed). I think it also has potential to grow into a research harness, or a benchmark, similar to nanoGPT before it. It is by no means finished, tuned or optimized (actually I think there's likely quite a bit of low-hanging fruit), but I think it's at a place where the overall skeleton is ok enough that it can go up on GitHub where all the parts of it can be improved. Link to repo and a detailed walkthrough of the nanochat speedrun is in the reply.

132

2,186

Bryce Adelstein Lelbach · Oct 17, 2025 · 12:36 PM UTC

Bryce Adelstein Lelbach

@blelbach

Oct 17

This will go down as one of the great episodes in ADSP lore. I did in fact crack the one pass algorithm but Conor hasn't seen it yet

ADSP: The Podcast @adspthepodcast

Oct 17

📢 Episode 256 is out! 📢 In this episode, @blelbach and @code_report record live from the streets of Copenhagen 🇩🇰! They talk about the algorithms replicate, scatter, gather and run length decode while navigating bugs 🐛, clubs 🍾 and rain 🌧️! adspthepodcast.com/2025/10/1…

ADSP: The Podcast · Oct 17, 2025 · 12:13 PM UTC

Bryce Adelstein Lelbach retweeted

ADSP: The Podcast @adspthepodcast

Oct 17

Bryce Adelstein Lelbach · Oct 10, 2025 · 8:42 PM UTC

Bryce Adelstein Lelbach

@blelbach

Oct 10

Wow there are a LOT of rats at @Schiphol. And I say that as a New Yorker!

NVIDIA GeForce · Oct 10, 2025 · 3:01 PM UTC

Bryce Adelstein Lelbach retweeted

NVIDIA GeForce

@NVIDIAGeForce

Oct 10

🟢 GEFORCE DAY IS BACK 🟢 To celebrate, we're giving away TWO GeForce RTX 5080 Founders Edition GPUs, signed by NVIDIA CEO Jensen Huang. Want one? Comment "GeForce Day" for a chance to WIN & stay tuned for more!

61,509

3,822

267

48,926

ADSP: The Podcast · Oct 10, 2025 · 11:50 AM UTC

Bryce Adelstein Lelbach retweeted

ADSP: The Podcast @adspthepodcast

Oct 10

📢 Episode 255 is out! 📢 In this episode, @blelbach and @code_report record live from the streets of Copenhagen 🇩🇰! They recap the C++ Copenhagen Meetup hosted by Symbion, the replicate algorithm and much more! adspthepodcast.com/2025/10/1…

Bryce Adelstein Lelbach · Oct 9, 2025 · 8:33 PM UTC

Bryce Adelstein Lelbach

@blelbach

Oct 9

.@code_report release the @adspthepodcast walnut tapes!

Dmitrii Kovanikov · Oct 8, 2025 · 9:41 PM UTC

Bryce Adelstein Lelbach retweeted

Dmitrii Kovanikov

@ChShersh

Oct 8

Sad day to be a C++ dev. It will become harder to dunk on Python.

Charlie Marsh

@charliermarsh

Oct 8

As of Python 3.14, the free-threaded (or no-GIL) version of the Python interpreter is no longer considered experimental.

2,062

Bryce Adelstein Lelbach · Oct 9, 2025 · 11:56 AM UTC

Bryce Adelstein Lelbach

@blelbach

Oct 9

.@code_report's Parrot, a C++ parallel array-based library with implicit fusion using CUDA/Thrust, was launched today at @cppunderthesea! It's the culmination of 3 years of work by Conor. This is the best way to write GPU-accelerated algorithms in C++! github.com/nvlabs/parrot

288

sadernoheart · Oct 7, 2025 · 10:48 AM UTC

Bryce Adelstein Lelbach retweeted

sadernoheart

@sadernoheart

Oct 7

day 68/100 of GPU Programming - learnt how to write a relu kernel in cute dsl - slightly tweaked my fast cuda reverse array kernel and making it 7.84x faster on the B200(0.13842 ms to 0.01765 ms), similar improvements seen on the T4, A100, H200 and H100

sadernoheart

@sadernoheart

Oct 6

day 67/100 of GPU Programming - practiced writing my very fast dot product kernel and a fp16 gemm kernel

868

Bryce Adelstein Lelbach · Oct 4, 2025 · 3:44 PM UTC

Bryce Adelstein Lelbach

@blelbach

Oct 4

.@code_report said I didn't get enough @adspthepodcast stickers last time, so... Come get some at @cppunderthesea next week! We will both be there.