Dwarkesh Patel · Sep 26, 2025 · 4:01 PM UTC

Dwarkesh Patel

Dwarkesh Patel

@dwarkesh_sp

Sep 26

.@RichardSSutton, father of reinforcement learning, doesn’t think LLMs are bitter-lesson-pilled. My steel man of Richard’s position: we need some new architecture to enable continual (on-the-job) learning. And if we have continual learning, we don't need a special training phase - the agent just learns on-the-fly - like all humans, and indeed, like all animals. This new paradigm will render our current approach with LLMs obsolete. I did my best to represent the view that LLMs will function as the foundation on which this experiential learning can happen. Some sparks flew. 0:00:00 – Are LLMs a dead-end? 0:13:51 – Do humans do imitation learning? 0:23:57 – The Era of Experience 0:34:25 – Current architectures generalize poorly out of distribution 0:42:17 – Surprises in the AI field 0:47:28 – Will The Bitter Lesson still apply after AGI? 0:54:35 – Succession to AI

255

637

340

4,538

Okara · Oct 16, 2025 · 7:33 AM UTC

Okara · Oct 16, 2025 · 7:33 AM UTC

Okara

@askOkara

Oct 16

Replying to @dwarkesh_sp @RichardSSutton

loved the insights he shared thanks for interviewing him

Okara

@askOkara

Oct 16

richard sutton is the godfather of reinforcement learning and a turing award winner he recently went on @dwarkesh_sp's podcast to explain why LLMs are a dead end. i used okara’s youtube tool to summarize the interview here are the top 10 insights he shared 👇

Oct 16, 2025 · 7:33 AM UTC