Dwarkesh Patel · Sep 26, 2025 · 4:01 PM UTC

Dwarkesh Patel

Dwarkesh Patel

@dwarkesh_sp

Sep 26

.@RichardSSutton, father of reinforcement learning, doesn’t think LLMs are bitter-lesson-pilled. My steel man of Richard’s position: we need some new architecture to enable continual (on-the-job) learning. And if we have continual learning, we don't need a special training phase - the agent just learns on-the-fly - like all humans, and indeed, like all animals. This new paradigm will render our current approach with LLMs obsolete. I did my best to represent the view that LLMs will function as the foundation on which this experiential learning can happen. Some sparks flew. 0:00:00 – Are LLMs a dead-end? 0:13:51 – Do humans do imitation learning? 0:23:57 – The Era of Experience 0:34:25 – Current architectures generalize poorly out of distribution 0:42:17 – Surprises in the AI field 0:47:28 – Will The Bitter Lesson still apply after AGI? 0:54:35 – Succession to AI

255

637

340

4,538

Gary Marcus · Sep 26, 2025 · 7:11 PM UTC

Gary Marcus · Sep 26, 2025 · 7:11 PM UTC

Gary Marcus

@GaryMarcus

Sep 26

Replying to @dwarkesh_sp @RichardSSutton

much of Sutton’s critique of LLMs is virtually identical to what I have been arguing for many many years. it is disappointing @dwarkesh_sp that you would not let me present my views.

Sep 26, 2025 · 7:11 PM UTC

164

Chuck Russell · Oct 3, 2025 · 2:52 AM UTC

Chuck Russell @cichuck

Oct 3

Replying to @GaryMarcus @dwarkesh_sp @RichardSSutton

I don't think that's true.

oops · Sep 27, 2025 · 3:31 AM UTC

oops @Joonzzy

Sep 27

Replying to @GaryMarcus @dwarkesh_sp @RichardSSutton

Yes, this is about you

Nrx Accelerationist (elon musk stan) · Sep 26, 2025 · 8:32 PM UTC

Nrx Accelerationist (elon musk stan) @shyankothari

Sep 26

Replying to @GaryMarcus @dwarkesh_sp @RichardSSutton

ayyo get my man gary in there

Naveen Palli · Sep 27, 2025 · 12:02 AM UTC

Naveen Palli @naveenpalli

Sep 27

Replying to @GaryMarcus @dwarkesh_sp @RichardSSutton

You are an unpleasant person.

Tyler Moore · Sep 26, 2025 · 10:36 PM UTC

Tyler Moore @TylerMooreUS

Sep 26

Replying to @GaryMarcus @dwarkesh_sp @RichardSSutton

"would not let me present my views" - what does this mean? He did not invite you on his podcast?

Michael Guard · Sep 26, 2025 · 8:06 PM UTC

Michael Guard

@MickaelProsper

Sep 26

Replying to @GaryMarcus @dwarkesh_sp @RichardSSutton

I'd like to see you debate someone on his show.

Kamil Staszewski · Sep 27, 2025 · 4:56 AM UTC

Kamil Staszewski

@KamStaszewski

Sep 27

Replying to @GaryMarcus @dwarkesh_sp @RichardSSutton

Gary when someone asks him about Dwarkesh podcast

GIF

Peter Isza · Oct 1, 2025 · 5:21 PM UTC

Peter Isza @fs9h7kh4b5

Oct 1

Replying to @GaryMarcus @dwarkesh_sp @RichardSSutton

Who are you?

NeuralKitsune🏳️‍⚧️ · Sep 27, 2025 · 11:31 AM UTC

NeuralKitsune🏳️‍⚧️ @3ff3x_

Sep 27

Replying to @GaryMarcus @dwarkesh_sp @RichardSSutton

Bc Gary is the kind of guy that would get a negative view count

𝙩𝙮≃𝙛{𝕩}^A𝕀²·ℙarad𝕚g𝕞 · Sep 27, 2025 · 2:58 AM UTC

𝙩𝙮≃𝙛{𝕩}^A𝕀²·ℙarad𝕚g𝕞

@TaNGSoFT

Sep 27

Replying to @GaryMarcus @dwarkesh_sp @RichardSSutton

Sutton gets his own frame bias of TD learning algorithm?

Gautam · Oct 2, 2025 · 6:09 AM UTC

Gautam @gautamgoel978

Oct 2

Replying to @GaryMarcus @dwarkesh_sp @RichardSSutton

next guest?

Dirk de Vos · Sep 28, 2025 · 1:22 PM UTC

Dirk de Vos @DirkdeVos

Sep 28

Replying to @GaryMarcus @dwarkesh_sp @RichardSSutton

He had to be taken over the threshold by a (former) believer.

AdamKadmon91 · Oct 2, 2025 · 4:13 PM UTC

AdamKadmon91 @AdamKadmon91

Oct 2

Replying to @GaryMarcus @dwarkesh_sp @RichardSSutton

Gary, the last thing we want on this podcast is it to have it be polluted by your relentless self-aggrandizing intellectually incoherent noise generation.

Archimedes · Sep 28, 2025 · 1:41 AM UTC

Archimedes

@arXmedes

Sep 28

Replying to @GaryMarcus @dwarkesh_sp @RichardSSutton

Nobody cares about Gary