Mohammad Pezeshki · Oct 10, 2025 · 12:28 PM UTC

Mohammad Pezeshki · Oct 10, 2025 · 12:28 PM UTC

Mohammad Pezeshki

Mohammad Pezeshki @mpezeshki91

Oct 10

A very nice read. Fixed chunks make ultra-long reasoning feasible. Very nice visualizations too! Congrats to the authors!

Milad Aghajohari

@MAghajohari

Oct 9

Introducing linear scaling of reasoning: 𝐓𝐡𝐞 𝐌𝐚𝐫𝐤𝐨𝐯𝐢𝐚𝐧 𝐓𝐡𝐢𝐧𝐤𝐞𝐫 Reformulate RL so thinking scales 𝐎(𝐧) 𝐜𝐨𝐦𝐩𝐮𝐭𝐞, not O(n^2), with O(1) 𝐦𝐞𝐦𝐨𝐫𝐲, architecture-agnostic. Train R1-1.5B into a markovian thinker with 96K thought budget, ~2X accuracy 🧵

Oct 10, 2025 · 12:28 PM UTC