“Hope, however, is a self-modifying recurrent architecture that can take advantage of unbounded levels of in-context learning and also is augmented with CMS blocks to scale to larger context windows.” That will be so cool.
Introducing Nested Learning: A new ML paradigm for continual learning that views models as nested optimization problems to enhance long context processing. Our proof-of-concept model, Hope, shows improved performance in language modeling. Learn more: goo.gle/47LJrzI @GoogleAI
piped.video/K09erFsOnxA?t=1227 Koltun's team established a rule that they should never kick a robot. That reminds me of my bad impression of Unitree: their PR videos so often show robots being treated badly that it makes me wonder whether they are really building "intelligent" robots.
and knowledge as described in incompleteideas.net/IncIdeas…, outcome-conditioned prediction.
In our new work - Algorithm Distillation - we show that transformers can improve themselves autonomously through trial and error without ever updating their weights. No prompting, no finetuning. A single transformer collects its own data and maximizes rewards on new tasks. 1/N
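The mechanism the tweet describes can be sketched as an evaluation loop: the transformer's weights stay frozen, and improvement across episodes comes only from conditioning on the growing cross-episode history. This is a minimal illustrative sketch, not the paper's implementation; `model`, `env`, and their interfaces are all hypothetical placeholders.

```python
# Hypothetical sketch of in-context RL at evaluation time.
# No gradient steps anywhere: the only thing that changes across
# episodes is the history the (frozen) model conditions on.
def evaluate_in_context(model, env, num_episodes):
    history = []   # cross-episode (obs, action, reward) tokens
    returns = []
    for _ in range(num_episodes):
        obs, done, total = env.reset(), False, 0.0
        while not done:
            action = model.act(history, obs)   # frozen weights
            obs, reward, done = env.step(action)
            history.append((obs, action, reward))
            total += reward
        returns.append(total)
    # If in-context learning works, later entries of `returns`
    # should beat earlier ones on the same (new) task.
    return returns
```

If the model was trained (by distillation) on histories of an RL algorithm improving, its next-action predictions reproduce that improvement behavior purely in context.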
They all look quite equivalent to me: induced factors, parity-check matrix, codebook.
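One direction of that equivalence is concrete: a parity-check matrix induces a codebook as its null space over GF(2). A minimal sketch with an arbitrary example matrix (the specific `H` here is my own illustration, not from the tweet):

```python
import itertools
import numpy as np

# Example parity-check matrix H over GF(2).
# The codebook is exactly the set of bit vectors x with H @ x = 0 (mod 2),
# i.e. the null space of H; each row of H is one parity constraint (factor).
H = np.array([[1, 1, 0, 1],
              [0, 1, 1, 1]])

codebook = [x for x in itertools.product([0, 1], repeat=H.shape[1])
            if not (H @ x % 2).any()]

# rank(H) = 2 over GF(2), so the code has 2^(4-2) = 4 codewords.
```

Going the other way, any linear codebook determines a parity-check matrix up to row operations, which is one way to read the "equivalent views" point.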
And it looks to me like we most often transform information in the following order (a first-order approximation): Reality => see/act/(w * reason with a language model in mind) => language, where w changes with growth.
Compared to VAE and Diffusion models, the "noise channel" a GPT must overcome is the inherent ambiguity and uncertainty of language; hence information acquisition from this source is in theory limited by how well humans can acquire information from reality using language.
The lecture slides and videos for the first six weeks of my new course are now posted on the open book website: ma-lab-berkeley.github.io/de…
Machines that can predict what their sensors (touch, cameras, keyboard, temperature, microphones, gyros, …) will perceive are already aware and have subjective experience. It’s all a matter of degree now. More sensors, data, compute, tasks will lead without any doubt to the “I think therefore I am” moment for computers, and we’re not ready for it yet. arxiv.org/pdf/1804.06318 share.google/kxx6WyqHpwPmo6Q…
Looking fwd to playing w this @GoogleDeepMind developers.googleblog.com/en…
Over the past year, my lab has been working on fleshing out theory/applications of the Platonic Representation Hypothesis. Today I want to share two new works on this topic: Eliciting higher alignment: arxiv.org/abs/2510.02425 Unpaired rep learning: arxiv.org/abs/2510.08492 1/9
Excited to share our new paper on AI-Driven Research for Systems. We show that AI can autonomously generate and verify novel solutions for classic systems performance problems, matching or exceeding human designs. A glimpse into how AI might transform not only systems, but the research process itself.
🚀 Excited to release our new paper: “Barbarians at the Gate: How AI is Upending Systems Research” We show how AI-Driven Research for Systems (ADRS) can rediscover or outperform human-designed algorithms across cloud scheduling, MoE expert load balancing, LLM-SQL optimization, transaction scheduling, and more — all within hours ⚡️ and under $20 💰. 🧵👇 Check it out!
Real-time online 3D reconstruction of scenes and humans represented with SMPL. fanegg.github.io/Human3R/ I don't get tired of looking at these results
Today's mood also reminds me of Alex, the free-solo climber, who wants to climb whenever he sees a mountain or an uneven surface. In the end it is the mountain, the object being climbed, that keeps shaping his muscle memory and technique. That is the opposite of Michelangelo.
A couple bits of news: 1. Happy to share my first (human) NetHack ascension; next step is RL agents :) 2. I wrote a post discussing some @NetHack_LE challenges & how they map to open problems in RL & agentic AI. Still the best RL benchmark imo. mikaelhenaff.substack.com/p/…
Introducing Scalable Option Learning (SOL☀️), a blazingly fast hierarchical RL algorithm that makes progress on long-horizon tasks and demonstrates positive scaling trends on the largely unsolved NetHack benchmark, when trained for 30 billion samples. Details, paper and code in >