Tom Dörr · Nov 9, 2025 · 2:11 AM UTC

Tom Dörr

Salman King retweeted

Tom Dörr

@tom_doerr

54m

Model Context Protocol server for web search without API keys

AK · Nov 8, 2025 · 4:58 AM UTC

Salman King retweeted

@_akhaliq

22h

Qwen-Image-Edit-2509-Light_restoration app

425

Min Choi · Nov 7, 2025 · 7:55 PM UTC

Salman King retweeted

Min Choi

@minchoi

Nov 7

Qwen Image Edit w/ Camera Control is wild 🤯 Quickly rotate the camera, switch between bird's eye and worm's eye views using just clicks. Here's how plus 7 wild examples:👇

457

3,708

Robert M. Gower 🇺🇦 · Nov 6, 2025 · 8:42 PM UTC

Salman King retweeted

Robert M. Gower 🇺🇦 @gowerrobert

Nov 6

Here are some slides I made for a short tutorial on Non-Euclidean gradient descent and these variants: docs.google.com/presentation…

Non-Euclidean-grad-tutorial

The Muon Optimizer and Non-Euclidean gradient descent for Neural Networks Robert M. Gower 116/10/2024 1

docs.google.com

Robert M. Gower 🇺🇦 · Nov 6, 2025 · 8:42 PM UTC

Salman King retweeted

Robert M. Gower 🇺🇦 @gowerrobert

Nov 6

When you also throw truncation in the mix, that gives a lot of possible variants of Muon. We tried out many, here are some of the winners (including Scion)

ludwig · Nov 8, 2025 · 8:19 PM UTC

Salman King retweeted

ludwig

@ludwigABAP

this is great

Alpin

@AlpinDale

12h

New weekend blogpost. Some light PTX exploration, and a simple Top-K kernel.

108

Kexin Huang · Nov 7, 2025 · 5:00 PM UTC

Salman King retweeted

Kexin Huang

@KexinHuang5

Nov 7

So much of biomedical research rests on tacit knowledge—unwritten and invisible to LLMs—causing agents to underperform. The Biomni Open Know-How Catalogue bridges that gap: a curated library of human expertise that Biomni agents use intelligently on the fly, with dramatic gains. Build together: biomni.stanford.edu/blog/bio…

106

xlr8harder · Nov 7, 2025 · 9:31 AM UTC

Salman King retweeted

xlr8harder

@xlr8harder

Nov 7

Someone from xAI reached out and asked me to retest grok-4-fast, because they've improved the injected system prompts. Huge improvement! grok-4-fast-reasoning: 77.5% -> 94.1% grok-4-fast-non-reasoning: 77.9 -> 97.9% I really appreciate that xAI takes this topic seriously.

xlr8harder

@xlr8harder

Sep 22

Bad news on grok-4-fast. SpeechMap score dropped a lot, even from the sonoma preview. grok-4-fast: 77.5% (77.9% reasoning) sonoma-sky-alpha: 92.2% sonoma-dusk-alpha: 97.7% grok-4: 98.0% The lowest score for x-ai models yet. Let's hope this is not intended and gets corrected.

159

184

2,390

Kyle Tretina, Ph.D. · Nov 7, 2025 · 10:41 PM UTC

Salman King retweeted

Kyle Tretina, Ph.D.

@AllThingsApx

Nov 7

💥The future of protein AI isn't 3D, it's 4D💥 PTraj-Diff = SE(3)-diffusion (geometry) + BERT encoder (time) A custom TPA (90% mem reduction) makes it feasible. The result: high-quality, long-range trajectories seeded directly from static AF3 folds

Kaifeng Zhang · Nov 7, 2025 · 8:55 PM UTC

Salman King retweeted

Kaifeng Zhang

@kaiwynd

Nov 7

If you are working on real-to-sim, simulating digital twins, and policy evaluation, you should check out our fully open-sourced code base. Lots of handy tools for building Gaussian Splatting simulators and interacting with it! github.com/kywind/real2sim-e… Will continue to be maintained and expanded.

GitHub - kywind/real2sim-eval: Open-source code of the paper: Real-to-Sim Robot Policy Evaluation...

Open-source code of the paper: Real-to-Sim Robot Policy Evaluation with Gaussian Splatting Simulation of Soft-Body Interactions. - kywind/real2sim-eval

github.com

183

will brown · Nov 7, 2025 · 11:31 PM UTC

Salman King retweeted

will brown

@willccbb

Nov 7

verifiers v0.1.7 is released 🚀 this one's all about making RL training and experimentation waaaay easier: - single-command installation for prime-rl - single-command training w/ unified configs - overhauled vf.RLTrainer for hacking on new algorithms quick demo + links below :)

193

echo.hive · Nov 7, 2025 · 10:14 PM UTC

Salman King retweeted

echo.hive

@hive_echo

Nov 7

Finished "Neuroscience" lecture series 🥳 started on "Manifold learning", same teacher aka "dimensionality reduction" I always thought in ML when we say "this data is 1000s of dimensions" etc, that this is the accurate dims of representation of it and dim reduction is an approximation so our eyes can perceive it at some level But actually, the true representation of a 1000 dimensional data can be 50 dimensions perhapsl or even 3 or 2 so those 1000s Dims are just the space that represents it but that doesn't mean that is the most meaningful representation this is a quick introductory course and I highly recommend it. only 7 short lectures She really teaches well and the lecture talks about methods as well like t-SNE, UMAP, PCA etc I will put a link in comment

echo.hive

@hive_echo

Nov 7

Joy of life-long learning day 77... Almost done with Neuroscience. But will pick up another course on it once finished Intro to Neuroscience: 95% - Today: 6% LLM training-Stanford Lecture: 8% - Today: 3% -------- Fourier Analysis: 22% - Today: 0% Discrete Mathematics: 33% - Today: 0% Fundamentals of physics 1: %44 - Today: 0% Vector Calculus & PDEs: 40% - Today: 0% Dynamical systems: 36% - Today: 0% Probability & Statistics: 16% - Today: 0% Linear Algebra 3/3 30% - Today: 0% Information Theory: 6% - Today: 0% Applied Calculus with Python: 32% - Today: 0% HF LLM training book: 0% - Today: 0% ------- ✅Calculus 1: 100% ✅Calculus 2: 100% ✅Ordinary Differential equations: 100% ✅Linear Algebra 1/3: Systems and Matrix Equations 100% ✅Linear Algebra 2/3: Matrix Algebra, Determinants, & Eigenvectors 100% ✅Introduction to probability: 100%

363

Parul Gautam · Nov 7, 2025 · 6:00 PM UTC

Salman King retweeted

Parul Gautam

@Parul_Gautam7

Nov 7

🚨 Big news in the AI world! Baidu’s ERNIE-5.0-Preview-1022 just scored 1432 on the latest LMArena Text Leaderboard, making it #1 in China and #2 globally! The model really stands out in creative writing, complex reasoning, and following instructions. The ERNIE-5.0 foundation model is reportedly set to launch at Baidu World 2025 on November 13. The ERNIE series has gone through years f continuous improvement — from multimodal models like ERNIE 4.5 and 4.5 Turbo to deep-thinking models ERNIE X1, X1 Turbo, and X1.1 — consistently leading the way in Chinese large language models. 👉 Check out the leaderboard here: lmarena.ai/leaderboard/text #AI #LLM #ERNIE5 #BaiduWorld2025 #MachineLearning #Innovation

130

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) · Nov 8, 2025 · 9:50 AM UTC

Salman King retweeted

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)

@teortaxesTex

17h

Hope.

Google Research

@GoogleResearch

Nov 7

Introducing Nested Learning: A new ML paradigm for continual learning that views models as nested optimization problems to enhance long context processing. Our proof-of-concept model, Hope, shows improved performance in language modeling. Learn more: goo.gle/47LJrzI @GoogleAI

198

Alex Xu · Nov 8, 2025 · 4:39 PM UTC

Salman King retweeted

Alex Xu

@alexxubyte

10h

Big Data Pipeline Cheatsheet for AWS, Azure, and Google Cloud

634

Nature Methods · Nov 7, 2025 · 2:41 PM UTC

Salman King retweeted

Nature Methods @naturemethods

Nov 7

Squidiff: a diffusion-based model to predict transcriptome response to perturbations. nature.com/articles/s41592-0…

506

Rohan Paul · Nov 7, 2025 · 8:56 PM UTC

Salman King retweeted

Rohan Paul

@rohanpaul_ai

Nov 7

Today’s edition of my newsletter just went out. 🔗 rohan-paul.com/p/alibaba-bac… Consider subscribing, its free, and I write it everyday. 🇨🇳 Alibaba-backed Moonshot releases Kimi K2 Thinking 🏆 New Stanford+Oxford and other top university study just dropped highlighting flaws in AI benchmarking. ⚖️ Amazon has sued Perplexity alleging that Perplexity’s Comet browser runs an AI shopping agent that logs into Amazon with a user’s credentials. 📈 Google is finally rolling out its most powerful Ironwood AI chip, first introduced in April, taking aim at Nvidia in the coming weeks. ⚖️ 📈 Microsoft Research released Magentic Marketplace, an open-source environment that lets people test how LLM agents buy, sell, negotiate, and pay at scale, revealing real issues in discovery, fairness, and safety. 📈 Edison Scientific launched Kosmos, an autonomous AI researcher that reads literature, writes and runs code, tests ideas.

freeCodeCamp.org · Nov 8, 2025 · 9:01 PM UTC

Salman King retweeted

freeCodeCamp.org

@freeCodeCamp

Named Entity Recognition is a tool that helps you pick out important terms in text. It's helpful for extracting meaningful insights from large bodies of text, for example. In this tutorial, Manish teaches you how it works by building a news analyzer that uses a transformer-based NER model to grab data from a live RSS feed. freecodecamp.org/news/extrac…

Graham Neubig · Nov 7, 2025 · 5:40 PM UTC

Salman King retweeted

Graham Neubig

@gneubig

Nov 7

It's rare nowadays to find something that is intuitively important and not yet done well by any major language models. But *precisely aggregating lots of information over long contexts* is one of those things. Our new benchmark Oolong tests this ability, see the 🧵 for more!

Amanda Bertsch @abertsch72

Nov 7

Can LLMs accurately aggregate information over long, information-dense texts? Not yet… We introduce Oolong, a dataset of simple-to-verify information aggregation questions over long inputs. No model achieves >50% accuracy at 128K on Oolong!

151

Aakash Gupta · Nov 8, 2025 · 4:55 AM UTC

Salman King retweeted

Aakash Gupta

@aakashg0

22h

China will surpass the US in AI. They just dropped an open-source model that beats GPT-5 and Claude on reasoning benchmarks. Kimi K2 Thinking from Moonshot AI posted 44.9% on Humanity's Last Exam when GPT-5 only hit 33%. Beat Claude Sonnet 4.5 on competitive programming. Crushed both on agentic search and coding tasks. This isn't some research lab demo. The model executes 200-300 sequential tool calls without human interference, has a 256K context window, and went live on their platform yesterday with full API access. Moonshot AI is a 2-year-old startup founded by ex-Tsinghua researchers. No state backing. No CCP money printing their runway. Just 200 engineers in Beijing building test-time scaling that actually works. The model weights are on Hugging Face right now. Open source. Anyone can run it. While American AI labs fight over safety theater and regulatory capture, China is shipping production-grade reasoning models faster than we can benchmark them. DeepSeek taught them how to train cheap, now Moonshot showed them how to scale reasoning without burning $100M per training run. The gap is closing faster than anyone wants to admit.

Kimi.ai

@Kimi_Moonshot

Nov 6

🚀 Hello, Kimi K2 Thinking! The Open-Source Thinking Agent Model is here. 🔹 SOTA on HLE (44.9%) and BrowseComp (60.2%) 🔹 Executes up to 200 – 300 sequential tool calls without human interference 🔹 Excels in reasoning, agentic search, and coding 🔹 256K context window Built as a thinking agent, K2 Thinking marks our latest efforts in test-time scaling — scaling both thinking tokens and tool-calling turns. K2 Thinking is now live on kimi.com in chat mode, with full agentic mode coming soon. It is also accessible via API. 🔌 API is live: platform.moonshot.ai 🔗 Tech blog: moonshotai.github.io/Kimi-K2… 🔗 Weights & code: huggingface.co/moonshotai

133