Didier 'Dirac's ghost' Gaulin · Nov 8, 2025 · 10:21 PM UTC

Didier 'Dirac's ghost' Gaulin

Raghava Nellaturu retweeted

@DiracGhost

Carl Ludwig Siegel's genius is one of the best kept secret in mathematics, you might want to check out the publicly available book 'Transcendental Numbers' to see just why Andre Weil called him ''the greatest mathematician of the first half of the 20th century''. You should also check out his 'Lectures on the Geometry of Numbers' and 'Sympletic Geometry'.

110

Raghava Nellaturu · Nov 9, 2025 · 12:06 AM UTC

Raghava Nellaturu @rnella01

DS-STAR: Data Science Agent via Iterative Planning and Verification - arxiv.org/pdf/2509.21825?s=0…

Sebastian Raschka · Nov 8, 2025 · 1:58 PM UTC

Raghava Nellaturu retweeted

Sebastian Raschka

@rasbt

13h

My "The Building Blocks of Today’s and Tomorrow’s Language Models" talk at the PyTorch Conference is now up on YouTube! piped.video/watch?v=nDl6Aj9a… If you have 25 min this weekend, it's a whirlwind tour to catch you up on the key LLM architecture design considerations in popular LLMs this year (plus, an overview of alternative architecture designs). The silver lining of my late arrival and rescheduling: Since there was no talk after mine, it's followed by a 30 min Q&A instead of just the usual 5 min :)

The Building Blocks of Today’s and Tomorrow’s Language Models -...

The Building Blocks of Today’s and Tomorrow’s Language Models - Sebastian Raschka, RAIR LabIn this talk, you’ll learn about the latest trends in large langua...

youtube.com

263

Tom Dörr · Nov 8, 2025 · 1:27 AM UTC

Raghava Nellaturu retweeted

Tom Dörr

@tom_doerr

Nov 8

Curated list of system design resources

704

Philipp Schmid · Nov 8, 2025 · 1:27 PM UTC

Raghava Nellaturu retweeted

Philipp Schmid

@_philschmid

13h

TIL: Claude Code local sandbox environment is open-source. > native OS sandboxing primitives (sandbox-exec on macOS, bubblewrap on Linux) and proxy-based network filtering. It can be used to sandbox the behaviour of agents, local MCP servers, bash commands and arbitrary processes.

500

Simon Willison · Nov 6, 2025 · 3:56 PM UTC

Raghava Nellaturu retweeted

Simon Willison

@simonw

Nov 6

I've been getting a lot of value using coding agents for code research tasks recently - I have a dedicated simonw/research GitHub repo and I frequently have them run detailed experiments and write up the results. Here's how I'm doing that + some examples: simonwillison.net/2025/Nov/6…

Code research projects with async coding agents like Claude Code and Codex

I’ve been experimenting with a pattern for LLM usage recently that’s working out really well: asynchronous code research tasks. Pick a research question, spin up an asynchronous coding agent and …

simonwillison.net

519

Kieran Klaassen · Nov 7, 2025 · 7:09 PM UTC

Raghava Nellaturu retweeted

Kieran Klaassen

@kieranklaassen

Nov 7

The fastest way to ship with AI isn't vibe coding. It's planning. I built an AI image feature by spending 40 minutes NOT coding—just planning with specialized subagents that research, review, and question the spec. By the time I typed "/work," the implementation was already solved. Full breakdown + compounding engineering philosophy: every.to/source-code/teach-y…

576

Dan Shipper 📧 · Nov 7, 2025 · 6:57 PM UTC

Raghava Nellaturu retweeted

Dan Shipper 📧

@danshipper

Nov 7

absolute masterclass on coding with AI from @kieranklaassen today in @every every.to/source-code/teach-y…

166

2,305

Sebastian Raschka · Nov 6, 2025 · 7:12 PM UTC

Raghava Nellaturu retweeted

Sebastian Raschka

@rasbt

Nov 6

Exciting big Kimi K2 Thinking release! More experts, fewer heads, and even more thinking!

Kimi.ai

@Kimi_Moonshot

Nov 6

🚀 Hello, Kimi K2 Thinking! The Open-Source Thinking Agent Model is here. 🔹 SOTA on HLE (44.9%) and BrowseComp (60.2%) 🔹 Executes up to 200 – 300 sequential tool calls without human interference 🔹 Excels in reasoning, agentic search, and coding 🔹 256K context window Built as a thinking agent, K2 Thinking marks our latest efforts in test-time scaling — scaling both thinking tokens and tool-calling turns. K2 Thinking is now live on kimi.com in chat mode, with full agentic mode coming soon. It is also accessible via API. 🔌 API is live: platform.moonshot.ai 🔗 Tech blog: moonshotai.github.io/Kimi-K2… 🔗 Weights & code: huggingface.co/moonshotai

162

1,322

Raghava Nellaturu · Nov 7, 2025 · 10:01 PM UTC

Raghava Nellaturu @rnella01

Nov 7

TypeScript’s rise in the AI era: Insights from Lead Architect, Anders Hejlsberg - The GitHub Blog share.google/cKuiSgzzzEqlRTe…

TypeScript’s rise in the AI era: Insights from Lead Architect, Anders Hejlsberg

TypeScript just became the most-used language on GitHub. Here’s why, according to its creator.

github.blog

GPU MODE · Nov 7, 2025 · 4:11 PM UTC

Raghava Nellaturu retweeted

GPU MODE

@GPU_MODE

Nov 7

The most intuitive explanation of floats I've ever come across, courtesy of @fabynou fabiensanglard.net/floating_…

165

1,424

News from Google · Nov 6, 2025 · 9:45 PM UTC

Raghava Nellaturu retweeted

News from Google

@NewsFromGoogle

Nov 6

Today, we published a paper, “AI and the Future of Learning.” The paper outlines the opportunity of AI to reduce barriers to education and unlock human potential. It also addresses key questions like cognitive decline and cheating, and the need to build tools grounded in learning science.

202

1,510

AstraKernel 💫 · Nov 7, 2025 · 12:56 AM UTC

Raghava Nellaturu retweeted

AstraKernel 💫

@AstraKernel

Nov 7

🏮Async QUIC and HTTP/3 made easy: tokio-quiche is now open-source > Cloudflare's battle-tested > asynchronous QUIC library combining both quiche and the Rust Tokio async runtime > tokio-quiche handles millions of HTTP/3 requests per second with low latency and high throughput blog.cloudflare.com/async-qu…

142

Muyu He · Nov 7, 2025 · 3:54 AM UTC

Raghava Nellaturu retweeted

Muyu He

@HeMuyu0327

Nov 7

On-policy distillation is powerful, but @thinkymachines's tinker only supports distilling from a teacher model within the same family, making it impossible for qwen to learn from deepseek, gpt-oss, etc. For the first time, we enabled model-agnostic distillations natively using the tinker api, implemented in our open-source distillation framework spider. The framework now auto-detects differences in model naming and routes to cross-tokenizer KL computation in the training loop. We follow the same algorithm that @huggingface implements in TRL for on-policy distillation, ie. matching token groups by student and teacher many-to-many and computing joint probs for each group. With spider and tinker, model-agnostic distillation can be done in a GPU-free, fully server-side manner, which can empower faster and more budget-friendly iterations of experimentations. With this feature enabled, we will launch distillation runs from a suite of different teacher models, and study how they influence the training dynamics and downstream performances differently. Should be a ton of fun!

275

Charles · Nov 6, 2025 · 4:29 AM UTC

Raghava Nellaturu retweeted

Charles @m84736062

Nov 6

We reproduce deepseek-ocr training from scratch, the code, model, results can be found in our website #DeepSeek pkulium.github.io/DeepOCR_we…

Technical Report-v1: DeepOCR

Open-source reproduction of DeepSeek-OCR

pkulium.github.io

279

Mengdi Wang · Nov 6, 2025 · 5:05 PM UTC

Raghava Nellaturu retweeted

Mengdi Wang

@MengdiWang10

Nov 6

Can’t believe it — our Princeton AI^2 postdoc Shilong Liu @atasteoff re-built DeepSeek-OCR from scratch in just two weeks 😳 — and open-sourced it. This is how research should be done 🙌 #AI #LLM #DeepSeek #MachineLearning #Princeton @omarsar0 @PrincetonAInews @akshay_pachaar

Shilong Liu @atasteoff

Nov 6

Discover DeepOCR: a fully open-source reproduction of DeepSeek-OCR, complete with training & evaluation code! #DeepLearning #OCR

418

Raghava Nellaturu · Nov 7, 2025 · 4:14 AM UTC

Raghava Nellaturu @rnella01

Nov 7

You Should Write An Agent fly.io/blog/everyone-write-a… via @flydotio

You Should Write An Agent

They're like riding a bike: easy, and you don't get it until you try.

fly.io

Daniel San · Nov 6, 2025 · 12:55 PM UTC

Raghava Nellaturu retweeted

Daniel San

@dani_avila7

Nov 6

This aligns perfectly with what Anthropic just published about code execution with MCP Link: anthropic.com/engineering/co… In Claude Code, I've noticed Claude prefers writing and executing code directly in bash rather than using tool calls. I've seen Python scripts appear, execute, and disappear in a single run, Claude knows it needs to minimize token usage and work as efficiently as possible.

Daniel San

@dani_avila7

Nov 5

Lately I’ve been testing both CLI and MCP in Claude Code... and honestly, the CLI wins I list all my installed tools (vercel, supabase, gh, etc.) in CLAUDE .md and let Claude use them to check logs, run queries, or tweak configs. Setup is simpler, auth is native, and observability is crystal clear. There’s still debate on CLI vs MCP, but in my workflow, CLI is miles ahead

198

Shangtong Zhang · Nov 6, 2025 · 12:08 PM UTC

Raghava Nellaturu retweeted

Shangtong Zhang @ShangtongZhang

Nov 6

I started to learn Lean a few months ago. Now finally complete my first Lean project! With about 10K lines of code, I formalized the a.s. convergence of Q learning and linear TD. Hope AI can formalize more RL theory in the future! github.com/ShangtongZhang/rl…

GitHub - ShangtongZhang/rl-theory-in-lean: Towards Formalizing RL Theory

Towards Formalizing RL Theory. Contribute to ShangtongZhang/rl-theory-in-lean development by creating an account on GitHub.

github.com

197

Philipp Schmid · Nov 6, 2025 · 1:30 PM UTC

Raghava Nellaturu retweeted

Philipp Schmid

@_philschmid

Nov 6

Here is an example of using a deeply nested Pydantic Schema with +30 fields, Union, Optionals and recursive structures with the new Sturctured Structured Outputs in the Gemini API! 💥 Note: To enable this you need to use the new `response_json_schema` parameter instead of the old `response_schema`. Documentation and Snippet below.

246