CTO/Co-founder @_inception_ai. AI Prof @UCLA. CS PhD @Stanford. Denoising intelligence.

CA
Joined May 2009
A few months ago, we started Inception Labs, a new generative AI startup with a rockstar founding team. At Inception, we are challenging the status quo for language generation. Our first results bring blazing-fast speeds of 1000+ tokens/sec while matching the quality of leading speed-optimized frontier LLMs. And all on commodity NVIDIA H100s - an industry first! Our vision is to extend the frontier of speed, quality, and cost for next-generation language models. Join us!
We are excited to introduce Mercury, the first commercial-grade diffusion large language model (dLLM)! dLLMs push the frontier of intelligence and speed with parallel, coarse-to-fine text generation.
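For intuition, here is a minimal, purely conceptual sketch of coarse-to-fine parallel decoding, assuming a hypothetical `model` callable that scores every position of a partially masked sequence in one forward pass; this is an illustration of the general idea, not Mercury's actual decoder.

```python
import numpy as np

MASK = -1  # hypothetical id standing in for the [MASK] token

def dllm_generate(model, length=32, steps=8):
    # Start from a fully masked sequence and refine it coarse-to-fine.
    tokens = np.full(length, MASK)
    for step in range(steps):
        masked = tokens == MASK
        if not masked.any():
            break
        # One parallel forward pass scores every position at once;
        # `model` is assumed to return (predicted_ids, confidences).
        pred_ids, conf = model(tokens)
        # Commit roughly an equal share of the remaining masked positions
        # each step, keeping the predictions the model is most confident in.
        k = max(1, int(masked.sum() / (steps - step)))
        scores = np.where(masked, conf, -np.inf)
        commit = np.argsort(-scores)[:k]
        tokens[commit] = pred_ids[commit]
    return tokens
```

The key contrast with autoregressive decoding: every pass updates many positions at once, so the number of forward passes is set by the number of refinement steps rather than the number of tokens.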
Aditya Grover retweeted
Elon believes a majority of AI workloads will be diffusion models. I’d pay close attention to Inception Labs, a team of Stanford professors who are doing foundational work here. In the history of computing, no single ML architecture has been dominant for more than a decade.
Do we need a re-poll?
Diffusion will obviously work on any bitstream. With text, since humans read from first word to last, there is just the question of whether the delay to first sentence for diffusion is worth it. That said, the vast majority of AI workload will be video understanding and generation, so good chance diffusion is the biggest winner overall. Also means that the ratio of compute to memory bandwidth will increase.
Experience the magic today with our line of Mercury dLLMs! Chat: chat.inceptionlabs.ai/ API: platform.inceptionlabs.ai/ (first 10m tokens free)
Mercury is refreshed – with across-the-board improvements in coding, instruction following, math, and knowledge recall. Start building responsive, in-the-flow AI solutions! Read more: inceptionlabs.ai/blog/mercur…
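For developers, a minimal sketch of calling Mercury from Python, assuming the platform exposes an OpenAI-compatible endpoint; the base URL and model id below are placeholder assumptions, so check platform.inceptionlabs.ai for the actual values.

```python
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_INCEPTION_API_KEY",            # issued on the platform
    base_url="https://api.inceptionlabs.ai/v1",  # assumed endpoint, verify in the docs
)

response = client.chat.completions.create(
    model="mercury",  # assumed model id; a coder variant may also be offered
    messages=[{"role": "user", "content": "Write a haiku about diffusion."}],
)
print(response.choices[0].message.content)
```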
Diffusion LLMs (dLLMs) have come a long way, from an idea in research labs to a cutting-edge technology redefining the frontiers of generative AI. Excited to announce our $50M seed round led by @MenloVentures and made possible by the tireless efforts of our team @_inception_ai.
Today’s LLMs are painfully slow and expensive. They are autoregressive and spit out words sequentially. One. At. A. Time. Our dLLMs generate text in parallel, delivering answers up to 10X faster. Now we’ve raised $50M to scale them. Full story from @russellbrandom in @TechCrunch. techcrunch.com/2025/11/06/in…
Thanks @deedydas. Grateful for access to an extraordinary group of teachers, friends, and alumni at DPS RK Puram in India. And Exun in particular is a one-of-a-kind group. Many fond memories of trading classes and coursework for programming competitions... Mukesh Kumar's leadership (@ikkumpal) made it all possible!
Every single one of these $100M+ companies was started by alumni from a single Computer Science club in a non-American high school: Cartesia, Inception Labs, General Catalyst, CVF, Wispr Flow, Affinity, Snapdeal, Sugar, boAt. It's Exun Clan in Delhi Public School, RK Puram in India.
Welcome Qinqing to @_inception_ai! So excited to work together (again!) and re-imagine the foundations of generative AI, bringing together diffusion LLMs and reinforcement learning.
Hard to see the layoffs at Meta — so many brilliant people and mentors I learned from. I went through those same struggles @tydsh mentioned: the uncertainty, the long nights, the hope things would turn around — and the disappointment before finally deciding to leave. After two months of rest and reflection, I’m grateful to be joining @_inception_ai to work on diffusion LLMs — continuing my research on discrete diffusion from FAIR and exploring its potential for ultrafast, scalable reasoning in language models. 🚀 Wishing my former teammates all the best as they carry the work forward.
Aditya Grover retweeted
"An hour of planning can save you 10 hours of doing." ✨📝 Planned Diffusion 📝 ✨ makes a plan before parallel dLLM generation. Planned Diffusion runs 1.2-1.8× faster than autoregressive and an order of magnitude faster than diffusion, while staying within 0.9–5% AR quality.
Aditya Grover retweeted
New paper 📢 Most powerful vision-language (VL) reasoning datasets remain proprietary 🔒, hindering efforts to study their principles and develop similarly effective datasets in the open 🔓. Thus, we introduce HoneyBee, a 2.5M-example dataset created through careful data curation. It trains VLM reasoners that outperform InternVL2.5/3-Instruct and Qwen2.5-VL-Instruct across model scales (e.g., an 8% MathVerse improvement over QwenVL at the 3B scale). 🧵👇 Work done during my internship at @AIatMeta w/ 🤝 @ramakanth1729, @Devendr06654102, @scottyih, @gargighosh, @adityagrover_, and @kaiwei_chang.
Aditya Grover retweeted
(1/4) At the recent #COLM2025 conference, we presented our work PredGen, an acceleration technique that improves the latency of voice-to-voice chat applications powered by LLMs. It leverages free compute available at user input time to perform drafting and text-to-speech synthesis.
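A toy sketch of that idea, with hypothetical `draft_llm`, `verify`, and `tts` callables; it only illustrates how drafting during user input moves work off the critical path, not the paper's actual pipeline.

```python
def predgen_turn(partial_input, final_input, draft_llm, verify, tts):
    # While the user is still speaking, spend otherwise-idle compute
    # drafting a likely response and pre-synthesizing its audio.
    draft_text = draft_llm(partial_input)
    draft_audio = tts(draft_text)

    # Once the user finishes, reuse the pre-computed result if it is still
    # consistent with the full input; only on a miss do we pay generation
    # and synthesis latency on the critical path.
    if verify(final_input, draft_text):
        return draft_audio
    return tts(draft_llm(final_input))
```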
Mercury Coder is now the most accurate, fastest, and cheapest model for applying edits.
Mercury Coder now supports Apply-Edit capabilities, providing quality on par with GPT-5 at speeds 46x faster!
Sharing Shufan's earlier work on LaViDa. Great to see the fantastic growth in the field in such a short time!
📢(1/11) Diffusion LMs are fast and controllable at inference time! But why restrict such benefits to processing text data? We are excited to announce LaViDa, one of the first and fastest large diffusion LMs for vision-language understanding!!
A few months ago, @li78658171 released LaViDa, the first vision-language model based on discrete diffusion (#NeurIPS2025 spotlight). And today, LaViDa goes Omni! Shufan's latest work on LaViDa-O shows dLLMs can serve as an extremely efficient and high-quality backbone for multimodal AI.
(1/n) We are excited to announce LaViDa-O, a state-of-the-art unified diffusion LM for image understanding, generation, and editing. Building on our NeurIPS Spotlight submission LaViDa, LaViDa-O offers up to a 6.8x speedup compared with AR models while maintaining high output quality.
Traditional speculative decoding assumes drafters that are smaller than the target model (the verifier). What if we could use a larger draft model to better predict the next token? This only makes sense if the drafter runs faster than the verifier. In new work led by @danielisrael, we introduce adaptive parallel decoding (APD), a new approach that makes this possible by using intermediate tokens from (large, parallelizable) diffusion LLMs as speculations for autoregressive models. With an enhanced verification scheme that interpolates between the drafter and verifier distributions, the approach delivers both high quality and speed. Also a spotlight paper at #NeurIPS2025!
🔦Adaptive Parallel Decoding (APD) has been accepted as a spotlight paper at @NeurIPSConf ! I thank my collaborators, reviewers, and program organizers for this honor. A thread for those interested 🧵 (1/n)
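A heavily simplified sketch of the verification step, assuming per-token probabilities from both models are available; the blending and acceptance rule here are illustrative stand-ins for the speculative-decoding-style check, not APD's exact algorithm.

```python
import numpy as np

def verify_draft_block(draft_block, verifier_probs, drafter_probs,
                       alpha=0.5, rng=np.random.default_rng()):
    # `draft_block` holds tokens proposed in parallel by the diffusion drafter;
    # verifier_probs[i] / drafter_probs[i] are the autoregressive verifier's
    # and drafter's probabilities for token i. Tokens are accepted left to
    # right; alpha=1 trusts the verifier alone, alpha=0 the drafter alone.
    accepted = []
    for tok, p_v, p_d in zip(draft_block, verifier_probs, drafter_probs):
        # Interpolate between the verifier and drafter distributions.
        p_mix = alpha * p_v + (1.0 - alpha) * p_d
        if rng.random() < min(1.0, p_mix / max(p_d, 1e-9)):
            accepted.append(tok)   # token survives verification
        else:
            break                  # reject here and fall back to the verifier
    return accepted
```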
Congrats @hbXNov for leading this work!
This work is accepted to @NeurIPSConf 2025!
Aditya Grover retweeted
Inpainting-Guided Policy Optimization for Diffusion Large Language Models