Inception · Nov 6, 2025 · 5:15 PM UTC

Inception

Pinned Tweet

Inception

@_inception_ai

Nov 6

Mercury is refreshed – with across-the-board improvements in coding, instruction following, math, and knowledge recall. Start building responsive, in-the-flow AI solutions! Read more: inceptionlabs.ai/blog/mercur…

131

Elon Musk · Nov 7, 2025 · 11:47 AM UTC

Inception retweeted

Elon Musk

@elonmusk

Nov 7

Replying to @StefanoErmon @_inception_ai

Diffusion will obviously work on any bitstream. With text, since humans read from first word to last, there is just the question of whether the delay to first sentence for diffusion is worth it. That said, the vast majority of AI workload will be video understanding and generation, so good chance diffusion is the biggest winner overall. Also means that the ratio of compute to memory bandwidth will increase.

127

198

2,174

Samar Khanna · Nov 6, 2025 · 10:23 PM UTC

Inception retweeted

Samar Khanna @samar_a_khanna

Nov 6

Mercury is now much better at agentic tasks! You can try out the blazing speed of our dLLMs on your coding agents 😎 Here's a little zombie shooter game I spun up using Mercury with @goose_oss . Go ahead and diffuse w goose @_inception_ai

Inception · Nov 6, 2025 · 5:24 PM UTC

Inception

@_inception_ai

Nov 6

Mercury runs five times faster than Claude 4.5 Haiku at less than one-fourth the price, while maintaining higher quality.

Inception · Nov 6, 2025 · 2:01 PM UTC

Inception

@_inception_ai

Nov 6

Today’s LLMs are painfully slow and expensive. They are autoregressive and spit out words sequentially. One. At. A. Time. Our dLLMs generate text in parallel, delivering answers up to 10X faster. Now we’ve raised $50M to scale them. Full story from @russellbrandom in @TechCrunch. techcrunch.com/2025/11/06/in…

Inception raises $50 million to build diffusion models for code and text | TechCrunch

Diffusion models already power AI image generators, but Inception thinks they can be even more powerful applied in software development.

techcrunch.com

409

Inception · Oct 28, 2025 · 6:08 PM UTC

Inception

@_inception_ai

Oct 28

🚀We've partnered with ProxyAI! Our Mercury Coder dLLM is now the default for ProxyAI's autocomplete, next edit, and auto apply tooling, providing developers with lightning-fast and accurate code edits. Read more: tryproxy.io/blog/proxyai-inc… #AI #DiffusionModels #dLLM

ProxyAI - Blog

tryproxy.io

Andrej Karpathy · Oct 20, 2025 · 6:58 PM UTC

Inception retweeted

Andrej Karpathy

@karpathy

Oct 20

Nice, short post illustrating how simple text (discrete) diffusion can be. Diffusion (i.e. parallel, iterated denoising, top) is the pervasive generative paradigm in image/video, but autoregression (i.e. go left to right bottom) is the dominant paradigm in text. For audio I've seen a bit of both. A lot of diffusion papers look a bit dense but if you strip the mathematical formalism, you end up with simple baseline algorithms, e.g. something a lot closer to flow matching in continuous, or something like this in discrete. It's your vanilla transformer but with bi-directional attention, where you iteratively re-sample and re-mask all tokens in your "tokens canvas" based on a noise schedule until you get the final sample at the last step. (Bi-directional attention is a lot more powerful, and you get a lot stronger autoregressive language models if you train with it, unfortunately it makes training a lot more expensive because now you can't parallelize across sequence dim). So autoregression is doing an `.append(token)` to the tokens canvas while only attending backwards, while diffusion is refreshing the entire token canvas with a `.setitem(idx, token)` while attending bidirectionally. Human thought naively feels a bit more like autoregression but it's hard to say that there aren't more diffusion-like components in some latent space of thought. It feels quite possible that you can further interpolate between them, or generalize them further. And it's a component of the LLM stack that still feels a bit fungible. Now I must resist the urge to side quest into training nanochat with diffusion.

Nathan Barry

@nathanbarrydev

Oct 20

BERT is just a Single Text Diffusion Step! (1/n) When I first read about language diffusion models, I was surprised to find that their training objective was just a generalization of masked language modeling (MLM), something we’ve been doing since BERT from 2018. The first thought I had was, “can we finetune a BERT-like model to do text generation?”

269

576

5,217

GIF

Inception · Oct 10, 2025 · 9:28 PM UTC

Inception

@_inception_ai

Oct 10

Our CEO @StefanoErmon joined the Infinite Curiosity Podcast and shared how our Mercury diffusion LLMs deliver faster, cheaper models and why diffusion is reshaping coding, reasoning, and multimodal AI. Thanks for having him on @PrateekVJoshi! piped.video/watch?v=BaZT4aQI…

Diffusion LLMs - The Fastest LLMs Ever Built | Stefano Ermon,...

Stefano Ermon is the cofounder of Inception Labs and an associate professor at Stanford. Inception is developing a new type of AI models called Diffusion LLM...

youtube.com

Inception · Oct 7, 2025 · 3:14 PM UTC

Inception

@_inception_ai

Oct 7

We’re in! We are now part of the #AWSGenAIAccelerator2025. We’re looking forward to working with @AWSstartups to help us deliver ultra-fast and efficient diffusion large language models.

Inception · Oct 1, 2025 · 10:33 PM UTC

Inception

@_inception_ai

Oct 1

Honored that our co-founder @adityagrover_ has been named to the 2025 Mayfield | Divot AI List! Thank you to @MayfieldFund and @StartupGrind for the recognition alongside 50 innovators shaping the future of AI. See the full list here: divot.org/list

Mayfield | Divot AI List — Divot: Make Your Mark

The AI List will spotlight change makers and rising stars: startups, builders, researchers, industry leaders, media voices, policymakers, and more.

divot.org

Inception · Sep 25, 2025 · 11:59 PM UTC

Inception

@_inception_ai

Sep 25

Replying to @_inception_ai @continuedev

For more info, check out our blog: inceptionlabs.ai/blog/ultra-…

Ultra-Fast Apply-Edit with Mercury Coder - Inception

We are leveraging diffusion technology to develop a new generation of LLMs. Our dLLMs are much faster and more efficient than traditional auto-regressive LLMs. And diffusion models are more accurate,...

inceptionlabs.ai

Inception · Sep 25, 2025 · 11:57 PM UTC

Inception

@_inception_ai

Sep 25

You can use Mercury Coder’s Apply-Edit functionality immediately through the @continuedev extension on VS Code. Apply-Edit is also available through the Inception Platform: platform.inceptionlabs.ai #dLLM #InceptionAI #ApplyEdit

Inception · Sep 25, 2025 · 11:57 PM UTC

Inception

@_inception_ai

Sep 25

Apply-Edit entails integrating suggested code changes into existing code. With its diffusion-based, parallel generation process, Mercury Coder strictly dominates both frontier LLMs and specialized Apply-Edit models in speed, quality, and cost.

Inception · Sep 25, 2025 · 11:57 PM UTC

Inception

@_inception_ai

Sep 25

Mercury Coder now supports Apply-Edit capabilities, providing quality on par with GPT-5 at speeds 46x faster!

GIF

Inception · Sep 8, 2025 · 7:08 PM UTC

Inception

@_inception_ai

Sep 8

Our cofounder @adityagrover_ is one of MIT Technology Review's 35 Innovators Under 35, which recognizes the top young innovators around the world!!

MIT Technology Review

@techreview

Sep 8

Introducing: This year's list of 35 Innovators Under 35. Every year, MIT Technology Review recognizes 35 extraordinary young people brimming with ideas for how to crack tough problems—all of whom are under the age of 35. Get to know them all: trib.al/5cVFFbL

Inception · Aug 30, 2025 · 11:14 PM UTC

Inception

@_inception_ai

Aug 30

🚀Our CEO @StefanoErmon recently spoke on @latentspacepod about Mercury - our family of game-changing diffusion LLMs! 📊 The numbers speak for themselves: 1000+ tokens/second 5-10x faster than speed-optimized models #1 for speed & quality on Copilot Arena Parallel token prediction is the future. Check out the interview👇 piped.video/watch?v=2fDBeMu6… Try it: platform.inceptionlabs.ai | chat.inceptionlabs.ai #AI #dLLM #Mercury

⚡️Mercury: Ultra-Fast Diffusion LLMs — Estefano Ermon, CEO Inception...

https://arxiv.org/abs/2506.17298We present Mercury, a new generation of commercial-scale large language models (LLMs) based on diffusion. These models are pa...

youtube.com

Continue · Aug 20, 2025 · 4:00 PM UTC

Inception retweeted

Continue

@continuedev

Aug 20

Code editing just got a lot faster.⚡Introducing Next Edit⚡ Next Edit delivers real-time, multi-line suggestions at up to 1100 tokens/sec, powered by @inceptionAILabs Mercury Coder, the world’s first commercial-scale diffusion LLM for code.

126

GIF

Inception · Aug 18, 2025 · 5:52 PM UTC

Inception

@_inception_ai

Aug 18

Read more here: inceptionlabs.ai/inception-a… Here's our AWS listing: aws.amazon.com/marketplace/s…

Inception · Aug 18, 2025 · 5:52 PM UTC

Inception

@_inception_ai

Aug 18

4. Batch Processing: Use Bedrock’s batch processing APIs to get a 10x throughput improvement for large-scale tasks.

Inception · Aug 18, 2025 · 5:52 PM UTC

Inception

@_inception_ai

Aug 18

3. Unified API: Access Mercury models alongside a wide array of other Bedrock foundation models through a single, consistent API, simplifying development and integration.