Been thinking of writing a longer blog post about it... specialized compute and general compute both have to deal with the same memory hierarchy (registers, scratchpad, L1, L2, near DRAM, far DRAM), and both pack a ton of flops. You get better utilization of those flops if your workload can stay in the closest memory for as long as possible, and that's what's primarily driving a ton of the innovation. While the core math engine looks more or less the same in GPUs vs specialized AI hardware, the CUDA/SIMT control plane enables a ton of creativity in exploiting the memory hierarchy, and it costs relatively little to keep in hardware. Rather than thinking specialized vs general, we should think in terms of a math engine, a memory hierarchy, and a control plane.
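To make the "keep the flops fed from the closest memory" point concrete, here's a rough roofline-style sketch in Python. The peak-flops and bandwidth numbers are made-up assumptions for illustration, not any particular chip.

```python
# Rough roofline-style sketch: why staying in close memory matters.
# The hardware numbers below are illustrative assumptions, not a real chip.

PEAK_FLOPS = 300e12      # assumed peak: 300 TFLOP/s
DRAM_BW    = 2e12        # assumed DRAM bandwidth: 2 TB/s

def matmul_arithmetic_intensity(m: int, n: int, k: int, bytes_per_elem: int = 2) -> float:
    """FLOPs per byte moved if A (m x k), B (k x n), C (m x n) each touch DRAM once."""
    flops = 2 * m * n * k
    bytes_moved = (m * k + k * n + m * n) * bytes_per_elem
    return flops / bytes_moved

def attainable_flops(intensity: float) -> float:
    """Roofline: you're either bandwidth-bound or compute-bound."""
    return min(PEAK_FLOPS, intensity * DRAM_BW)

for size in (128, 1024, 8192):
    ai = matmul_arithmetic_intensity(size, size, size)
    util = attainable_flops(ai) / PEAK_FLOPS
    print(f"{size}^3 matmul: {ai:7.1f} flops/byte -> at most {util:5.1%} of peak from DRAM")
```

Small problems are bandwidth-bound and big ones are compute-bound under these assumed numbers, which is why tiling and blocking to keep working sets in the closer levels of the hierarchy dominates kernel design on both kinds of hardware.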
On the surface you’d think that the convergence of model architecture to the Transformer would open the door for specialized hardware.
But somehow it feels like general purpose hardware (GP in GPGPU) is more useful now than ever.
Like back in the RNN and conv days it was relatively uncommon to need a new kernel, whereas now specialized kernels for new models are way more common.
I think it’s in part thanks to languages like Triton that make it easier, and in part that the hardware has gotten so fast that the overhead of implementing your SSM or attention in high-level ops is too high.
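For a sense of what Triton buys you, here's a minimal sketch of a fused row-wise softmax kernel, roughly in the style of the Triton tutorials; the block size, contiguous row layout, and test shapes are my assumptions, and it needs a CUDA device to run.

```python
# Minimal fused row-wise softmax in Triton (sketch, assumes contiguous rows).
import torch
import triton
import triton.language as tl

@triton.jit
def softmax_kernel(out_ptr, in_ptr, n_cols, BLOCK_SIZE: tl.constexpr):
    # Each program instance handles one row; the row stays in registers/SRAM,
    # so DRAM is touched once on the way in and once on the way out.
    row = tl.program_id(0)
    cols = tl.arange(0, BLOCK_SIZE)
    mask = cols < n_cols
    x = tl.load(in_ptr + row * n_cols + cols, mask=mask, other=-float("inf"))
    x = x - tl.max(x, axis=0)          # numerically stable softmax
    num = tl.exp(x)
    out = num / tl.sum(num, axis=0)
    tl.store(out_ptr + row * n_cols + cols, out, mask=mask)

def softmax(x: torch.Tensor) -> torch.Tensor:
    n_rows, n_cols = x.shape
    out = torch.empty_like(x)
    BLOCK_SIZE = triton.next_power_of_2(n_cols)
    softmax_kernel[(n_rows,)](out, x, n_cols, BLOCK_SIZE=BLOCK_SIZE)
    return out

x = torch.randn(64, 1000, device="cuda")
torch.testing.assert_close(softmax(x), torch.softmax(x, dim=-1))
```

The row never round-trips to DRAM between the max, exp, and sum, which is exactly the kind of memory-hierarchy exploitation the control plane makes cheap to express.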
But also there’s just a lot of interesting research and algorithmic change that needs custom kernels: MoEs, low-precision matmuls, variations on attention, linear state space models, …
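To give a flavor of why MoEs in particular push people toward custom kernels, here's a naive top-k routing pass written only in high-level PyTorch ops; the expert count, top_k, and dimensions are arbitrary assumptions for illustration. The per-expert loop with gather/scatter is the part that grouped-GEMM and fused dispatch kernels exist to replace.

```python
# Naive top-k MoE dispatch in plain PyTorch (illustrative sizes, not a real model).
import torch

def moe_forward(x, gate_w, expert_ws, top_k=2):
    """x: (tokens, d), gate_w: (d, n_experts), expert_ws: (n_experts, d, d)."""
    scores = torch.softmax(x @ gate_w, dim=-1)              # routing probabilities
    weights, experts = torch.topk(scores, top_k, dim=-1)    # top-k experts per token
    weights = weights / weights.sum(dim=-1, keepdim=True)   # renormalize the chosen experts
    out = torch.zeros_like(x)
    # The per-expert loop plus boolean gather/scatter below is what a fused
    # grouped-GEMM kernel collapses into a single pass over the tokens.
    for e in range(expert_ws.shape[0]):
        for slot in range(top_k):
            hit = experts[:, slot] == e
            if hit.any():
                out[hit] += weights[hit, slot].unsqueeze(-1) * (x[hit] @ expert_ws[e])
    return out

x = torch.randn(16, 32)
gate_w = torch.randn(32, 4)
expert_ws = torch.randn(4, 32, 32)
print(moe_forward(x, gate_w, expert_ws).shape)  # torch.Size([16, 32])
```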