Distinguished Engineer @nvidia; working on Tile IR. Prev. Co-founder @octoml. PhD @uwcse. Attempting to write about AI @ deepdreams.substack.com/

Seattle, WA
Joined June 2011
I have been verbally sharing my perspective & insights on the AI market having worked in the space for 6 years now. Yet another person told me I should be writing more often. Stay tuned for more! open.substack.com/pub/deepdr…
Jared Roesch retweeted
I’m working on a new thing, we’re so back…
Introducing OlmoEarth 🌍, state-of-the-art AI foundation models paired with ready-to-use open infrastructure to turn Earth data into clear, up-to-date insights within hours—not years.
Jared Roesch retweeted
Our team at @nvidia is looking for summer interns to work on challenging research problems in gaming, vision, and AI! Gaming Research: nvidia.eightfold.ai/careers/… Cross-Disciplinary Vision Science: nvidia.eightfold.ai/careers/…
Jared Roesch retweeted
1.1M tokens/sec on just one rack of GB300 GPUs in our Azure fleet. An industry record made possible by our longstanding co-innovation with NVIDIA and expertise of running AI at production scale! techcommunity.microsoft.com/…
Jared Roesch retweeted
i got an idea
It’s always funny what people hone in on, only 4 of these are “top” from a CS perspective 😂
OpenAI is absolutely hunting and grabbing its employees from the top universities: 1. Stanford — 422 2. UC Berkeley — 316 3. MIT — 230 4. Carnegie Mellon — 198 5. Harvard — 144 6. UCLA — 106 7. USC — 83 8. Columbia — 82 9. NYU — 81 10. Cornell — 79
Jared Roesch retweeted
Replying to @kyliebytes
crazy to hate millennials while wishing it were the literal year of the millennial
Jared Roesch retweeted
📢Excited to introduce Apache TVM FFI, an open ABI and FFI for ML systems, enabling compilers, libraries, DSLs, and frameworks to naturally interop with each other. Ship one library across pytorch, jax, cupy etc and runnable across python, c++, rust tvm.apache.org/2025/10/21/tv…
Jared Roesch retweeted
🚀Excited to launch FlashInfer Bench. We believe AI has the potential to help build LLM systems. To accelerate the path, we need an open schema for critical workloads and an AI-driven virtuous circle. First-class integration with FlashInfer, SGLang and vLLM support👉
🤔 Can AI optimize the systems it runs on? 🚀 Introducing FlashInfer-Bench, a workflow that makes AI systems self-improving with agents: - Standardized signature for LLM serving kernels - Implement kernels with your preferred language - Benchmark them against real-world serving workloads - Fastest kernels get day-0 integrated into production First-class integration with FlashInfer, SGLang (@lmsysorg), and vLLM (@vllm_project) at launch🙌 Blog post: flashinfer.ai/2025/10/21/fla… Leaderboard: bench.flashinfer.ai/
Jared Roesch retweeted
Headed to #OpenSourceAIWeek and #PyTorchCon? Our engineer, Anish Maddipoti, highlights his top 5 must-attend developer meetups and coding events: 1️⃣ Infra at scale with @dstackai & Lambda Labs 2️⃣ Happy hour trivia with @DeepInfra, @vllm_project, NVIDIA 3️⃣ Hands-on fine-tune with @UnslothAI & @MistralAI 4️⃣ @GPU_MODE IRL Hackathon (Friday!) 5️⃣ AI Plumbers Unconference with Sanjay 🙌 We will see you there! See our live blog for coverage at Open Source AI Week: blogs.nvidia.com/blog/open-s…
Jared Roesch retweeted
📢 The Fundamental Generative AI Research (GenAIR) team at NVIDIA is looking for outstanding candidates to join us as summer 2026 interns. Apply via: nvidia.wd5.myworkdayjobs.com… Email: genair-openings@nvidia.com Group website: research.nvidia.com/labs/gen… 👇
All these takes are correct. Students are missing that "using AI" is not "what it takes to build AI". The effortless vibe-coded app you built is running on a huge stack built by "useless theory" and the deep engineering that goes into the software stacks and chips it all runs on.
This is not a particularly good take and is indicative of a fundamental misunderstanding of what a top-tier technical college education is supposed to offer. Preparing to understand modern AI as a Harvard or Stanford undergrad is not about learning "prompt engineering", vibe coding, or building Slop Domain-Specific Wrapper Agent #1000, all of which can be picked up in a few days if not hours. To the contrary, the best way for a smart 18-22 year-old to understand AI is to develop a very solid intuition for undergraduate and graduate level probability, linear algebra, and classical ML. If you actually know how foundational RL topics like Q-learning work, you are 95% of the way there, and if you can't even learn that from Harvard or Stanford then this is probably a skill issue on your end. In @boazbaraktcs's excellent ML theory seminar in 2021, I don't think I wrote more than 200 lines of code cumulatively in the entire semester yet I learned an immense amount and credit that class for sparking my interest in modern AI. A year ago I couldn't coherently tell you what a transformer was, but it doesn't matter, because when you develop proper quantitative foundations in college you can figure it out in a couple of weeks. None of this stuff is really that complicated, people just like to pretend that it is.
Jared Roesch retweeted
📣Hiring! Two opportunities 1. Research internship@MSR AI Interaction and Learning (Current PhD students) 2. Multiple positions in my lab @UTiSchool on LLM Personalization/Human-AI Alignment (Prospective PhD students) Details in thread below👇
Jared Roesch retweeted
The Spatial Intelligence Lab at NVIDIA (research.nvidia.com/labs/sil…) is looking for 2026 research interns! We do all kinds of cool work across graphics/vision, geometry, physics, & ML. Now is the time to apply & reach out! nvidia.eightfold.ai/careers/… (not limited to Canada-only)
Jared Roesch retweeted
I've never seen the Python community embrace any tool faster than they did with uv. uv is likely the best Python tool of the last few years. If you aren't using it yet, stop what you are doing and look into it. If you are already a user, check out the attached cheatsheet.
Jared Roesch retweeted
Sometimes we forget that NVIDIA wins because it's a software company. DGX Spark is a reminder of that. It's a CUDA dev machine that's beautiful enough and small enough to be on my desk and with enough memory to fit a truckload of params. It's not the fastest or best at anything, but it's great to develop on and transfer your final training run to a H/B200, final robotics policy to your Jetson, final inference to {nvidia/apple/amd/[favorite vendor]}.
✨ We were honored to deliver some of the very first NVIDIA DGX Sparks to AI Pioneer @ylecun and AI Researcher, @soumithchintala, from @Meta and @NYUniversity in NYC. “Every PhD student in AI should have one of these,” said Yann. We couldn’t agree more. We are anticipating great things from their visionary AI research. Learn more: nvda.ws/42EGKOM #SparkSomethingBig 💫
Jared Roesch retweeted
ml compiler engineers breaking their backs so ml engineers can write slop without being punished for it
Fun tensor-puzzle in the wild in the recent anthropic blog post. Can anyone do it in 1 line? anthropic.com/engineering/a-…
Jared Roesch retweeted
A familiar pattern seems to be playing out with agent frameworks: everyone is building opinionated end-to-end solutions that bundle everything together (design, test, deploy, observe). But if we’ve learned anything from previous platform shifts, this might be the wrong approach…
The AI hype cycle is pushing false scarcity, especially in talent/ability/skill. If you rewind a decade, there were on the order of 100s of people working on AI systems. Today it is at least a few orders of magnitude more people. There are so many talented people; demand is just much higher.
there was ~1 guy at openai responsible for inference CUDA kernels let’s call him Bob, people would refer to his attention kernel as “the Bob kernel” it executed probably trillions of times a day on hundreds of thousands of GPUs one singular guy
I'll be giving a talk on cuTile / Tile IR at #PyTorch2025. Look forward to seeing you there!
Join NVIDIA at #PyTorch2025 📣 Discover how @PyTorch and NVIDIA accelerate research, computing, #datascience & #AI innovation. Connect live with experts, watch technical sessions, and see how we’re driving the future of AI. Register now 👉 nvda.ws/4n9Zg9H