AI Research & Engineering @GroqInc you can just ~ship things!

Joined July 2025
Shaunak Joshi retweeted
I met someone this week who shared that in the 90s Bain paid employees to make slides for customer presentations. This was pre-PowerPoint so the job was literally using an exacto knife and ruler to cut and splice together materials for overhead transparencies. As soon as laptops, PowerPoint and digital projectors became widespread this job went away. Somehow the consulting industry is 10x the # of employees that it was then 🤣 What Aaron is describing is how almost every technology adoption cycle has played out. No evidence to suggest AI will happen differently
In 5 years from now, probably 95% of the tokens used by AI agents will be used on tasks that humans never did before. I just met with about 30 enterprises across 2 days and a dinner, and some of the most interesting use-cases that keep coming up for AI agents are on bringing automated work to areas that the companies would not have been able to apply labor to before. Most of the world hasn’t quite caught on to this point yet. We imagine AI as dropping into today’s workflows and just taking what we already do and making it more efficient by 20% or something. Yet most companies realize that most of the time they’re doing far less than they could because of the cost or limited capacity of talent. This shows up in different ways across every industry. In real estate it’s ideas like being able to read and analyze every lease agreement for every trend and business opportunity possible. In life sciences it’s being able to rapidly do drug discovery or improve quality by looking through errors in data. In financial services it’s being able to look through all past deals and figure out better future monetization. In legal it’s being able to execute on contracts or legal work for previously unprofitable segments or projects. And these are just the Box AI use cases that deal with documents and content. The same is going to be true in coding, where companies tackle software projects they wouldn’t have done before. Security of all systems and events they couldn’t get to. And so on. If you are working on AI Agents right now, the big opportunity is to bring enterprises “work” for problems that they couldn’t do before because it was nearly impossible to afford or scale. And if you’re deploying AI agents in an enterprise, consider what things you’d do more of (or differently) if the cost and speed of labor became 100X cheaper and faster. This is going to get you the real upside of automation.
41
81
14
1,452
The fact that 3IATLAS performed a non-gravitational maneuver while behind the Sun (where we couldn’t observe it) is fascinating. JPL engineers only detected it afterward through trajectory analysis. 🤔👽
🚨: According to a NASA engineer 3I/ATLAS performed a non gravitational maneuver while passing behind the Sun Davide Farnocchia, a navigation engineer at NASA’s Jet Propulsion Laboratory in Pasadena, provided the first evidence of the object’s unexplained motion. Data revealed that 3IATLAS experienced a radial acceleration of 135 kilometers per day squared away from the Sun along with a transverse acceleration of 60 kilometers per day squared.
1
Shaunak Joshi retweeted
What is @OpenAI’s Responses API, and should you use it instead of Chat Completions? 🤔 TL;DW: → Built for agents (& remote MCPs 🤫) → Better streaming control → Better structured + multimodal outputs The Responses API is available now on @GroqInc in beta.
Shaunak Joshi retweeted
MCP, but make it instant.
Shaunak Joshi retweeted
And you can tap into this MCP server via @GroqInc Cloud too!
Here’s a great use case for the Box MCP server. You can now have any documentation, product specs, or any other best practice documents inside Box accessible to coding agents. We used to keep knowledge either in our heads or trapped within the silos of individual pieces of software. In both cases, we underutilized the information that we generate in an enterprise. The power of AI is that we can tap into this knowledge from anywhere, at any time; and it grows in value the more of it we have, which historically was the opposite of what happened.
2
14
Heard Block’s CTO say code quality is overrated. Not quite, human-readable quality is overrated. Machine-verifiable quality is all that matters businessinsider.com/block-ct… Code reviews should be: ✅Automated functional tests ✅Automated performance benchmarks ❌Humans debating code style If the tests pass and AI can maintain it, merge it (no human required).
1
Shaunak Joshi retweeted
The guy who makes these, DJ, is legendary.
Vampires. Zombies. Cloud bills. Only one still haunts devs.
3
1
1
19
Shaunak Joshi retweeted
MiniMax-M2 just dropped - 230B MoE with 10B active; built for coding, agents, & tool use; MIT license🔥 > #1 open-source model on Artificial Analysis benchmarks, #5 overall > Excels at multi-file edits, test-repair loops, and BrowseComp tasks > Fast, cheap, deployable - runs like a 10B, thinks like a 200B > Works with transformers, vLLM and SGLang 🤗 huggingface.co/MiniMaxAI/Min…
Finally joined the club and upgraded to Groq orange @sundeep @geeksplainer I just couldn't resist it @RickLamers
6
Shaunak Joshi retweeted
🧠 How can we equip LLMs with memory that allows them to continually learn new things? In our new paper with @AIatMeta, we show how sparsely finetuning memory layers enables targeted updates for continual learning, w/ minimal interference with existing knowledge. While full finetuning and LoRA see drastic drops in held-out task performance (📉-89% FT, -71% LoRA on fact learning tasks), memory layers learn the same amount with far less forgetting (-11%). 🧵:
53
300
40
1,900
Shaunak Joshi retweeted
Important for the industry. Amazing to see openbench by @AarushSah_ and @GroqInc team drive the evals behind this work, so everyone can run the exact same evals open source from a single source of truth. This is our process internally for driving and prioritizing quality, great to see it reflected in our inclusion as an exacto provider. Quality delivers results.
Introducing `:exacto`, Precision Tool-Calling Endpoints OpenRouter now offers a curated subset of providers on top open source models with measurably higher tool calling accuracy delivering more reliable tool use.
3
8
30
Shaunak Joshi retweeted
One of these models is faster ⚡ and doesn’t have you buying a 2-month-old ticket to Budapest. ✈️
Shaunak Joshi retweeted
This is an awesome use-case for evals. customers should always get the highest quality implementations of models, and data is the best way to differentiate. glad that openbench was useful here 🫡🫡
Introducing `:exacto`, Precision Tool-Calling Endpoints OpenRouter now offers a curated subset of providers on top open source models with measurably higher tool calling accuracy delivering more reliable tool use.
2
4
42
The AI whale 🐳
Deepseek released a new top trending open-source OCR model and paper and crossed 100,000 AI builder followers on @huggingface (from less than 5,000 beginning of the year)!
1
3
Shaunak Joshi retweeted
The first fantastic paper on scaling RL with LLMs just dropped. I strongly recommend taking a look and will be sharing more thoughts on the blog soon. The Art of Scaling Reinforcement Learning Compute for LLMs Khatri & Madaan et al.
20
196
13
1,258
Remember when we said ‘code should be self-documenting’? AI called our bluff by reading it and explaining it back to us in the form of generated PR descriptions
1
Best way to remove cognitive load off of your users and reduce model choice fatigue!
The main breakthrough of GPT-5 was to route your messages between a couple of different models to give you the best, cheapest & fastest answer possible. This is cool but imagine if you could do this not only for a couple of models but hundreds of them, big and small, fast and slow, in any language or specialized for any task - all at inference time. This is what we're introducing with HuggingChat Omni, powered by over 100 open-source models including gpt-oss, deepseek, qwen, kimi, smolLM, gemma, aya and many more already! And this is just the beginning as there are over 2 millions open models not only for text but image, audio, video, biology, chemistry, time-series and more on @huggingface!
Shaunak Joshi retweeted
Here's how you can add web search to @Kimi_Moonshot Kimi K2 on @GroqInc with just a few lines of code! This uses @ExaAILabs remote MCP server, which gives Kimi search superpowers. All with one Responses API Call!
Shaunak Joshi retweeted
PSA: Our intern @DylanMitic (the people's prince) screwed up again and reduced prices for GPT OSS models on @GroqInc, but we're rolling with it. 20-40% price reductions + 50% off cached tokens with prompt caching for 20B (120B coming soon). Fast and even more affordable. 🫡
intern accidentally priced @openai GPT OSS at 1/100 of anthropic ‼️ bad intern but what's done is done. now go build ¯\_(ツ)_/¯ 120B - $0.15/$0.75 20B - $0.10/$0.50