Replying to @wangzjeff
impressively context-efficient MCP server!
Replying to @wangzjeff
Thank you for keeping the number of MCP tokens low
i'm scarred from the gpt-3 days
Replying to @wangzjeff
Aspect-oriented context weaves new intelligence
Replying to @wangzjeff
close your windows
Replying to @wangzjeff
Context becomes RAM when reasoning becomes real-time.
Replying to @wangzjeff
Yeah, that level of detail is very helpful. @OpenAIDevs should have that in Codex CLI too.
Replying to @wangzjeff
Literally 🎯
Replying to @wangzjeff
yessir
Replying to @wangzjeff
beauty
Replying to @wangzjeff
@jicapal need this
Replying to @wangzjeff
I mean Sonnet 4.5 has a 1m context window so you’re probably fine🙃
Replying to @wangzjeff
Except that, unlike RAM, it's not uniform 😇 LLMs don't process context uniformly: recall is U-shaped across the window, plus perf degrades the more high-similarity or even ambiguous needles are present.
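That U-shape is easy to probe with a needle-in-a-haystack test. A minimal sketch, assuming a hypothetical `query_model` wrapper around whatever completion API you use:

```python
# Plant one "needle" fact at varying depths in filler text and check
# whether the model recalls it. query_model is a placeholder; wire it
# to your own LLM client before running.

FILLER = "The sky was a uniform grey that day. "   # low-information padding
NEEDLE = "The secret passphrase is 'blue-walrus-42'. "
QUESTION = "What is the secret passphrase?"

def build_prompt(total_sentences: int, needle_depth: float) -> str:
    """Place the needle at a fractional depth (0.0 = start, 1.0 = end)."""
    sentences = [FILLER] * total_sentences
    sentences.insert(int(needle_depth * total_sentences), NEEDLE)
    return "".join(sentences) + "\n\n" + QUESTION

def query_model(prompt: str) -> str:
    raise NotImplementedError("call your LLM API here")

def run_probe() -> None:
    # If recall dips around depth 0.5, that's the "lost in the middle" U-shape.
    for depth in (0.0, 0.25, 0.5, 0.75, 1.0):
        answer = query_model(build_prompt(2000, depth))
        print(f"depth={depth:.2f} recalled={'blue-walrus-42' in answer}")
```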
Replying to @wangzjeff
Did you install defrag.exe yet? It can defragment your context and create extra space
Replying to @wangzjeff
💯 This is so true. Providing that context for every code review costs thousands in reasoning tokens and manual bug checks. Kluster.ai automates this in real-time, within your IDE. We handle the context so you save the tokens. 😉 Try it free: kluster.ai
Replying to @wangzjeff
Some might argue it’s always been.
Replying to @wangzjeff
so bullish on ctx engineering, on exa as well!
Replying to @wangzjeff
Great way to visualize it
Replying to @wangzjeff
Defragging context is the new defragging RAM
Replying to @wangzjeff
Or registers?
Replying to @wangzjeff
You’re a genius
Replying to @wangzjeff
So what's the new solid-state drive for long-term memory 😉 Asking cause RAG ain't cutting it for me
Replying to @wangzjeff
Quick question: You posted a few weeks ago about how the H-1B changes wouldn't affect American citizens' chances of being hired at your company. Was that because you knew your HR department is a bunch of cat ladies who do nothing but mash the "reject without email" button all day?
Replying to @wangzjeff
w/ memory (ai memory, not ram) and all the optimization tricks (e.g. sleep-time compute, compression) and the issues they bring (like context rot), context feels more like an hdd than ram to me.
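The HDD analogy maps pretty directly onto a two-tier design: a token-budgeted hot window plus a lossy compressed archive. A minimal sketch, with `summarize` standing in for a real model call and word count as a crude token proxy:

```python
# Hot tier ("RAM"): verbatim messages under a token budget.
# Cold tier ("HDD"): lossy summaries of whatever gets evicted.

from collections import deque

def summarize(text: str) -> str:
    return text[:100] + "..."  # stand-in; a real system would use an LLM

class TieredContext:
    def __init__(self, hot_budget_tokens: int):
        self.hot_budget = hot_budget_tokens
        self.hot: deque[str] = deque()   # fast, exact, capacity-limited
        self.cold: list[str] = []        # big, slow, lossy

    def tokens(self, text: str) -> int:
        return len(text.split())         # crude proxy for a real tokenizer

    def append(self, message: str) -> None:
        self.hot.append(message)
        while sum(self.tokens(m) for m in self.hot) > self.hot_budget:
            evicted = self.hot.popleft()
            self.cold.append(summarize(evicted))  # context rot starts here
```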
Replying to @wangzjeff
The need for a context window, and the fact that what goes in and what stays in has to be managed, is the most obvious evidence that LLMs are not actually intelligent
Replying to @wangzjeff
needs swap
Replying to @wangzjeff
I wish we had more control over the shape of that context. ESPECIALLY after a "summarization event". That process is rough and feels like the model (regardless of origin lab) was lobotomized. A knowledge graph, some Post-its, or even a sort of parallel/separate mini exchange about what is important and what should persist. Not sure if an onion or a tree is the better paradigm, but AI RAM is in short supply.
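The Post-its part, at least, is straightforward to prototype: tag messages as pinned and have the compaction pass summarize everything else around them. A sketch under that assumption, with a stand-in `summarize`:

```python
# Pinned messages survive a summarization event verbatim; the rest are
# collapsed into a single summary message.

from dataclasses import dataclass

@dataclass
class Message:
    text: str
    pinned: bool = False

def summarize(texts: list[str]) -> str:
    return f"[summary of {len(texts)} messages]"  # stand-in for an LLM call

def compact(history: list[Message]) -> list[Message]:
    pinned = [m for m in history if m.pinned]
    rest = [m.text for m in history if not m.pinned]
    summary = [Message(summarize(rest))] if rest else []
    return summary + pinned  # pinned facts come through un-lobotomized
```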
Replying to @wangzjeff
Benchmarking models at the MongoDB hackathon yesterday was fun. I filled up each model from 0 to 10M tokens, then stress-tested them. Very fun
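That fill-then-stress loop is roughly this shape. A hypothetical sketch: `query_model` is a placeholder for your API client, and the 4-chars-per-token padding math is a rough heuristic, not a real tokenizer:

```python
# Grow the prompt in 1M-token steps toward the model's limit and probe
# recall of a fixed fact at each size.

PAD = "Lorem ipsum dolor sit amet. "
FACT = "Remember: the launch code is 7-4-1.\n"

def query_model(prompt: str) -> str:
    raise NotImplementedError("call your LLM API here")

def fill_and_probe(step_tokens: int = 1_000_000, steps: int = 10) -> None:
    for i in range(steps + 1):                      # 0 .. ~10M tokens
        target = i * step_tokens
        padding = PAD * (target * 4 // len(PAD))    # ~4 chars per token
        prompt = FACT + padding + "\nWhat is the launch code?"
        ok = "7-4-1" in query_model(prompt)
        print(f"~{target:,} tokens of fill: recalled={ok}")
```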
Replying to @wangzjeff
how does one visualize context usage like that?
Replying to @wangzjeff
great reference to the new trends