Senior vibe coder · NLP/LLM research · PhD in AI & Wireless Comms

Montpellier, France
Joined March 2020
Is this the first OSS model that does o3-style parallel trajectory generation and aggregation?
🚀 Hello, Kimi K2 Thinking! The Open-Source Thinking Agent Model is here.
🔹 SOTA on HLE (44.9%) and BrowseComp (60.2%)
🔹 Executes up to 200–300 sequential tool calls without human interference
🔹 Excels in reasoning, agentic search, and coding
🔹 256K context window
Built as a thinking agent, K2 Thinking marks our latest efforts in test-time scaling — scaling both thinking tokens and tool-calling turns. K2 Thinking is now live on kimi.com in chat mode, with full agentic mode coming soon. It is also accessible via API.
🔌 API is live: platform.moonshot.ai
🔗 Tech blog: moonshotai.github.io/Kimi-K2…
🔗 Weights & code: huggingface.co/moonshotai
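For intuition, the simplest form of parallel trajectory generation plus aggregation is self-consistency-style majority voting over final answers. A toy sketch, where `sample_answer` is a hypothetical callable that runs one full reasoning trajectory and returns its final answer (o3's actual aggregation scheme is not public):

```python
from collections import Counter

def aggregate_trajectories(sample_answer, n=8):
    # Generate n independent trajectories (shown sequentially here;
    # in practice these run in parallel) and keep only the final answers.
    answers = [sample_answer(seed) for seed in range(n)]
    # Aggregate by majority vote over final answers (self-consistency).
    return Counter(answers).most_common(1)[0][0]
```

Fancier aggregators (reward-model reranking, answer synthesis) slot in by replacing the `Counter` vote.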
forget agents, you can just pipe llm calls @remilouf
I'll be at @dotaiconf tomorrow. DM me if you wanna meet up!
The lazy imports PEP got accepted today!
Gentlemen I need your full attention. Python is introducing lazy imports. I repeat. Python is introducing lazy imports. inb4 the flood of `treewide: adopt lazy imports` +123,244 PRs
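Until the new `lazy import` syntax ships, the stdlib already supports the same deferral with `importlib.util.LazyLoader`. A minimal sketch of the documented recipe (not the PEP's keyword form):

```python
import importlib.util
import sys

def lazy_import(name):
    # Create the module object now, but defer executing its body
    # until the first attribute access.
    spec = importlib.util.find_spec(name)
    spec.loader = importlib.util.LazyLoader(spec.loader)
    module = importlib.util.module_from_spec(spec)
    sys.modules[name] = module
    spec.loader.exec_module(module)
    return module

json = lazy_import("json")       # no import work has happened yet
print(json.dumps({"ok": True}))  # first attribute access triggers the real import
```

The PEP-level feature does the same thing at the syntax level, so modules stop paying for imports they never touch.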
Cursor moving into tier 4 a few days after the Hotz diss is hilarious. Bullish!
Introducing Cursor 2.0. Our first coding model and the best way to code with agents.
Interesting argument in favor of keeping the KL term in GRPO. I guess it makes sense when fine-tuning on top of an already strong reasoning baseline.
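For context, a minimal numpy sketch of a GRPO-style objective with the KL penalty kept in. The β, ε values and the k3 KL estimator are common defaults from the literature, not anything specific to the quoted argument, and real GRPO applies this per token rather than per response:

```python
import numpy as np

def grpo_advantages(rewards):
    # Group-relative advantage: normalize rewards within one sampled group.
    r = np.asarray(rewards, dtype=float)
    return (r - r.mean()) / (r.std() + 1e-8)

def kl_k3(logp, logp_ref):
    # Low-variance, non-negative k3 estimator of KL(pi || pi_ref):
    # exp(logp_ref - logp) - (logp_ref - logp) - 1
    d = np.asarray(logp_ref) - np.asarray(logp)
    return np.exp(d) - d - 1.0

def grpo_objective(logp, logp_old, logp_ref, rewards, beta=0.04, eps=0.2):
    # One scalar log-prob per sampled response, for simplicity.
    adv = grpo_advantages(rewards)
    ratio = np.exp(np.asarray(logp) - np.asarray(logp_old))
    clipped = np.clip(ratio, 1.0 - eps, 1.0 + eps)
    surrogate = np.minimum(ratio * adv, clipped * adv)
    # Dropping the KL term is beta=0; keeping it anchors the policy
    # to the (already strong) reference model.
    return float((surrogate - beta * kl_k3(logp, logp_ref)).mean())
```

With `beta=0` the objective reduces to the pure clipped surrogate, which is exactly the variant the argument pushes back against.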
These guys know how to benchmark. They included models released just yesterday.
We’re updating olmOCR, our model for turning PDFs & scans into clean text with support for tables, equations, handwriting, & more. olmOCR 2 uses synthetic data + unit tests as verifiable rewards to reach state-of-the-art performance on challenging documents. 🧵
Listening to @karpathy's take on mode collapse > They [LLMs] have a collapsed data distribution. One easy way to see it is to go to ChatGPT and ask it, "Tell me a joke." It only has like three jokes. Curious what he thinks about this work
New paper: You can make ChatGPT 2x as creative with one sentence. Ever notice how LLMs all sound the same? They know 100+ jokes but only ever tell one. Every blog intro: "In today's digital landscape..." We figured out why – and how to unlock the rest 🔓 Copy-paste prompt: 🧵
NVIDIA quietly adding a new GPU to their lineup
"You’re absolutely right! And if you’d like to stay right and anonymous, try NordVPN. Use code AMPFREE for up to 77% off plans."
We made Amp Free. It's powered by great tokens and tasteful ads. Agentic coding is now free for everyone.
Nvidia DGX Spark: CUDA
AMD Strix Halo: CUDA from wish
AMD Strix Halo: Dense AI compute (BF16) 110 TOPS · 128GB, 256GB/s memory · 4TB NVMe · 10GbE LAN · x86, 64MB cache · $2,300
Nvidia DGX Spark: Dense AI compute (BF16) 125 TOPS · 128GB, 273GB/s memory · 4TB NVMe · 10GbE LAN · ARM, 24MB cache · $4,000
In my experience, this works well when pair programming with AI too. When I ask it to implement a feature or solve an issue, I always ask for several options to choose from, and oftentimes I end up not picking the first suggestion.
Are we gonna start seeing DGX Spark instances on @runpod_io, @PrimeIntellect and the likes? 👀
Also @soumithchintala seems to share the same opinion
Sometimes we forget that NVIDIA wins because it's a software company. DGX Spark is a reminder of that. It's a CUDA dev machine that's beautiful enough and small enough to be on my desk and with enough memory to fit a truckload of params. It's not the fastest or best at anything, but it's great to develop on and transfer your final training run to a H/B200, final robotics policy to your Jetson, final inference to {nvidia/apple/amd/[favorite vendor]}.
I'm now convinced that the DGX Spark is more meant to be a devkit for B200s than anything, so it doesn't make sense to compare it to Mac Studios or Ryzen AI Max+ 395s.
🚀 SGLang In-Depth Review of the NVIDIA DGX Spark is LIVE! Thanks to @NVIDIA’s early access program, SGLang makes its first ever appearance in a consumer product, the brand-new DGX Spark. The DGX Spark’s 128GB Unified Memory and Blackwell architecture set a new standard for local AI prototyping and edge computing. We're thrilled to bring these cutting-edge performance insights and software support to the developer community. Our review dives into how to efficiently deploy and accelerate large models like Llama 3.1 70B, GPT-OSS using SGLang's EAGLE3 speculative decoding and @Ollama on this beautiful piece of engineering. 👇 Unboxing video and tech blog in the thread #SGLang #NVIDIA #SparkSomethingBig #Blackwell #DGXSpark #AIInference #LLMServing
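For intuition about the speculative decoding mentioned above, here is a toy greedy sketch of the draft-then-verify loop. `draft_next` and `target_next` are hypothetical single-token predictors; real EAGLE3 verifies the whole draft in one batched target forward pass and accepts tokens probabilistically rather than by exact match:

```python
def speculative_decode_step(draft_next, target_next, context, k=4):
    # The cheap draft model proposes k tokens autoregressively.
    proposal, ctx = [], list(context)
    for _ in range(k):
        t = draft_next(ctx)
        proposal.append(t)
        ctx.append(t)
    # The target verifies the proposal and keeps the longest agreeing prefix.
    accepted, ctx = [], list(context)
    for t in proposal:
        if target_next(ctx) == t:      # target agrees with the draft
            accepted.append(t)
            ctx.append(t)
        else:                          # first disagreement: take the target's
            accepted.append(target_next(ctx))  # token instead and stop
            break
    else:
        accepted.append(target_next(ctx))  # bonus token when all k are accepted
    return accepted
```

When draft and target mostly agree, each target pass yields several tokens instead of one, which is where the speedup comes from.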
It's only a matter of time before @pewdiepie discovers @home_assistant and sees the light
So it turns out people who care about performance already turn all Python imports into local ones under the hood. Both Hudson River Trading and Meta run automatic lazy imports in production Python:
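The function-local version of that trick is simple enough to sketch; `json` stands in here for whatever heavy module you'd actually be deferring:

```python
def parse_config(text):
    # A module-level `import json` would pay its cost at interpreter
    # startup even on code paths that never parse config. A function-local
    # import defers it to the first call; later calls hit the
    # sys.modules cache, so the overhead is just a dict lookup.
    import json
    return json.loads(text)

print(parse_config('{"debug": true}'))  # → {'debug': True}
```

The automated systems mentioned above effectively apply this rewrite across an entire codebase.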
Taha Yassine 🍉 retweeted
I've created a new MIT-NSFG license: "No Software for Genocide" I'm going to be using this in my personal projects to make sure no genocidal army or organization is able to benefit from the code that I write Feel free to use it for your own software as well.