🚀 Introducing Qwen3-4B-Instruct-2507 & Qwen3-4B-Thinking-2507 — smarter, sharper, and 256K-ready!
🔹 Instruct: Boosted general skills, multilingual coverage, and long-context instruction following.
🔹 Thinking: Advanced reasoning in logic, math, science & code — built for expert-level tasks.
Both models are more aligned, more capable, and more context-aware.
Huggingface: huggingface.co/Qwen/Qwen3-4B… huggingface.co/Qwen/Qwen3-4B…
ModelScope: modelscope.cn/models/Qwen/Qw… modelscope.cn/models/Qwen/Qw…

Aug 6, 2025 · 4:16 PM UTC

Replying to @Alibaba_Qwen
Qwen knows one thing... open-source the hell out of everything 😂
138
Replying to @Alibaba_Qwen
Qwen now seems to be the open-source leader.
2
54
Replying to @Alibaba_Qwen
I think this 4b model is going to be better than the 20B OSS.
2
1
28
Replying to @Alibaba_Qwen
busy weeks for the Qwen team
11
Replying to @Alibaba_Qwen
Qwen is on 🔥 with open source supremacy
2
Replying to @Alibaba_Qwen
Wow real good job!
2
Replying to @Alibaba_Qwen
Could you compare it with Gemma 4B please? Perhaps it is the biggest competitor in this size range
1
Replying to @Alibaba_Qwen
I didn't even finish downloading the previous one 🥹 thank you!
1
Replying to @Alibaba_Qwen
which subset of GPQA is this?
Replying to @Alibaba_Qwen
looks interesting, but always gotta ask how they stack up in real-world scenarios. benchmarks are one thing, but can they actually tackle tough problems?
Replying to @Alibaba_Qwen
Impressive advancements, yet true intelligence isn't just about scale but the nuanced understanding of context and intent.
Replying to @Alibaba_Qwen
The devil works hard, but the Qwen team works harder
1
1
84
Replying to @Alibaba_Qwen
Please give us Qwen3-8B-Instruct-2507! That's the best size. With 4-bit quantization it is possible to run an 8B model on an Nvidia GPU with just 8 GB VRAM (a typical gamer GPU!) or on a cheap $500 laptop with 16 GB total RAM. And please keep it as free of guardrails as the original!
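The VRAM arithmetic behind this reply can be sketched as a back-of-the-envelope estimate. The flat ~1.5 GB overhead allowance for KV cache, activations, and runtime buffers is an assumption here, not a measured number; real usage varies with context length and runtime.

```python
def quantized_vram_gb(params_billion: float, bits_per_weight: int,
                      overhead_gb: float = 1.5) -> float:
    """Rough VRAM estimate: weight storage plus a flat overhead
    allowance (assumed) for KV cache, activations, and buffers."""
    weight_gb = params_billion * bits_per_weight / 8  # bits -> bytes
    return weight_gb + overhead_gb

# 8B weights at 4-bit are ~4 GB, leaving headroom on an 8 GB GPU;
# at fp16 the same model needs ~16 GB for weights alone.
print(quantized_vram_gb(8, 4))   # 5.5
print(quantized_vram_gb(8, 16))  # 17.5
```

This is why 4-bit is the practical cutoff for 8B models on consumer 8 GB cards, while fp16 inference needs a data-center GPU.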
4
81
Replying to @Alibaba_Qwen
OpenAI: Here is our open-weight model. Qwen:
77
[GIF]
Replying to @Alibaba_Qwen
Finally people can stop talking about OpenAI OSS
42
Replying to @Alibaba_Qwen
Would topical expert models make sense, like one that specializes in Python and a different one that specializes in sales and psychology? I think this could get you 4b or 7b models that perform like bigger models in their specialization.
4
2
1
17
Replying to @Alibaba_Qwen
Yesterday, Silicon Valley cooked. Today, China cooked.
15
Replying to @Alibaba_Qwen
Wow, really good benchmarks for a SLM. Will be insanely useful for Agents and Workflows. As always, an amazing job by the @Alibaba_Qwen team. Thanks guys, you rock!
8
Replying to @Alibaba_Qwen
The new Qwen3-4B-Thinking-2507 model (Q8 quant) on my 8GB laptop GPU is the first small local thinking model to solve my logic game puzzle! Congratulations to @Alibaba_Qwen team!
7
Replying to @Alibaba_Qwen
qwen is dropping everyday 🔥🔥
7
Replying to @Alibaba_Qwen
Fantastic work by the Qwen team! The release of Qwen3-4B-Instruct-2507 and Qwen3-4B-Thinking-2507, with their impressive benchmarks and open-source access, is a game-changer. Excited to see how these models will push the boundaries of AI innovation!
6
Replying to @Alibaba_Qwen
@grok explain this post in Neanderthal language
2
6
Replying to @Alibaba_Qwen
How many models does the Qwen 3 series have at this point?! 2025 has been a monumental year for Alibaba.
3
Replying to @Alibaba_Qwen
How do I convert Qwen/Qwen3-4B-Thinking-2507 to .task format to run in Google Edge Gallery? It's high time this option was provided.
3
Replying to @Alibaba_Qwen
Nice work!
3
Replying to @Alibaba_Qwen
😮 you’re unstoppable 👏👏
3
Replying to @Alibaba_Qwen
@UnslothAI waiting for your fine-tuning notebooks. This is going to be 🔥🔥🔥
3
Replying to @Alibaba_Qwen
Is it available in qwen chat?
3
Replying to @Alibaba_Qwen
How are you doing this qwen? Can you apply this to the new openai model? 😭
3
Replying to @Alibaba_Qwen
thanks bros
2
Replying to @Alibaba_Qwen
Expert systems have always reasoned better than neural nets, and much more cheaply. No reason not to use generative LLM as a front end to an expert system. Best of both worlds.
1
2
Replying to @Alibaba_Qwen
Really hope they drop the 8B model next.
1
Replying to @Alibaba_Qwen
Would love a Coder variant
1
Replying to @Alibaba_Qwen
Curious, why? Where is a 4B-param model being used? What is the use case in general? Robots and edge devices can run larger models, right? Even an RK1 can run larger. Just want to know.
3
1