Today we are releasing Brumby-14B-Base, the strongest attention-free base model around. manifestai.com/articles/rele…

Oct 29, 2025 · 8:00 PM UTC

Replying to @manifest__ai
Congratulations team, definitely gonna try it
Replying to @manifest__ai
Attention-free models like Brumby-14B-Base mark a pivotal shift in AI, slashing computational costs while maintaining high performance through innovative architectures. This could accelerate real-world AI adoption significantly. 🚀
Replying to @manifest__ai
"The initial weights for Brumby-14B-Base came from Qwen3-14B-Base" which used attention, so isn't this more of a fine-tuning without attention than a base model?
Replying to @manifest__ai
You basically reuse the Qwen model weights as initialization. What's the point?
Replying to @manifest__ai
If attention is gone, how does the model do in-context learning in this case?
Replying to @manifest__ai
Mfers on here be like "im solving AGI". No. You're fine tuning a qwen model. I'm convinced there are less than a dozen real ai labs in 2025.