new blog post "There Are No New Ideas In AI.... Only New Datasets" in which i summarize LLMs in exactly four breakthroughs and explain why it was really *data* all along that mattered... not algorithms
62
306
44
2,795
Replying to @jxmnop
Yes, exactly! Coincidentally, I just gave a talk about that today!

Apr 11, 2025 · 2:43 AM UTC

2
37
Replying to @rasbt @jxmnop
So the main recent additions are efficiency tweaks. Like Mixture of Experts (but that's also a few years old) and Multi-head Latent Attention. There are a few ideas like Mamba, Samba etc. but they have not been scaled yet, so it's not yet clear if they can compete with SOTA LLMs
5
Replying to @rasbt @jxmnop
Thanks Sebastian, love your work! Link for others to your full talk LLMs Then and Now: sebastianraschka.com/pdf/sli…
4