This paper introduces FlexiTokens, a language model that learns its own token boundaries and can shift them during finetuning.
Fixed subword tokenizers over-fragment text that looks different from their training data (new domains, languages, or scripts), so models waste compute on long runs of tiny pieces.
FlexiTokens works at the byte level: a lightweight transformer scores possible split points, then bytes are pooled into variable-length segments before the main transformer layers.
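Roughly, the boundary-predictor-plus-pooling step could look like the sketch below. This is a minimal illustration, not the authors' code: the `BoundaryPooler` class, the 0.5 threshold, mean pooling, and the hard thresholding (which in practice would need a differentiable relaxation) are all assumptions made for clarity.

```python
import torch
import torch.nn as nn


class BoundaryPooler(nn.Module):
    """Sketch of a byte-level boundary predictor plus segment pooling (illustrative)."""

    def __init__(self, d_model: int = 64, n_heads: int = 4, n_layers: int = 2):
        super().__init__()
        self.byte_embed = nn.Embedding(256, d_model)                 # one embedding per byte value
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.byte_encoder = nn.TransformerEncoder(layer, n_layers)   # the "lightweight transformer"
        self.boundary_head = nn.Linear(d_model, 1)                   # per-byte split score

    def forward(self, byte_ids: torch.Tensor, threshold: float = 0.5):
        # byte_ids: (seq_len,) long tensor of raw byte values for one sequence
        h = self.byte_encoder(self.byte_embed(byte_ids).unsqueeze(0)).squeeze(0)
        probs = torch.sigmoid(self.boundary_head(h)).squeeze(-1)     # boundary probability per byte
        # Hard split for illustration; a trainable version needs a differentiable relaxation.
        segment_ids = torch.cumsum((probs > threshold).long(), dim=0)
        # Mean-pool the bytes of each segment into a single vector.
        segments = torch.stack([h[segment_ids == s].mean(dim=0)
                                for s in segment_ids.unique()])
        return segments, probs  # segments feed the main LM layers; probs feed the boundary loss
```

Feeding `torch.tensor(list("some text".encode()))` through a module like this yields one pooled vector per predicted segment, so unfamiliar text simply ends up with more or fewer segments instead of exploding into subword fragments.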
Instead of forcing a fixed compression ratio, the authors add a hinge-style loss that penalizes a sequence only when it gets too many splits, leaving the model free to compress further in the other direction.
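The one-sided objective can be sketched in the same spirit. The function name, the `max_boundary_rate` cap, and the use of expected boundary counts are illustrative assumptions rather than the paper's exact formulation; the point is only that the penalty stays at zero while the sequence is under a splitting budget and grows when it over-fragments.

```python
import torch


def one_sided_boundary_penalty(boundary_probs: torch.Tensor,
                               max_boundary_rate: float = 0.25) -> torch.Tensor:
    """Hinge-style penalty on over-splitting (illustrative, not the paper's exact loss).

    boundary_probs: (seq_len,) per-byte boundary probabilities from the predictor.
    max_boundary_rate: assumed cap, e.g. at most roughly 1 split per 4 bytes on average.
    """
    expected_splits = boundary_probs.sum()                # expected number of boundaries
    budget = max_boundary_rate * boundary_probs.numel()   # allowed number of boundaries
    # Penalize only the excess: too few splits costs nothing, so the model can
    # compress more aggressively whenever the data allows it.
    return torch.relu(expected_splits - budget)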
During adaptation this loss lets the boundary predictor loosen or tighten its segmentation, so medical notes, heavily inflected Turkish verbs, or code each get chunk sizes that fit.
Across 6 languages and 7 tasks, the model cuts token counts by up to 2x while improving accuracy by about 10%.
A 1B-parameter version even beats a larger static-BPE setup while running faster, since shorter inputs mean less compute.
The same model handles unseen Urdu script without retraining a tokenizer, showing that the approach is largely language-agnostic.
----
Paper – arxiv.org/abs/2507.12720
Paper Title: "FLEXITOKENS: Flexible Tokenization for Evolving Language Models"