1/n "Less is More" (s1, etc.) vs. "More is More": which mantra is correct for training/fine-tuning large LLMs? In our recent preprint, we reconcile the two; they correspond to different regions of a complex phase diagram.

2/n We show a phase transition for optimal data curation: for strong models, concentrating on difficult samples drives further improvement ("Less is More", as in LIMO). Weaker models, in contrast, benefit from the conventional "More is More", where broad data exposure is essential to learn core capabilities.

Nov 7, 2025 · 12:45 AM UTC
