Assistant professor of computer science @TechnionLive; visiting scholar @KempnerInst 2025-2026.

Joined April 2012
BlackboxNLP will be co-located with #EMNLP2025 in Suzhou this November! 📷This edition will feature a new shared task on circuits/causal variable localization in LMs, details: blackboxnlp.github.io/2025/t… If you're into mech interp and care about evaluation, please submit!
Yonatan Belinkov retweeted
Last day of @emnlpmeeting presenting two more posters with @Itay_itzhak_ and @AdiSimhi
3
42
Yonatan Belinkov retweeted
Excited to announce this year's best paper award: 🏆 "Language Dominance in Multilingual Large Language Models" by Nadav Shani and Ali Basirat 🏆 This paper challenges a common conception that multilingual models perform computation via a dominant language. Congratulations!
5
19
Go work with @sarahwiegreffe , you won’t regret it
I am recruiting 2 PhD students to work on LM interpretability at UMD @umdcs starting in fall 2026! We are #3 in AI and #4 in NLP research on @CSrankings. Come join us in our lovely building just a few miles from Washington, D.C. Details in 🧵
1
21
Yonatan Belinkov retweeted
Next up: Circuit-Tracer: A New Library for Finding Feature Circuits presented by @michaelwhanna
1
4
20
Yonatan Belinkov retweeted
Starting soon! See you in rooms A102-103
3
15
Yonatan Belinkov retweeted
Presenting a poster today at #BlackboxNLP? You are welcome to present in either poster session, or both! See the full program on our website: blackboxnlp.github.io/2025/
2
5
Yonatan Belinkov retweeted
Follow @BlackboxNLP for the live tweeting of the event!
Word cloud for this year's submissions! Excited to see so many interesting topics, and the growing interest in reasoning
4
11
Yonatan Belinkov retweeted
Nadav Shani is giving the first oral presentation of the day: Language Dominance in Multilingual Large Language Models
1
4
12
Yonatan Belinkov retweeted
Amazing work @mtutek @fatemehc__ @anmarasovic and @boknilev 👏🏼👏🏼👏🏼
Very honored to be one out of seven outstanding papers at this years' EMNLP :) Huge thanks to my amazing collaborators @fatemehc__ @anmarasovic @boknilev, this would not have been possible without them!
4
16
Yonatan Belinkov retweeted
Very honored to be one out of seven outstanding papers at this years' EMNLP :) Huge thanks to my amazing collaborators @fatemehc__ @anmarasovic @boknilev, this would not have been possible without them!
8
13
3
110
Yonatan Belinkov retweeted
Excited to announce our oral presentations 📢 Join us on Sunday (November 9th) at room A102-103!
1
6
18
Yonatan Belinkov retweeted
Had great visits at @MITCS, @KempnerInst , and @BrownUniversity (+ @celtics)! Now heading to #EMNLP to present 3 papers: 1️⃣ "Trust Me, I'm Wrong" 2️⃣ Our part in a new multilingual PIQA-style dataset 3️⃣ Our shared-task submission at @BlackboxNLP! Come say hi and let’s chat!
6
49
Yonatan Belinkov retweeted
Going to @emnlpmeeting!!✈️ On November 6th, @Itay_Itzhak_, @FazlBarez, and I will present our work "Trust Me, I'm Wrong: LLMs Hallucinate with Certainty Despite Knowing the Answer" at a poster session findings 2 at 12:30. w\ @GabiStanovsky, and @boknilev. arxiv.org/abs/2502.12964
Yonatan Belinkov retweeted
On my way to @emnlpmeeting 🇨🇳✈️ Happy to chat about hallucinations and model safety 🤖
5
45
Yonatan Belinkov retweeted
On my way to @emnlpmeeting! If you’re into interpretability and/or vision–language models, let’s chat 🤩
4
11
43
On the one hand, oh dear. On the other hand, thank you for being open about this @percyliang and the team. Who knows how frequent that kind of leak is.
Replying to @percyliang
Digging into this, we realized to our horror that we had actually trained on the GSM8K test set. 😱 Due some comedy of errors, we had trained on a version with the wrong formatting, so that made our GSM8K numbers exceptionally bad instead of exceptionally good.🤔
64
Yonatan Belinkov retweeted
LLMs can hallucinate due to different reasons: ❌They don't know (lack of knowledge) ❌ They "know" but are uncertain ❌They "know" and are certain New Extended version of our paper that combines our understanding of hallucination on the knowledge and certainty axis is out🧵
3
11
36
Q: which of these can be checked by an LLM as well as an overly loaded human reviewer? Appropriateness Formatting Length Anonymity Limitations Responsible Checklist Potential Violation Justification Need Ethics Review Ethics Review Justification aclrollingreview.org/reviewe…
1
1
11
Yonatan Belinkov retweeted
.@Cornell is recruiting for multiple postdoctoral positions in AI as part of two programs: Empire AI Fellows and Foundational AI Fellows. Positions are available in NYC and Ithaca. Deadline for full consideration is Nov 20, 2025! academicjobsonline.org/ajo/j…
2
37
6
113