Yonatan Belinkov · May 15, 2025 · 8:12 AM UTC

Yonatan Belinkov

Pinned Tweet

Yonatan Belinkov @boknilev

May 15

BlackboxNLP will be co-located with #EMNLP2025 in Suzhou this November! 📷This edition will feature a new shared task on circuits/causal variable localization in LMs, details: blackboxnlp.github.io/2025/t… If you're into mech interp and care about evaluation, please submit!

BlackboxNLP 2025

The Eight Workshop on Analyzing and Interpreting Neural Networks for NLP

blackboxnlp.github.io

Dana Arad 🛫 EMNLP25 · Nov 9, 2025 · 8:25 AM UTC

Yonatan Belinkov retweeted

Dana Arad 🛫 EMNLP25 @dana_arad4

10h

Last day of @emnlpmeeting presenting two more posters with @Itay_itzhak_ and @AdiSimhi

BlackboxNLP · Nov 9, 2025 · 10:13 AM UTC

Yonatan Belinkov retweeted

BlackboxNLP @BlackboxNLP

Excited to announce this year's best paper award: 🏆 "Language Dominance in Multilingual Large Language Models" by Nadav Shani and Ali Basirat 🏆 This paper challenges a common conception that multilingual models perform computation via a dominant language. Congratulations!

Yonatan Belinkov · Nov 9, 2025 · 3:59 AM UTC

Yonatan Belinkov @boknilev

14h

Go work with @sarahwiegreffe , you won’t regret it

Sarah Wiegreffe @sarahwiegreffe

Nov 4

I am recruiting 2 PhD students to work on LM interpretability at UMD @umdcs starting in fall 2026! We are #3 in AI and #4 in NLP research on @CSrankings. Come join us in our lovely building just a few miles from Washington, D.C. Details in 🧵

BlackboxNLP · Nov 9, 2025 · 2:16 AM UTC

Yonatan Belinkov retweeted

BlackboxNLP @BlackboxNLP

16h

Next up: Circuit-Tracer: A New Library for Finding Feature Circuits presented by @michaelwhanna

BlackboxNLP · Nov 9, 2025 · 12:58 AM UTC

Yonatan Belinkov retweeted

BlackboxNLP @BlackboxNLP

18h

Starting soon! See you in rooms A102-103

BlackboxNLP · Nov 9, 2025 · 12:56 AM UTC

Yonatan Belinkov retweeted

BlackboxNLP @BlackboxNLP

18h

Presenting a poster today at #BlackboxNLP? You are welcome to present in either poster session, or both! See the full program on our website: blackboxnlp.github.io/2025/

BlackboxNLP 2025

The Eight Workshop on Analyzing and Interpreting Neural Networks for NLP

blackboxnlp.github.io

Gabriele Sarti · Nov 9, 2025 · 2:20 AM UTC

Yonatan Belinkov retweeted

Gabriele Sarti @gsarti_

16h

Follow @BlackboxNLP for the live tweeting of the event!

BlackboxNLP @BlackboxNLP

17h

Word cloud for this year's submissions! Excited to see so many interesting topics, and the growing interest in reasoning

BlackboxNLP · Nov 9, 2025 · 2:01 AM UTC

Yonatan Belinkov retweeted

BlackboxNLP @BlackboxNLP

16h

Nadav Shani is giving the first oral presentation of the day: Language Dominance in Multilingual Large Language Models

Technion CS NLP · Nov 7, 2025 · 9:03 AM UTC

Yonatan Belinkov retweeted

Technion CS NLP @Technion_CS_NLP

Nov 7

Amazing work @mtutek @fatemehc__ @anmarasovic and @boknilev 👏🏼👏🏼👏🏼

Martin Tutek @ EMNLP @mtutek

Nov 7

Very honored to be one out of seven outstanding papers at this years' EMNLP :) Huge thanks to my amazing collaborators @fatemehc__ @anmarasovic @boknilev, this would not have been possible without them!

Martin Tutek @ EMNLP · Nov 7, 2025 · 8:57 AM UTC

Yonatan Belinkov retweeted

Martin Tutek @ EMNLP @mtutek

Nov 7

110

BlackboxNLP · Nov 5, 2025 · 3:47 AM UTC

Yonatan Belinkov retweeted

BlackboxNLP @BlackboxNLP

Nov 5

Excited to announce our oral presentations 📢 Join us on Sunday (November 9th) at room A102-103!

Itay Itzhak @ EMNLP 🇨🇳 · Nov 3, 2025 · 2:57 PM UTC

Yonatan Belinkov retweeted

Itay Itzhak @ EMNLP 🇨🇳 @Itay_itzhak_

Nov 3

Had great visits at @MITCS, @KempnerInst , and @BrownUniversity (+ @celtics)! Now heading to #EMNLP to present 3 papers: 1️⃣ "Trust Me, I'm Wrong" 2️⃣ Our part in a new multilingual PIQA-style dataset 3️⃣ Our shared-task submission at @BlackboxNLP! Come say hi and let’s chat!

Adi Simhi ✈️ EMNLP · Nov 3, 2025 · 9:58 AM UTC

Yonatan Belinkov retweeted

Adi Simhi ✈️ EMNLP @AdiSimhi

Nov 3

Going to @emnlpmeeting!!✈️ On November 6th, @Itay_Itzhak_, @FazlBarez, and I will present our work "Trust Me, I'm Wrong: LLMs Hallucinate with Certainty Despite Knowing the Answer" at a poster session findings 2 at 12:30. w\ @GabiStanovsky, and @boknilev. arxiv.org/abs/2502.12964

Adi Simhi ✈️ EMNLP · Oct 31, 2025 · 10:16 AM UTC

Yonatan Belinkov retweeted

Adi Simhi ✈️ EMNLP @AdiSimhi

Oct 31

On my way to @emnlpmeeting 🇨🇳✈️ Happy to chat about hallucinations and model safety 🤖

Dana Arad 🛫 EMNLP25 · Oct 31, 2025 · 10:15 AM UTC

Yonatan Belinkov retweeted

Dana Arad 🛫 EMNLP25 @dana_arad4

Oct 31

On my way to @emnlpmeeting! If you’re into interpretability and/or vision–language models, let’s chat 🤩

BlackboxNLP · Nov 2, 2025 · 2:46 PM UTC

Yonatan Belinkov retweeted

BlackboxNLP @BlackboxNLP

Nov 2

Only one week to go! The list of accepted papers is now up on our website: blackboxnlp.github.io/2025/a…

BlackboxNLP 2025

The Eight Workshop on Analyzing and Interpreting Neural Networks for NLP

blackboxnlp.github.io

Yonatan Belinkov · Oct 31, 2025 · 2:33 AM UTC

Yonatan Belinkov @boknilev

Oct 31

On the one hand, oh dear. On the other hand, thank you for being open about this @percyliang and the team. Who knows how frequent that kind of leak is.

Percy Liang

@percyliang

Oct 29

Replying to @percyliang

Digging into this, we realized to our horror that we had actually trained on the GSM8K test set. 😱 Due some comedy of errors, we had trained on a version with the wrong formatting, so that made our GSM8K numbers exceptionally bad instead of exceptionally good.🤔

Adi Simhi ✈️ EMNLP · Oct 30, 2025 · 12:20 PM UTC

Yonatan Belinkov retweeted

Adi Simhi ✈️ EMNLP @AdiSimhi

Oct 30

LLMs can hallucinate due to different reasons: ❌They don't know (lack of knowledge) ❌ They "know" but are uncertain ❌They "know" and are certain New Extended version of our paper that combines our understanding of hallucination on the knowledge and certainty axis is out🧵

Yonatan Belinkov · Oct 30, 2025 · 3:08 PM UTC

Yonatan Belinkov @boknilev

Oct 30

Q: which of these can be checked by an LLM as well as an overly loaded human reviewer? Appropriateness Formatting Length Anonymity Limitations Responsible Checklist Potential Violation Justification Need Ethics Review Ethics Review Justification aclrollingreview.org/reviewe…

ARR Reviewer Guidelines

A peer review platform for the Association for Computational Linguistics

aclrollingreview.org

Yoav Artzi · Oct 28, 2025 · 5:56 PM UTC

Yonatan Belinkov retweeted

Yoav Artzi

@yoavartzi

Oct 28

.@Cornell is recruiting for multiple postdoctoral positions in AI as part of two programs: Empire AI Fellows and Foundational AI Fellows. Positions are available in NYC and Ithaca. Deadline for full consideration is Nov 20, 2025! academicjobsonline.org/ajo/j…

113