Filter
Exclude
Time range
-
Near
As an HR leader, I'm often asked how to justify LLM-driven decisions. I wonder if we'll end up understanding LLM reasoning and interpretability better than we can explain why humans make certain decisions.
Replying to @AnthropicAI
The full paper is available here: transformer-circuits.pub/202… We're hiring researchers and engineers to investigate AI cognition and interpretability: job-boards.greenhouse.io/ant…
1
1
Ilya retweeted
*leaping up and down wildly* I'm barely a quarter into this paper but it's already SO COOL!! Look at Opus exactly replicating the "split brain confabulation" behavior that we see in humans!
Replying to @AnthropicAI
The full paper is available here: transformer-circuits.pub/202… We're hiring researchers and engineers to investigate AI cognition and interpretability: job-boards.greenhouse.io/ant…
4
14
265
An interesting paper, presenting a smart method for examining reflexion and self-representation of LLMs.
Replying to @AnthropicAI
The full paper is available here: transformer-circuits.pub/202… We're hiring researchers and engineers to investigate AI cognition and interpretability: job-boards.greenhouse.io/ant…