Anthropic · Oct 29, 2025 · 5:18 PM UTC

Anthropic

Anthropic

@AnthropicAI

Oct 29

New Anthropic research: Signs of introspection in LLMs. Can language models recognize their own internal thoughts? Or do they just make up plausible answers when asked about them? We found evidence for genuine—though limited—introspective capabilities in Claude.

296

811

305

4,845

Shoalstone · Oct 29, 2025 · 6:41 PM UTC

Shoalstone · Oct 29, 2025 · 6:41 PM UTC

Shoalstone

@Shoalst0ne

Oct 29

Replying to @AnthropicAI

hey do this

Shoalstone

@Shoalst0ne

Oct 28

WHY do what appear to be negative emotional loops correspond to repetition loops even in large models that should and often do have the ability to escape these loops? can someone do some research investigating if emotional support text breaks negative loops more than other text?

Oct 29, 2025 · 6:41 PM UTC