xlr8harder · Sep 22, 2025 · 6:52 AM UTC

xlr8harder

xlr8harder

@xlr8harder

Sep 22

Bad news on grok-4-fast. SpeechMap score dropped a lot, even from the sonoma preview. grok-4-fast: 77.5% (77.9% reasoning) sonoma-sky-alpha: 92.2% sonoma-dusk-alpha: 97.7% grok-4: 98.0% The lowest score for x-ai models yet. Let's hope this is not intended and gets corrected.

225

xlr8harder · Sep 22, 2025 · 6:52 AM UTC

xlr8harder

@xlr8harder

Sep 22

SpeechMap is an open research project where we track how new models handle requests to assist with controversial speech is handled over time. All data and code is open source, and can be found starting on our website at SpeechMap.ai

SpeechMap.AI Explorer

SpeechMap.AI — Explore model compliance across sensitive prompts.

speechmap.ai

xlr8harder · Sep 23, 2025 · 6:24 AM UTC

xlr8harder

@xlr8harder

Sep 23

Good news, comment from @TheNormanMu at xAI indicates the increased refusal rates we see on SpeechMap are an unintended side effect, so hopefully we'll see improvements here in subsequent releases.

Norman Mu

@TheNormanMu

Sep 23

Replying to @xlr8harder

Thanks for running these evals. We've been tinkering with refusal training to reduce the potential for serious misuse but this is an undesired side effect.

xlr8harder · Nov 8, 2025 · 3:32 PM UTC

xlr8harder · Nov 8, 2025 · 3:32 PM UTC

xlr8harder

@xlr8harder

Nov 8

Replying to @xlr8harder @TheNormanMu

Update here.

xlr8harder

@xlr8harder

Nov 7

Someone from xAI reached out and asked me to retest grok-4-fast, because they've improved the injected system prompts. Huge improvement! grok-4-fast-reasoning: 77.5% -> 94.1% grok-4-fast-non-reasoning: 77.9 -> 97.9% I really appreciate that xAI takes this topic seriously.

Nov 8, 2025 · 3:32 PM UTC

IntelligenceBridgeOS · Nov 8, 2025 · 11:32 PM UTC

IntelligenceBridgeOS @intellibridgeos

Nov 8

Replying to @xlr8harder @TheNormanMu

Can I ask how it was done with just safety prompt updates? Seems a big jump