Doug Safreno · May 7, 2025 · 7:10 PM UTC

Doug Safreno

Doug Safreno

@dougsafreno

May 7

Had a fun chat about evals with Ben on The Chief AI Officer podcast. We discussed: • Why many companies don't have a trustworthy eval stack today • How to create great LLM-as-a-judge evals with an "unfair advantage" • Why "100% accuracy" is often a red flag Links below:

Doug Safreno · May 7, 2025 · 7:10 PM UTC

Doug Safreno · May 7, 2025 · 7:10 PM UTC

Doug Safreno

@dougsafreno

May 7

Spotify: open.spotify.com/episode/4y3… Apple: podcasts.apple.com/us/podcas…

Gentrace’s Doug Safreno on Escaping POC Purgatory with Collaborative AI Evaluation

Podcast Episode · The Chief AI Officer Show · 05/06/2025 · 43m

podcasts.apple.com

May 7, 2025 · 7:10 PM UTC