The paper says AGI should mean balanced skills across all key abilities.
Simple averages hide weak spots, so they mislead about real generality.
The old way of measuring AGI used a simple average of scores across different skills, like reasoning, memory, vision, and language.
This means if a model was amazing at some skills but terrible at others, the good scores could still push its average up, making it look more “general” than it really was.
The new method in this paper changes that by checking how balanced the abilities are, not just how high the average score is.
Instead of one average, it measures many averages under different “strictness” settings, from forgiving (where strong areas can cover weak ones) to strict (where one weak area drags the whole score down).
----
Paper – arxiv. org/abs/2510.20784
Paper Title: "A Coherence-Based Measure of AGI"