Hardware · Oct 23
Rethinking how we measure AI intelligence
Game Arena is a new, open-source platform for rigorous evaluation of AI models. It allows for head-to-head comparison of frontier systems in environments with clear winning conditions.
Covered by 1 source
- Google DeepMind Blog, Oct 23