X @Demis Hassabis
Demis Hassabis·2026-02-02 18:07
The AI field is in need of harder benchmarks to test capabilities of the latest AI models. This update to @Kaggle Game Arena with werewolf and poker (heads-up) plus chess, gives us new objective measures of real-world skills like planning and decision making under uncertainty.Kaggle (@kaggle):📌 Mark Your Calendar: Live Game Arena Event This Monday!We are releasing two new games, Poker and Werewolf, along with an updated Chess leaderboard next Monday, February 2, running daily from 9:30 AM PT to 11:30 AM PT ...