Gemini 3, AI Chess Champion: Game Arena Expands to Poker and Werewolf

Gemini 3 Tops the Game Arena Chess Leaderboard

  • Google DeepMind adds Poker and Werewolf to Game Arena
  • Gemini 3 Pro and Flash sweep all three game leaderboards
  • Three-day livestream featuring Hikaru Nakamura and Doug Polk

What Happened?

Google DeepMind has expanded its AI benchmark platform Game Arena. In addition to chess, they have added Poker and Werewolf.[Google Blog] Gemini 3 Pro and Gemini 3 Flash claimed the top spots in all three games, sweeping the leaderboards.

Poker was played in Heads-Up No-Limit Texas Holdem format. GPT-5.2, Gemini 3, and Claude played 900,000 hands.[Doug Polk] Werewolf is the first team-based game played entirely through natural language, requiring reasoning through dialogue amid imperfect information.

Why Does This Matter?

Chess tests logical thinking. But Poker and Werewolf are different. Poker requires risk management and bluffing, while Werewolf demands social reasoning and persuasion.[ChromeUnboxed] This has become a new standard for evaluating AI soft skills.

Gemini 3 showed significant performance improvement in chess compared to Gemini 2.5. Rapid capability gains between generations were confirmed.[The Decoder] Gemini models are dominating in strategic board games.

What Comes Next?

A three-day livestream tournament ran from February 2 to 4. Chess Grandmaster Hikaru Nakamura and poker legends Liv Boeree and Doug Polk co-hosted.[Kaggle] The final poker leaderboard was revealed on February 4 at kaggle.com/game-arena.

Game Arena is expected to become a standard benchmark for evaluating multifaceted AI capabilities. It tests not just calculation but strategy, psychology, and negotiation skills.

Frequently Asked Questions (FAQ)

Q: Which AI models participated in Game Arena?

A: Major AI models including GPT-5.2, Gemini 3 Pro, Gemini 3 Flash, and Claude participated. The Gemini 3 series ranked first across all games.

Q: How is the Werewolf game played?

A: It is a team-based social deduction game conducted entirely through natural language dialogue. AI models must distinguish between villagers and werewolves through conversation.

Q: Where can I check the Game Arena results?

A: You can view the full leaderboard and game-specific rankings at kaggle.com/game-arena.

Leave a Comment