CS 175 · UC Irvine · AI Project

The Game of 2048, Taught to Think

A deep reinforcement learning experiment pitting three algorithms against each other — DQN vs MCTS vs PPO — to see which one masters the art of exponential tile merging.

↗ Read Final Report ⌥ View Source

PPO reaches tile 2048

The Proximal Policy Optimization agent navigates the 4×4 grid through trial and error, learning corner strategies and merge sequences that build up to the 2048 tile.
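To make "merge sequences" concrete, here is a minimal sketch of the rule behind every move in 2048: tiles in a row slide together, and equal neighbours combine once into a tile of twice the value. The function name `merge_row_left` is hypothetical and chosen for illustration; it is not taken from the project's code or from gymnasium-2048, and it assumes empty cells are encoded as 0.

```python
def merge_row_left(row):
    """Slide one 2048 row to the left and merge equal neighbours once per move.

    Illustrative helper (hypothetical name), assuming empty cells are 0.
    Returns the new row and the score gained from merges.
    """
    tiles = [t for t in row if t != 0]  # drop empty cells so tiles become adjacent
    merged = []
    gained = 0
    i = 0
    while i < len(tiles):
        if i + 1 < len(tiles) and tiles[i] == tiles[i + 1]:
            # Two equal neighbours combine into a single tile of double value.
            merged.append(tiles[i] * 2)
            gained += tiles[i] * 2
            i += 2  # skip both tiles; a merged tile cannot merge again this move
        else:
            merged.append(tiles[i])
            i += 1
    merged += [0] * (len(row) - len(merged))  # pad back to the original width
    return merged, gained


if __name__ == "__main__":
    print(merge_row_left([1024, 1024, 0, 0]))
    print(merge_row_left([2, 2, 2, 2]))
```

Under these assumptions, the other three move directions reduce to the same operation applied to reversed or transposed rows, which is why a single row rule is enough to describe the whole board.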

2048 Best Tile

MCTS & PPO Algorithm

3 Models

Project Reports

03 documents

01 Proposal ↗ 02 Status ↗ 03 Final ↗

References

05 sources

2048 GitHub Repository github.com/Quentin18/gymnasium-2048 ↗
Stable Baselines3 DQN stable-baselines3.readthedocs.io ↗
DeepMind DQN Paper arxiv.org/pdf/1312.5602 ↗
Stanford 2048 Paper arxiv.org/html/2507.05465v1 ↗
Medium: A Puzzle for AI medium.com/data-science ↗


PowerOf2 Source code on GitHub

View Repository ↗

Power Of 2 CS 175: Project in AI
