Reinforcement Learning

Minimax

Minimax is a decision-making algorithm used in adversarial settings where one player tries to maximize their score while the other minimizes it. It is the classical approach for game-playing AI systems.

Understanding Minimax

Minimax applies to adversarial games and other competitive scenarios in which one player tries to maximize a score while the opponent tries to minimize it. The algorithm recursively explores the game tree, evaluating moves and counter-moves to find the best strategy under the assumption that both players play optimally. Alpha-beta pruning is a key optimization: it skips branches of the tree that cannot influence the final decision, which can greatly reduce the number of positions searched without changing the result.

Minimax search was fundamental to early game-playing systems such as Deep Blue, which defeated world chess champion Garry Kasparov in 1997. In modern AI, minimax principles extend beyond board games to adversarial training, robust optimization, and the generator-discriminator dynamics of generative adversarial networks. The algorithm also connects to broader concepts in Markov decision processes and multi-agent systems.
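The recursive search and alpha-beta pruning described above can be sketched in a few lines of Python. This is a minimal illustration, not a production game engine: the nested-list tree representation, the function name `minimax`, and the sample payoffs are all assumptions made for the example. A leaf is a number giving the payoff to the maximizing player, and an inner node is a list of child nodes.

```python
import math


def minimax(node, maximizing, alpha=-math.inf, beta=math.inf):
    """Return the minimax value of `node`, using alpha-beta pruning.

    `node` is either a number (a leaf payoff for the maximizing player)
    or a list of child nodes. This representation is hypothetical and
    chosen only to keep the sketch short.
    """
    if not isinstance(node, list):
        return node  # leaf: payoff is already known

    if maximizing:
        best = -math.inf
        for child in node:
            best = max(best, minimax(child, False, alpha, beta))
            alpha = max(alpha, best)
            if alpha >= beta:
                break  # the minimizer above would never allow this branch
        return best

    best = math.inf
    for child in node:
        best = min(best, minimax(child, True, alpha, beta))
        beta = min(beta, best)
        if alpha >= beta:
            break  # the maximizer above already has a better option
    return best


# Root is a maximizing node with three minimizing children.
tree = [[3, 5], [2, 9], [0, 7]]
value = minimax(tree, maximizing=True)
print(value)
```

In this sample tree, the second child is cut off after its first leaf: once the minimizer finds a payoff of 2, the maximizer's existing guarantee of 3 from the first child makes the rest of that branch irrelevant. Real game programs usually cannot reach terminal positions, so they typically stop at a fixed depth and apply a heuristic evaluation function instead of exact leaf payoffs.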

Related Reinforcement Learning Terms

Deep Reinforcement Learning

Exploration vs Exploitation

Imitation Learning

Inverse Reinforcement Learning

Markov Decision Process

Policy

Q-Learning

Reinforcement Learning