In: Computer Science
Explain, how the Mini-Max algorithm is used in decision-making and game theory. Make sure to explain how this algorithm applies the utility function to get the utility values for the terminal states. Feel free to add any diagram/tree structure to represent all the possible moves that allow a game to move from one state to the next state. Also, discuss how the alpha-beta pruning approach is used for optimization.
Minimax is a kind of backtracking algorithm that is used in decision making and game theory to find the optimal move for a player, assuming that your opponent also plays optimally. It is widely used in two player turn-based games such as Tic-Tac-Toe, Backgammon, Mancala, Chess, etc.
In Minimax the two players are called maximizer and minimizer. The maximizer tries to get the highest score possible while the minimizer tries to do the opposite and get the lowest score possible.
Every board state has a value associated with it. In a given state if the maximizer has upper hand then, the score of the board will tend to be some positive value. If the minimizer has the upper hand in that board state then it will tend to be some negative value. The values of the board are calculated by some heuristics which are unique for every type of game.
Example:
Consider a game which has 4 final states and paths to reach final
state are from root to 4 leaves of a perfect binary tree as shown
below. Assume you are the maximizing player and you get the first
chance to move, i.e., you are at the root and your opponent at next
level. Which move you would make as a maximizing player
considering that your opponent also plays optimally?
Since this is a backtracking based algorithm, it tries all possible moves, then backtracks and makes a decision.
Being the maximizer you would choose the larger value that is 3. Hence the optimal move for the maximizer is to go LEFT and the optimal value is 3.
Now the game tree looks like below :
The above tree shows two possible scores when maximizer makes left and right moves.
Note: Even though there is a value of 9 on the right subtree, the minimizer will never pick that. We must always assume that our opponent plays optimally.
Alpha-Beta pruning is not actually a new algorithm, rather an optimization technique for minimax algorithm. It reduces the computation time by a huge factor. This allows us to search much faster and even go into deeper levels in the game tree. It cuts off branches in the game tree which need not be searched because there already exists a better move available. It is called Alpha-Beta pruning because it passes 2 extra parameters in the minimax function, namely alpha and beta.
Let’s define the parameters alpha and beta.
Alpha is the best value that the
maximizer currently can guarantee at that level or
above.
Beta is the best value that the
minimizer currently can guarantee at that level or
above.