AI Lec03 Adversarial Search


COSC2129

Semester 3, 2024

Artificial Intelligence

Adversarial Search

Lecturer: Thuy Nguyen


Office location: 2.4.27
Road Map for Today
Revision of classic search strategies

Games as adversarial search

Minimax search

How to improve the efficiency of minimax search?


Alpha-beta pruning
Cutting off search
Heuristic evaluation functions

How to deal with games with chances?


Textbook: AIMA, Chapter 6
Classic Search Strategies
Uninformed (blind) search
Breadth-first search (BFS)
Uniform-cost search (UCS)
Depth-first search (DFS)

Depth-limited search (DLS)

Iterative deepening search (IDS)

Bi-directional search (BDS)

Informed (heuristic) search


Greedy search
A* search
Performance of Search Strategies
Completeness: Is it guaranteed to find a solution if one exists?

Optimality: Will it find an optimal solution?

Time complexity: How many nodes get expanded?

Space complexity: How much memory is needed?

Time and space complexity are calculated based on:


b Branching factor

d Depth of the shallowest goal node

m Maximum length of any path in the state space


Properties of Search Strategies
Breadth-first (BFS): FIFO Q, complete (if b is finite), optimal (if step cost is
identical)

Uniform-cost (UCS): Priority Q ordered by g(n), complete (if b is finite and step
cost is positive), optimal

Depth-first (DFS): LIFO Q, non-complete, non-optimal

Depth-limited (DLS): LIFO Q, non-complete, non-optimal

Iterative deepening (IDS): LIFO Q, complete (if b is finite), optimal (if step cost is
identical)

Bi-directional (BDS): complete (if b is finite and both directions are BFS), optimal (if
step cost is identical and both directions are BFS)

Greedy: Priority Q ordered by h(n), complete (if graph search in finite spaces), non-optimal

A*: Priority Q ordered by g(n) + h(n), complete (if b is finite and step cost is positive),
optimal (if h(n) is admissible for tree search or consistent for graph search)
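As a quick refresher, the A* strategy above can be sketched in a few lines. This is a minimal illustration only: the toy graph and heuristic values below are made up for the example, with the heuristic chosen to be admissible (it never overestimates the true cost to G).

```python
import heapq

def a_star(start, goal, neighbors, h):
    """A* search: expand nodes in order of f(n) = g(n) + h(n).

    `neighbors(n)` yields (successor, step_cost) pairs; `h` must be
    admissible for the returned cost to be optimal.
    """
    frontier = [(h(start), 0, start, [start])]   # (f, g, node, path)
    best_g = {start: 0}
    while frontier:
        f, g, node, path = heapq.heappop(frontier)
        if node == goal:
            return g, path
        for succ, cost in neighbors(node):
            g2 = g + cost
            if g2 < best_g.get(succ, float("inf")):
                best_g[succ] = g2
                heapq.heappush(frontier, (g2 + h(succ), g2, succ, path + [succ]))
    return None

# Toy weighted graph and a made-up admissible estimate of distance to 'G'.
graph = {'S': [('A', 1), ('B', 4)], 'A': [('B', 2), ('G', 6)],
         'B': [('G', 2)], 'G': []}
h = {'S': 4, 'A': 3, 'B': 2, 'G': 0}
cost, path = a_star('S', 'G', lambda n: graph[n], lambda n: h[n])
assert cost == 5 and path == ['S', 'A', 'B', 'G']
```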
Properties of Heuristic Functions
Admissibility
h(n) admissible if it never overestimates the cost from n to reach the goal.
This means that A* search approaches the optimal cost C* from below.
Every node with f(n) < C* will be expanded.

Consistency
h(n) ≤ c(n, a, n’) + h(n’), where n’ is any successor of n

Informedness
h2(n) is more informed than h1(n) if h2(n) ≥ h1(n) for all n.
Why is h2(n) better than h1(n)?
Heuristics allow us to prune the search space: if h2(n) ≥ h1(n), then searching
with h2(n) expands only a subset of the nodes expanded with h1(n).
Games
Games as Adversarial Search
8-puzzle problems involve only one player. A game usually has two or more
players, such as chess, tic-tac-toe, checkers, Othello, etc.

Can we formulate game playing as a search problem? YES. It is adversarial
search when the goals of the different players are in conflict.

Can we build a search tree to represent a game from the very beginning to every
possible ending scenario of the game? YES, such as the tree for tic-tac-toe:

[Figure: the tic-tac-toe game tree, level by level: my possible moves, the
opponent's possible moves, my possible moves again, and so on down to the
ending states, whose utility values are -1, 0 or 1. How many nodes are on
each level, and how many levels are there?]
Games as Adversarial Search (cont’d)
It is almost impossible (and also unnecessary) to build a search tree
to represent the entire search space for even a simple game like tic-
tac-toe.

A realistic search method is to find the “best” next move based on the
current state of the game. This move should ideally maximize the
chance of winning.

To find this move, we need to search through the search space (the entire
space?), starting from the current state.

The opponent will reply against our move. Then we perform the search
again based on the new state to find the next “best” move.

This process will repeat until the game is over.

In game playing, minimax search is often used to find next moves.


Definitions
A game can be formally defined as a kind of search problem
with the following elements:

Initial state s0: How the game is set up at the start

Player(s): Who will move in state s

Actions(s): The set of legal moves in state s

Result(s, a): transition model defining the result of a move

Terminal-test(s): true if the game is over (i.e., s is a terminal state)

Utility(s, p): utility value for a game ending at a terminal state s for a
player p
e.g., the utility values for chess are -1, 0 or 1 (zero-sum game)
Minimax Game
Two players: MAX and MIN

Initial state is a MAX node


Children of MAX nodes are MIN (or leaf) nodes
Children of MIN nodes are MAX (or leaf) nodes

Calculate Minimax(s) =
  Utility(s)                       if Terminal-test(s)
  max_a Minimax(Result(s, a))      if Player(s) = MAX
  min_a Minimax(Result(s, a))      if Player(s) = MIN
Calculate in the depth-first manner
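The recursive definition above translates almost directly into code. Below is a minimal sketch using a tiny made-up game (remove 1 or 2 sticks from a pile; whoever takes the last stick wins); the class and method names are illustrative, not from the textbook.

```python
class NimGame:
    """Tiny illustrative game: remove 1 or 2 sticks; taking the last stick wins.
    A state is (sticks_left, player_to_move)."""
    def player(self, s):        return s[1]
    def actions(self, s):       return [a for a in (1, 2) if a <= s[0]]
    def result(self, s, a):     return (s[0] - a, 'MIN' if s[1] == 'MAX' else 'MAX')
    def terminal_test(self, s): return s[0] == 0
    def utility(self, s):
        # The player to move faces an empty pile, so the *previous* mover won.
        return -1 if s[1] == 'MAX' else 1

def minimax(s, game):
    """Minimax(s), computed depth-first exactly as in the definition above."""
    if game.terminal_test(s):
        return game.utility(s)
    values = [minimax(game.result(s, a), game) for a in game.actions(s)]
    return max(values) if game.player(s) == 'MAX' else min(values)

game = NimGame()
assert minimax((3, 'MAX'), game) == -1   # 3 sticks: the mover always loses
assert minimax((4, 'MAX'), game) == 1    # 4 sticks: MAX takes 1 and wins
```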
Minimax Game Example
Ply

Depth limit of minimax games can be specified


via ply:

2-ply: 1 move for me, 1 move for my opponent


4-ply: 2 moves each

2n-ply: n moves each

Puts a depth bound on minimax search


Minimax Search Algorithm
Consider a trivial game that only involves two moves from start to end:

▲ is a “MAX node”. It is MAX ’s turn to move (to find the max value).
▼ is a “MIN node”. It is MIN ’s turn to move (to find the min value).

The bottom nodes are leaf nodes, to which the utility function applies.

The sequence of assigning values to nodes is:


3→b1 , 12→b2 , 8→b3 , 3→B , 2→c1 , 4→c2, 6→c3 , 2→C , 14→d1, 5→d2 , 2→d3 , 2→D, 3→A
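This trivial two-ply tree can be encoded as nested lists of leaf utilities and its minimax value computed directly; the leaf values below are the ones from the slide (B = 3, 12, 8; C = 2, 4, 6; D = 14, 5, 2).

```python
def minimax_value(node, maximizing):
    # A leaf is a plain number; an internal node is a list of children.
    if isinstance(node, (int, float)):
        return node
    values = [minimax_value(child, not maximizing) for child in node]
    return max(values) if maximizing else min(values)

# Root A (MAX) with MIN children B, C, D and the leaf utilities from the slide.
tree = [[3, 12, 8],   # B -> min = 3
        [2, 4, 6],    # C -> min = 2
        [14, 5, 2]]   # D -> min = 2
assert minimax_value(tree, maximizing=True) == 3   # MAX picks move a1 (to B)
```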
Minimax Search Algorithm (cont’d)

In this example, MAX’s best move at the root node is a1 because it


leads to the successor with the highest minimax value. MIN’s best
reply is b1 because it leads to the successor with the lowest utility
value.

Always assume that the opponent will play perfectly (is infallible).


Alpha-Beta Pruning
Alpha-Beta Pruning
Not all branches need expanding.

If one child of a MAX node has value 3, no child of a MIN node (with this
MAX along its path to the root) having value 3 or less will change this.

If one child of a MIN node has value 2, no child of a MAX node (with this
MIN along its path to the root) having value 2 or more will change this.

The highest current value of any MAX node along the path
to the root is alpha.

The lowest current value of any MIN node along the path to
the root is beta.
Alpha-Beta Pruning (cont’d)

Same search procedure as for minimax except:

MIN node: stop expanding if its current value ≤ the alpha of its parent
(this node's final value can't end up higher than alpha)

MAX node: stop expanding if its current value ≥ the beta of its parent
(this node's final value can't end up lower than beta)

This also means that the whole subtree for a


pruned node is not expanded.
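The procedure above can be sketched over the same nested-list trees (a minimal illustration, not a full game implementation). Run on the slide's two-ply tree, it evaluates all of B's leaves, prunes c2 and c3 after seeing c1 = 2, and still expands all of D.

```python
import math

def alphabeta(node, maximizing, alpha=-math.inf, beta=math.inf, expanded=None):
    """Alpha-beta search over a nested-list tree (leaves are numbers).
    `expanded` (optional) collects visited leaves to show the pruning."""
    if isinstance(node, (int, float)):
        if expanded is not None:
            expanded.append(node)
        return node
    if maximizing:
        value = -math.inf
        for child in node:
            value = max(value, alphabeta(child, False, alpha, beta, expanded))
            alpha = max(alpha, value)
            if value >= beta:       # MAX node: current value >= parent's beta
                break
        return value
    value = math.inf
    for child in node:
        value = min(value, alphabeta(child, True, alpha, beta, expanded))
        beta = min(beta, value)
        if value <= alpha:          # MIN node: current value <= parent's alpha
            break
    return value

# The slide's two-ply tree: after seeing c1 = 2 at MIN node C, c2 and c3 are pruned.
tree = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
leaves = []
assert alphabeta(tree, True, expanded=leaves) == 3
assert leaves == [3, 12, 8, 2, 14, 5, 2]   # 4 and 6 were never evaluated
```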
Alpha-Beta Pruning (cont’d)
We use the same two-ply game tree as an example for α-β pruning.

α=3 (at A), β=3 (at B)

- Expand the first branch.
- Get utility values for each terminal node.
- So B’s current utility value (β) is 3.
- So A’s current utility value (α) is 3.
Alpha-Beta Pruning (cont’d)

α=3 (at A)

- Expand node C by c1.
- Reach the terminal node.
- The utility value of this terminal node < α=3.
- So C’s current utility value is 2, and there is no further expansion on C.

A beta-pruning is performed here: a MIN node cannot end up with a higher value
than the alpha of its parent MAX.

What if we expanded by c2 and c3? The corresponding utility values are 4 and 6,
so β of C would remain 2. Minimax search would still not choose C (because B
already has a higher value, 3).

The search outcome is not affected by the utility values of the terminal
nodes reached via c2 and c3, so expanding by c2 and c3 is unnecessary.


Alpha-Beta Pruning (cont’d)
α=3 (at A), β=14 (at D)

- Expand node D by d1.
- Reach the terminal node.
- The utility value of this terminal node > α=3.
- So D’s current utility value is 14; continue the expansion on D.

α=3 (at A), β=5 (at D)

- Expand node D by d2.
- Reach the terminal node.
- The utility value of this terminal node > α=3.
- So D’s current utility value is 5; continue the expansion on D.

Can we stop the expansion at d2? No! What if d3 were 2, 4 or 10?
Alpha-Beta Pruning (cont’d)

α=3 (at A), β=2 (at D)

- Expand node D by d3, the final branch.
- Reach the terminal node.
- The utility value of this terminal node < α=3.
- So D’s current utility value is 2, and there is no further expansion on D.
- Identify a1 as the next move.

The outcome of this alpha-beta pruning is the same as that from minimax, but
fewer nodes are expanded, so alpha-beta pruning is more efficient.

However, the expansion of d1 and d2 turns out not to be worthwhile: the low
value of d3 brings the value of D down to 2.
Alpha-Beta Pruning (cont’d)

Quality of pruning depends on the order of


expansion

Try to generate children in “desired” order

Can reduce minimax from O(b^m) to O(b^(m/2))

Reduces the effective branching factor from b to √b

Searches a tree twice as deep as minimax in the same amount of time
Search for Games

Set up a game as a kind of search tree

Use minimax method to choose moves

Use alpha-beta pruning to cut down alternatives

Some implementation tricks: transposition


tables as hash tables.
Search for Games (cont’d)

Games involve search

Search space can be huge (e.g., chess: about 10^40 states)

Not all moves may be chosen by the opponent(s)

Time limit may degrade search

Exhaustive search is not how humans do it!


Search for Games (cont’d)

Assume optimal opponent

Prune search space

Use heuristic approximation of the expected


utility

Deal with chance and imperfect information


Heuristic Evaluation Function

Idea goes back to Claude Shannon in 1950(!)

Limit depth and estimate utility via a


heuristic evaluation function (educated guess)

Calculate H-Minimax(s, d) =
  Eval(s)                                 if Cutoff-test(s, d)
  max_a H-Minimax(Result(s, a), d+1)      if Player(s) = MAX
  min_a H-Minimax(Result(s, a), d+1)      if Player(s) = MIN
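A depth-limited sketch of this idea on the same nested-list trees: the cutoff test here is simply a depth bound, and `avg_leaves` is a crude, made-up evaluation function standing in for the "educated guess".

```python
def h_minimax(node, maximizing, depth, eval_fn, cutoff_depth):
    """Depth-limited minimax over nested-list trees (leaves are numbers):
    apply Eval(s) at the cutoff instead of searching to terminal states."""
    if isinstance(node, (int, float)):
        return node                               # terminal leaf: true utility
    if depth >= cutoff_depth:                     # Cutoff-test(s, d)
        return eval_fn(node)
    values = [h_minimax(c, not maximizing, depth + 1, eval_fn, cutoff_depth)
              for c in node]
    return max(values) if maximizing else min(values)

def avg_leaves(node):
    """Made-up evaluation: recursively averages the values below a node."""
    if isinstance(node, (int, float)):
        return node
    vals = [avg_leaves(c) for c in node]
    return sum(vals) / len(vals)

tree = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
# Cutting off at depth 1 scores B, C, D heuristically instead of searching them.
assert h_minimax(tree, True, 0, avg_leaves, 2) == 3            # full search
assert abs(h_minimax(tree, True, 0, avg_leaves, 1) - 23/3) < 1e-9
```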


Heuristic Evaluation Function (cont’d)
So what is a good evaluation function?

Computationally efficient

Match the ranking order of the utility values of


terminal states
e.g., winning better than draws better than losses

Strongly correlate with actual winning chances on


non-terminal states

Will have some level of uncertainty (as it is an


estimate)
Heuristic Evaluation Function (cont’d)

Often a weighted sum of various features:

Eval(s) = w1f1(s) + w2f2(s) + … + wnfn(s)

Assumes features are independent.

Example chess function:


9*#Queens + 4*#Bishops + 3*#Knights + 1*#pawns

Features and weights are NOT part of rules of


chess, but come from previous experience.
Heuristic Evaluation Function (cont’d)

For chess the weights and factors could be:


w1 f1(s) = 9 * (number of white queens – number of black queens)
w2 f2(s) = 4 * (number of white bishops – number of black bishops)
w3 f3(s) = 3 * (number of white knights – number of black knights)
w4 f4(s) = 1 * (number of white pawns – number of black pawns)
… and so on.

By this simple function we can estimate which player is closer to winning.
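Such a material-count evaluation can be sketched as follows, using the weights from the slide (queen 9, bishop 4, knight 3, pawn 1); the dictionary interface is an illustrative assumption, not a standard one.

```python
# Weights in the spirit of the slide (queen 9, bishop 4, knight 3, pawn 1).
WEIGHTS = {'queen': 9, 'bishop': 4, 'knight': 3, 'pawn': 1}

def material_eval(white, black):
    """Eval(s) = sum_i w_i * (white_i - black_i); positive favours white.
    `white`/`black` map piece names to counts (illustrative interface)."""
    return sum(w * (white.get(p, 0) - black.get(p, 0))
               for p, w in WEIGHTS.items())

# White is a pawn up but a knight down: 1*1 - 3*1 = -2, so black is ahead.
score = material_eval({'queen': 1, 'bishop': 2, 'knight': 1, 'pawn': 8},
                      {'queen': 1, 'bishop': 2, 'knight': 2, 'pawn': 7})
assert score == -2
```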


Heuristic Evaluation Function (cont’d)

Apply only to quiescent states (not ‘I am about to


lose my queen!’)

Can check for non-quiescent states (i.e., something


is under attack) and insist that they are expanded

See also: the horizon effect and singular extensions.


Different Types of Games

                Perfect Information                          Imperfect Information

Deterministic   Chess, Draughts (checkers), Go,              Kriegspiel, Battleship
                Othello, Noughts and Crosses
                (tic-tac-toe), …

Stochastic      Backgammon, Monopoly                         Bridge, 500, Scrabble
Stochastic Games

Involving some “chance” factors:


Dice
Deal in a card game
Tossing a coin

Backgammon:
Moves are deterministic, but which ones are legal is
determined by the roll of two dice

Require chance nodes in the game tree


Stochastic Games (cont’d)

Calculate ExpectMinimax(s) =
  Utility(s)                                if Terminal-test(s)
  max_a ExpectMinimax(Result(s, a))         if Player(s) = MAX
  min_a ExpectMinimax(Result(s, a))         if Player(s) = MIN
  sum_r P(r) * ExpectMinimax(Result(s, r))  if Player(s) = CHANCE
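ExpectMinimax can be sketched over a tagged tree in which chance nodes carry (probability, subtree) pairs; the coin-flip tree below is a made-up illustration, not from the slides.

```python
def expectiminimax(node):
    """ExpectMinimax over a tagged tree: ('max'|'min'|'chance', children),
    where chance children are (probability, subtree) pairs and leaves
    are plain numbers (utilities)."""
    if isinstance(node, (int, float)):
        return node                      # Utility(s)
    kind, children = node
    if kind == 'max':
        return max(expectiminimax(c) for c in children)
    if kind == 'min':
        return min(expectiminimax(c) for c in children)
    # Chance node: probability-weighted average of the outcomes.
    return sum(p * expectiminimax(c) for p, c in children)

# A coin flip (p = 0.5 each) decides which MIN subtree MAX ends up in.
tree = ('max', [
    ('chance', [(0.5, ('min', [3, 5])), (0.5, ('min', [1, 9]))]),  # 0.5*3 + 0.5*1 = 2
    ('chance', [(0.5, ('min', [2, 6])), (0.5, ('min', [0, 8]))]),  # 0.5*2 + 0.5*0 = 1
])
assert expectiminimax(tree) == 2.0   # MAX prefers the first chance node
```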

Chance nodes complicate alpha-beta pruning

Can use Monte Carlo (random) choices over a large


enough sample
State-of-the-art Game Programs
Chess
Deep Blue (1997)
-- alpha-beta search on lots of specialist processors
-- Typically 14-ply, sometimes up to 40-ply (!!)
-- Opening book of 4000 positions
-- Database of 700,000 grandmaster games
-- Endgame database of all 5 piece checkmates
-- A good PC with the right program can match a human world champion …

Draughts: Chinook, using alpha-beta search, beat human master in 1990; Now plays
perfectly with a vast endgame database

Othello: Computers too good for humans

Backgammon: Computers competitive with humans

Go: Humans still ahead (branching factor up to 361 …); Monte Carlo methods used
Conclusions
Revision of classic search strategies

Games as adversarial search

Minimax search: assuming optimal players

How to improve the efficiency of minimax search?

Alpha-beta pruning
Cutting off search
Heuristic evaluation functions

How to deal with games with chance?


Questions?
