2. Games with ω-regular winning conditions

Mikołaj Bojańczyk

zajęcia / courses

2. Games with ω-regular winning conditions

In this lecture, we consider games played by two players (called 0 and 1), which are zero-sum, perfect information, and most importantly, of potentially infinite duration. Suppose that $W \subseteq \Sigma^\omega$ is a set of $\omega$ -words. Define a game with winning condition $W$ to be:

• a directed graph, not necessarily finite, whose vertices will be called positions of the game;
• a distinguished initial position;
• partition of the positions into positions controlled by player 0 and positions controlled by player 1;
• a labelling function that maps each position to a label from $\Sigma$ .

The game is played as follows. The game begins in the initial position. The player who controls the initial position chooses an outgoing edge, leading to a new position. The player who controls the new position chooses an outgoing edge, leading to a new position, and so on. If the play reaches a position with no outgoing edges, then the player who controls the position loses immediately. Otherwise, the play continues forever, and yields an infinite path. By applying the labelling function to the path, we get an word in $\Sigma^\omega$ ; if this word belongs to $W$ then player 0 wins, otherwise player 1 wins.

When formalizing the notions of the above paragraph, one uses the concept of a strategy. A strategy for player $i \in \set{0,1}$ is a function which inputs a history of the play so far (a path from the initial position to some position controlled by player $i$ ), and outputs the new position (consistent with the edge relation in the graph). Given strategies for both players, call these $\sigma_0$ and $\sigma_1$ , a unique play is determined, which is either a finite path ending in a terminal position (no outgoing edges), or an infinite path. This play is called winning for player $0$ if it is finite and ends in a terminal position controlled by the opposing player $1$ ; or if it is infinite and satisfies the winning condition after applying the labelling function. Otherwise, the play is winning for player $1$ . A winning strategy for player $i$ is defined to be a strategy $\sigma_i$ such that for every possible strategy $\sigma_{1-i}$ of the opponent, the resulting play is winning for player $i$ .

Determinacy. A game is called determined if one of the players has a winning strategy. Clearly it cannot be the case that both players have winning strategies. One could be tempted to think that, because of the perfect information, one of the players must have a winning strategy. However, because of the infinite duration, one can come up with strange games (e.g. using the axiom of choice) which are not determined because none of the players has a winning strategy.

The goal of this lecture is to show a theorem by Büchi and Landweber: if the winning condition of the game is recognised by an automaton, then the game is determined, and furthermore the winning player has a finite memory winning strategy, in the following sense.

Finite memory strategy. Consider a game where the positions are $V$ . Let $i$ be one of the players. A strategy for player $i$ with memory $M$ is given by:
• a deterministic automaton with states $M$ and input alphabet $V$ ; and
• for every position $v$ controlled by $i$ , a function $f_v$ from $M$ to the neighbors of $v$ .
The two ingredients above define a strategy for player $i$ in the following way: the next move chosen by player $i$ in a position $v$ is obtained by applying the function $f_{v}$ to the state of the automaton after reading the history of the play, including $v$ . We will apply this definition also to games with infinitely many positions, but we will only care about finite memory sets $M$ .

An important special case is when the set $M$ has only one element, in which case the strategy is called memoryless. In this case, the new position chosen by the player only depends on the current position, and not on the history of the game before that.

Theorem. (Büchi-Landweber) For every $\omega$ -regular language $W$ there exists a finite set $M$ such that for every game with winning condition $W$ , one of the players has a winning strategy that uses memory $M$ .

The proof of the above theorem has two parts. The first part is to identify a special case of games with $\omega$ -regular winning conditions, called parity conditions. Define the $n$ -rank parity condition to be the set of $\omega$ -words over the alphabet $\set{1,\ldots,n}$ where the smallest number appearing infinitely often is even. A parity game is a game where the winning condition is the $n$ -rank parity language for some $n$ .

Parity games are important because not only can they be won using finite memory strategies, but even memoryless strategies are enough:

Theorem 1. For every parity game, one of the players has a memoryless winning strategy.

Theorem 1 is proved here. The second step of the Büchi-Landweber theorem is the reduction to parity games. This essentially boils down to transforming deterministic Muller automata into something called deterministic parity automata. In a parity automaton, there is a ranking function from states to numbers, and a run is considered accepting if the minimal rank appearing infinitely often is even. This is a special case of the Muller condition, but it turns out to be expressively complete in the following sense:

Theorem 2. For every deterministic Muller automaton, there exists an equivalent deterministic parity automaton.

Theorem 2 is proved here. Let us now combine the two theorems to get the Büchi-Landweber theorem. Consider a game with an $\omega$ -regular winning condition $L \subseteq \Sigma^\omega$ . By Theorem 2, there is a deterministic parity automaton which recognises the language $L$ . Consider a new game, call it the product game, where the positions are pairs (position of the original game, state of the deterministic parity automaton). This is a parity game, with the ranks inherited from the automaton. In a position $(v,q)$ , the player controlling position $v$ chooses an edge in the original game, and the state is updated deterministically according to the transition function of the automaton. It is not difficult to see that the following conditions are equivalent for every position $v$ of the original game and every player $i \in \set{0,1}$ :
1. player $i$ wins from position $v$ in the original game;
2. player $i$ wins from position $(v,q)$ in the product game, where $q$ is the initial state of the automaton.

The implication from 1 to 2 crucially uses determinism of the automaton and would fail if a nondeterministic automaton were used (under an appropriate definition of a product game). Since the product game is a parity game, for every position $v$ , condition 2 must hold for either player 0 or 1; furthermore, a positional strategy in the product game corresponds to a finite memory strategy in the original game, where the memory is the states of the automaton.

COMMENTS

hub

November 1, 2019

We consider games played on graphs equipped with costs on edges, and introduce two winning conditions, cost-parity and cost-Streett, which require bounds on the cost between requests and their responses.

zajęcia / courses

2. Games with ω-regular winning conditions

2. Games with ω-regular winning conditions

Leave a Reply