A Monte Carlo Algorithm for Time-Constrained General Game Playing

Victor Scherer Putrich; Anderson Rocha Tavares; Felipe Meneguzzi

Victor Scherer Putrich PUCRS
Anderson Rocha Tavares UFRGS
Felipe Meneguzzi PUCRS / University of Aberdeen

Abstract

General Game Playing (GGP) is a challenging domain for AI agents, as it requires them to play diverse games without prior knowledge. In this paper, we develop a strategy to improve move suggestions in time-constrained GGP settings. This strategy consists of a hybrid version of UCT that combines Sequential Halving and UCB, favoring information acquisition at the root node rather than overspending time on the most rewarding actions. Empirical evaluation using a GGP competition scheme from the Ludii framework shows that our strategy improves the average payoff over the entire competition set of games. Moreover, our agent makes better use of extended time budgets when they are available.
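As a rough illustration of the root-level idea mentioned in the abstract, the following Python sketch shows generic Sequential Halving over a set of root actions. It is not the authors' implementation: the function name `sequential_halving`, the callback `sample_reward`, and the choice to accumulate statistics across rounds are assumptions made for this example. In a hybrid agent, `sample_reward` would presumably run one UCB-guided simulation below the chosen root action and return its payoff.

```python
import math
from typing import Callable, List, Sequence


def sequential_halving(
    actions: Sequence[int],
    sample_reward: Callable[[int], float],
    budget: int,
) -> int:
    """Return the action that survives repeated halving of the candidates.

    `sample_reward` is a hypothetical callback standing in for one
    simulation through the subtree of a root action.
    """
    survivors: List[int] = list(actions)
    if not survivors:
        raise ValueError("need at least one action")
    # Number of halving rounds needed to reduce the set to a single action.
    rounds = max(1, math.ceil(math.log2(len(survivors))))
    totals = {a: 0.0 for a in survivors}
    counts = {a: 0 for a in survivors}
    while len(survivors) > 1:
        # Split the budget evenly across rounds, then across survivors,
        # so every remaining action gets the same number of samples.
        per_action = max(1, budget // (len(survivors) * rounds))
        for a in survivors:
            for _ in range(per_action):
                totals[a] += sample_reward(a)
                counts[a] += 1
        # Keep the better half (rounded up) by empirical mean reward.
        survivors.sort(key=lambda a: totals[a] / counts[a], reverse=True)
        survivors = survivors[: math.ceil(len(survivors) / 2)]
    return survivors[0]
```

Because each round samples all surviving actions equally, weaker-looking actions still receive simulations early on, which is the information-acquisition behavior the abstract contrasts with spending most of the time on the currently most rewarding actions.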

Springer (English)

Published

25/09/2023

How to Cite

PUTRICH, Victor Scherer; TAVARES, Anderson Rocha; MENEGUZZI, Felipe. A Monte Carlo Algorithm for Time-Constrained General Game Playing. In: BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 12., 2023, Belo Horizonte/MG. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2023. p. 97-111. ISSN 2643-6264.