A Monte Carlo Algorithm for Time-Constrained General Game Playing

  • Victor Scherer Putrich PUCRS
  • Anderson Rocha Tavares UFRGS
  • Felipe Meneguzzi PUCRS / University of Aberdeen

Abstract


General Game Playing (GGP) is a challenging domain for AI agents, as it requires them to play diverse games without prior knowledge. In this paper, we develop a strategy to improve move suggestions in time-constrained GGP settings. The strategy is a hybrid version of UCT that combines Sequential Halving and UCB, favoring information acquisition at the root node rather than overspending time on the most rewarding actions. Empirical evaluation using a GGP competition scheme from the Ludii framework shows that our strategy improves the average payoff over the entire competition set of games. Moreover, our agent makes better use of extended time budgets when they are available.
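To make the root-level idea concrete, the following is a minimal sketch of plain Sequential Halving over a set of root actions. It is an illustration under stated assumptions, not the authors' implementation: the function name `sequential_halving`, the `simulate` callback, and the toy win probabilities are hypothetical. In a hybrid of the kind the abstract describes, each call to `simulate` would presumably run one playout through a subtree that selects moves with UCB; here it is replaced by a toy reward function.

```python
import math
import random


def sequential_halving(actions, simulate, budget):
    """Pick an action by repeatedly sampling all survivors and keeping the better half.

    actions  -- candidate root actions (hypothetical hashable labels)
    simulate -- callback returning a reward for one playout of an action (assumption)
    budget   -- approximate total number of playouts to spend
    """
    survivors = list(actions)
    if len(survivors) == 1:
        return survivors[0]

    rounds = math.ceil(math.log2(len(survivors)))
    totals = {a: 0.0 for a in survivors}
    counts = {a: 0 for a in survivors}

    for _ in range(rounds):
        # Split this round's share of the budget evenly among the survivors.
        pulls = max(1, budget // (len(survivors) * rounds))
        for a in survivors:
            for _ in range(pulls):
                totals[a] += simulate(a)
                counts[a] += 1
        # Keep the better half, ranked by empirical mean reward.
        survivors.sort(key=lambda a: totals[a] / counts[a], reverse=True)
        survivors = survivors[: math.ceil(len(survivors) / 2)]

    return survivors[0]


if __name__ == "__main__":
    # Toy example: each action wins a playout with a fixed, made-up probability.
    win_prob = {"a": 0.2, "b": 0.7, "c": 0.5, "d": 0.4}
    rng = random.Random(0)
    choice = sequential_halving(
        list(win_prob), lambda a: float(rng.random() < win_prob[a]), budget=400
    )
    print("chosen action:", choice)
```

Unlike UCB, which concentrates playouts on the action that currently looks best, this scheme gives every surviving action the same number of playouts in each round, which is one way to favor information acquisition at the root.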

Published
25/09/2023
PUTRICH, Victor Scherer; TAVARES, Anderson Rocha; MENEGUZZI, Felipe. A Monte Carlo Algorithm for Time-Constrained General Game Playing. In: BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 12., 2023, Belo Horizonte/MG. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2023. p. 97-111. ISSN 2643-6264.