Identifying and Tracking Switching, Non-stationary Opponents: a Bayesian Approach

Abstract

In many situations, agents are required to use a set of strategies (behaviors) and switch among them during the course of an interaction. This work focuses on the problem of recognizing the strategy used by an agent within a small number of interactions. We propose using a Bayesian framework to address this problem. Bayesian policy reuse (BPR) has been empirically shown to be efficient at correctly detecting the best policy to use from a library in sequential decision tasks. In this paper we extend BPR to adversarial settings, in particular, to opponents that switch from one stationary strategy to another. Our proposed extension enables learning new models in an online fashion when the learning agent detects that the current policies are not performing optimally. Experiments presented in repeated games show that our approach is capable of efficiently detecting opponent strategies and reacting quickly to behavior switches, thereby yielding better performance than state-of-the-art approaches in terms of average rewards.

Publication
Workshop on Multiagent Interaction without Prior Coordination at AAAI

Documentation: https://wowchemy.com/docs/managing-content/

title: ‘Identifying and tracking switching, non-stationary opponents: A Bayesian approach’ subtitle: ’' summary: ’' authors:

  • Pablo Hernandez-Leal
  • Matthew E Taylor
  • Benjamin Rosman
  • L Enrique Sucar
  • Enrique Munoz De Cote tags: [] categories: [] date: ‘2016-01-01’ lastmod: 2022-09-17T14:22:55+02:00 featured: false draft: false

Featured image

To use, add an image named featured.jpg/png to your page’s folder.

Focal points: Smart, Center, TopLeft, Top, TopRight, Left, Right, BottomLeft, Bottom, BottomRight.

image: caption: ’' focal_point: ’' preview_only: false

Projects (optional).

Associate this post with one or more of your projects.

Simply enter your project’s folder or file name without extension.

E.g. projects = ["internal-project"] references content/project/deep-learning/index.md.

Otherwise, set projects = [].

projects: [] publishDate: ‘2022-09-17T12:22:53.803356Z’ publication_types:

  • ‘1’ abstract: ’' publication: ‘Workshops at the Thirtieth AAAI Conference on Artificial Intelligence

Benjamin Rosman
Benjamin Rosman
Lab Director

I am a Professor in the School of Computer Science and Applied Mathematics at the University of the Witwatersrand in Johannesburg. I work in robotics, artificial intelligence, decision theory and machine learning.