Who Should I Trust? Cautiously Learning with Unreliable Experts

Abstract

An important problem in reinforcement learning is the need for greater sample efficiency. One approach to this problem is to incorporate external information elicited from a domain expert into the learning process. Indeed, incorporating expert advice has been shown to improve the rate at which an agent’s policy converges. However, these approaches typically assume a single, infallible expert; learning from multiple and/or unreliable experts is considered an open problem in assisted reinforcement learning. We present CLUE (cautiously learning with unreliable experts), a framework for learning single-stage decision problems with action advice from multiple, potentially unreliable experts. CLUE augments an unassisted learner with a model of expert reliability and a Bayesian method of pooling advice to select actions during exploration. Our results show that CLUE maintains the benefits of traditional approaches when advised by reliable experts, but is robust to the presence of unreliable experts. When learning with multiple experts, CLUE is able to rank experts by reliability and differentiate reliable experts from unreliable ones.
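To give a flavour of the idea, the pooling step described above can be sketched as a naive-Bayes combination of per-expert advice, weighted by each expert's estimated reliability. This is an illustrative sketch only, not the paper's exact formulation: the function name `pool_advice`, the uniform prior, and the assumption that an unreliable expert's advice is spread evenly over the remaining actions are all simplifications introduced here.

```python
import numpy as np

def pool_advice(n_actions, advice, reliability):
    """Pool action advice from multiple experts (illustrative sketch).

    advice:      dict mapping expert name -> advised action index
    reliability: dict mapping expert name -> estimated P(advice is correct)
    Returns a normalised posterior distribution over actions.
    """
    # Start from a uniform prior over actions.
    posterior = np.full(n_actions, 1.0 / n_actions)
    for expert, a in advice.items():
        rho = reliability[expert]
        # Likelihood of each action given this expert's advice:
        # probability mass rho on the advised action, with the
        # remainder spread evenly over the other actions.
        likelihood = np.full(n_actions, (1.0 - rho) / (n_actions - 1))
        likelihood[a] = rho
        posterior *= likelihood
    return posterior / posterior.sum()

# Two fairly reliable experts advise action 2; one expert whose
# reliability estimate is at chance level advises action 0.
dist = pool_advice(3,
                   advice={"e1": 2, "e2": 2, "e3": 0},
                   reliability={"e1": 0.9, "e2": 0.8, "e3": 0.5})
```

Note that an expert whose estimated reliability is near chance (here `e3` with 0.5 over 3 actions) contributes an almost-flat likelihood and so has little influence on the pooled posterior, which is the "cautious" behaviour the framework aims for.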

Publication
Neural Computing and Applications
Tamlin Love
PhD Student

I am a PhD student at the Institut de Robòtica i Informàtica Industrial (IRI) (under CSIC and UPC) in Barcelona, working on the TRAIL Marie Skłodowska-Curie Doctoral Network under the supervision of Guillem Alenyà. I was previously an MSc student and lecturer at the University of the Witwatersrand, under the supervision of Benjamin Rosman and Ritesh Ajoodha, as well as a member of the RAIL Lab.

Benjamin Rosman
Lab Director

I am a Professor in the School of Computer Science and Applied Mathematics at the University of the Witwatersrand in Johannesburg. I work in robotics, artificial intelligence, decision theory and machine learning.