Schema-Based Understandability in Human-Robot Interactions: A Cognitive Framework

Robots are entering hospitals, airports, classrooms and homes, yet a persistent barrier to effective human–robot interaction is often …

Victoria Williams, Benjamin Rosman

The Five Senses: Assessing Non-verbal Communication in Multicultural Human–Robot Interaction

As social robots move from research laboratories into everyday settings, they increasingly encounter users whose sensory expectations …

Victoria Williams, Benjamin Rosman

Finding the FrameStack: Learning What to Remember for Non-Markovian Reinforcement Learning

Recent success in developing increasingly general-purpose agents based on sequence models has led to increased focus on the problem of …

Geraud Nangue Tasse, Matthew Riemer, Benjamin Rosman, Tim Klinger

Procedural Generation of Semantically Correct Levels in Video Games using Reward Shaping

The generation of video game levels traditionally relies on manual efforts from skilled professionals, resulting in significant …

Luke Kerker, Branden Ingram, Pravesh Ranchod

A Linear Network Theory of Iterated Learning

Language provides one of the primary examples of humans' ability to systematically generalize: reasoning about new …

Devon Jarvis, Richard Klein, Benjamin Rosman, Andrew Saxe

Optimal Task Generalisation in Cooperative Multi-Agent Reinforcement Learning

While task generalisation is widely studied in the context of single-agent reinforcement learning (RL), little research exists in the …

Simon Rosen, Abdel Mfougouon Njupoun, Geraud Nangue Tasse, Steven James, Benjamin Rosman

ROSARL: Reward-Only Safe Reinforcement Learning

An important problem in reinforcement learning is designing agents that learn to solve tasks safely in an environment. A common …

Geraud Nangue Tasse, Tamlin Love, Mark Nemecek, Steven James, Benjamin Rosman

MinePlanner: A Benchmark for Long-Horizon Planning in Large Minecraft Worlds

We propose a new benchmark for planning tasks based on the Minecraft game. Our benchmark contains 45 tasks overall, but also provides …

William Hill, Ireton Liu, Anita De Mello Koch, Damion Harvey, Nishanth Kumar, George Konidaris, Steven James

Counting Reward Automata: Sample Efficient Reinforcement Learning Through the Exploitation of Reward Function Structure

We present counting reward automata—a finite state machine variant capable of modelling any reward function expressible as a …

Tristan Bester, Benjamin Rosman, Steven James, Geraud Nangue Tasse

Towards Financially Inclusive Credit Products Through Financial Time Series Clustering

Financial inclusion ensures that individuals have access to financial products and services that meet their needs. As a key …

Tristan Bester, Benjamin Rosman