Finding the FrameStack: Learning What to Remember for Non-Markovian Reinforcement Learning
Recent success in developing increasingly general purpose agents based on sequence models has led to increased focus on the problem of …
Geraud Nangue Tasse, Matthew Riemer, Benjamin Rosman, Tim Klinger
Procedural Generation of Semantically Correct Levels in Video Games using Reward Shaping
The generation of video game levels traditionally relies on manual efforts from skilled professionals, resulting in significant …
Luke Kerker, Branden Ingram, Pravesh Ranchod
A Linear Network Theory of Iterated Learning
Language provides one of the primary examples of human’s ability to systematically generalize — reasoning about new …
Devon Jarvis, Richard Klein, Benjamin Rosman, Andrew Saxe
Optimal Task Generalisation in Cooperative Multi-Agent Reinforcement Learning
While task generalisation is widely studied in the context of single-agent reinforcement learning (RL), little research exists in the …
Simon Rosen, Abdel Mfougouon Njupoun, Geraud Nangue Tasse, Steven James, Benjamin Rosman
ROSARL: Reward-Only Safe Reinforcement Learning
An important problem in reinforcement learning is designing agents that learn to solve tasks safely in an environment. A common …
Geraud Nangue Tasse, Tamlin Love, Mark Nemecek, Steven James, Benjamin Rosman
MinePlanner: A Benchmark for Long-Horizon Planning in Large Minecraft Worlds
We propose a new benchmark for planning tasks based on the Minecraft game. Our benchmark contains 45 tasks overall, but also provides …
William Hill, Ireton Liu, Anita De Mello Koch, Damion Harvey, Nishanth Kumar, George Konidaris, Steven James
Counting Reward Automata: Sample Efficient Reinforcement Learning Through the Exploitation of Reward Function Structure
We present counting reward automata—a finite state machine variant capable of modelling any reward function expressible as a …
Tristan Bester, Benjamin Rosman, Steven James, Geraud Nangue Tasse
Towards Financially Inclusive Credit Products Through Financial Time Series Clustering
Financial inclusion ensures that individuals have access to financial products and services that meet their needs. As a key …
Tristan Bester, Benjamin Rosman
Generalisable Agents for Neural Network Optimisation
Optimising deep neural networks is a challenging task due to complex training dynamics, high computational requirements, and long …
Kale-ab Tessera, Callum Tilbury, Sasha Abramowitz, Ruan de Kock, Omayma Mahjoub, Benjamin Rosman, Sara Hooker, Arnu Pretorius
Hierarchical Reinforcement Learning with AI Planning Models
Deep Reinforcement Learning (DRL) has shown breakthroughs in solving challenging problems, such as pixel-based games and continuous …
Junkyu Lee, Michael Katz, Don Joven Agravante, Miao Liu, Geraud Nangue Tasse, Tim Klinger, Shirin Sohrabi