Home
Projects
Publications
People
Join the Lab
Contact
Login
9
Optimal Task Generalisation in Cooperative Multi-Agent Reinforcement Learning
While task generalisation is widely studied in the context of single-agent reinforcement learning (RL), little research exists in the …
Simon Rosen
,
Abdel Mfougouon Njupoun
,
Geraud Nangue Tasse
,
Steven James
,
Benjamin Rosman
PDF
Cite
Project
MinePlanner: A Benchmark for Long-Horizon Planning in Large Minecraft Worlds
We propose a new benchmark for planning tasks based on the Minecraft game. Our benchmark contains 45 tasks overall, but also provides …
William Hill
,
Ireton Liu
,
Anita De Mello Koch
,
Damion Harvey
,
Nishanth Kumar
,
George Konidaris
,
Steven James
PDF
Cite
Counting Reward Automata: Sample Efficient Reinforcement Learning Through the Exploitation of Reward Function Structure
We present counting reward automata—a finite state machine variant capable of modelling any reward function expressible as a …
Tristan Bester
,
Benjamin Rosman
,
Steven James
,
Geraud Nangue Tasse
PDF
Cite
Project
Towards Financially Inclusive Credit Products Through Financial Time Series Clustering
Financial inclusion ensures that individuals have access to financial products and services that meet their needs. As a key …
Tristan Bester
,
Benjamin Rosman
PDF
Cite
Generalisable Agents for Neural Network Optimisation
Optimising deep neural networks is a challenging task due to complex training dynamics, high computational requirements, and long …
Kale-ab Tessera
,
Callum Tilbury
,
Sasha Abramowitz
,
Ruan de Kock
,
Omayma Mahjoub
,
Benjamin Rosman
,
Sara Hooker
,
Arnu Pretorius
PDF
Cite
Hierarchical Reinforcement Learning with AI Planning Models
Deep Reinforcement Learning (DRL) has shown breakthroughs in solving challenging problems, such as pixel-based games and continuous …
Junkyu Lee
,
Michael Katz
,
Don Joven Agravante
,
Miao Liu
,
Geraud Nangue Tasse
,
Tim Klinger
,
Shirin Sohrabi
PDF
Cite
Preparing the Vuk'uzenzele and ZA-gov-multilingual South African Multilingual Corpora
This paper introduces two multilingual government themed corpora in various South African languages. The corpora were collected by …
Richard Lastrucci
,
Isheanesu Dzingirai
,
Jenalea Rajab
,
Andani Madodonga
,
Matimba Shingange
,
Daniel Njini
,
Vukosi Marivate
PDF
Cite
A Framework for Grassroots Research Collaboration in Machine Learning and Global Health
Traditional top-down approaches for global health have historically failed to achieve social progress (Hoffman et al., 2015; Hoffman …
Christopher Currin
,
Mercy Asiedu
,
Chris Fourie
,
Benjamin Rosman
,
Houcemeddine Turki
,
Atnafu Tonja
,
Jade Abbott
,
Marvellous Ajala
,
Sadiq Adedayo
,
Chris Emezue
,
Daphne Machangara
PDF
Cite
End-to-End Learning to Follow Language Instructions with Compositional Policies
We develop an end-to-end model for learning to follow language instructions with compositional policies. Our model combines large …
Vanya Cohen
,
Geraud Nangue Tasse
,
Nakul Gopalan
,
Steven James
,
Raymond Mooney
,
Benjamin Rosman
PDF
Cite
Project
Skill Machines: Temporal Logic Composition in Reinforcement Learning
A major challenge in reinforcement learning is specifying tasks in a manner that is both interpretable and verifiable. One common …
Geraud Nangue Tasse
,
Devon Jarvis
,
Steven James
,
Benjamin Rosman
PDF
Cite
Project
»
Cite
×