Alignment
MoralityGym: A Benchmark for Evaluating Hierarchical Moral Alignment in Sequential Decision-Making Agents
Evaluating moral alignment in agents navigating conflicting, hierarchically structured human norms is a critical challenge at the …
Simon Rosen, Siddarth Singh, Ebenezer Gelo, Helen Sarah Robertson, Ibrahim Suder, Victoria Williams, Benjamin Rosman, Geraud Nangue Tasse, Steven James