top of page
Research Interests
AI policy and governance, institutional design, existential risks to humanity, moral philosophy, political philosophy, animal minds
Publications
Aligned with Whom? Direct and Social Goals for AI Systems with Anton Korinek. NBER. May 2022.
https://www.nber.org/papers/w30017
Truthful AI: Developing and governing AI that does not lie, with Owain Evans, Owen Cotton-Barratt, Lukas Finnveden, Adam Bales, Avital Balwit, Peter Wills, Luca Righetti, William Saunders, October 2021. https://arxiv.org/abs/2110.06674
Research Assistant Work
bottom of page