STELA: a community-centred approach to norm elicitation for AI alignment (2024)

Author(s): Stevie Bergman, Nahema Marchal, John Mellor, Shakir Mohamed et al
Journal: Scientific Reports

The Ethics of Advanced AI Assistants (2024)

Author(s): Iason Gabriel, Arianna Manzini, Geoff Keeling, Lisa Anne Hendricks et al
Journal: arXiv

Representation in AI Evaluations (2023)

Author(s): A Stevie Bergman, Lisa Anne Hendricks, Maribeth Rauh, Boxi Wu et al
Journal: Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency

Using the Veil of Ignorance to align AI systems with principles of justice (2023)

Author(s): Laura Weidinger, Kevin R McKee, Richard Everett, Saffron Huang et al
Journal: Proceedings of the National Academy of Sciences

In Conversation with Artificial Intelligence: Aligning Language Models with Human Values (2023)

Author(s): A Kasirzadeh, I Gabriel
Journal: Philosophy & Technology

The Challenge of Value Alignment: From Fairer Algorithms to AI Safety (2022)

Author(s): I Gabriel, V Ghazavi
Journal: The Oxford Handbook of Digital Ethics (ed) Carissa Veliz (OUP, 2022)

Power to the People? Opportunities and Challenges for Participatory AI (2022)

Author(s): A Birhane, W Isaac, V Prabhakaran, M Díaz, MC Elish, I Gabriel et al
Journal: ACM EAAMO

A Human Rights-Based Approach to Responsible AI (2022)

Author(s): V Prabhakaran, M Mitchell, T Gebru, I Gabriel
Journal: ACM EAAMO poster

Toward a Theory of Justice for Artificial Intelligence (2022)

Author(s): Iason Gabriel
Journal: Daedalus 151 (2), 218-231

Artificial Intelligence, Values, and Alignment (2020)

Author(s): I Gabriel
Journal: Minds and machines 30 (3), 411-437

End of content

End of content