Multi-Objective Reinforcement Learning for Designing Ethical Environments

Rodríguez Soto, Manel; López Sánchez, Maite; Rodríguez-Aguilar, Juan A. (Juan Antonio)

dc.contributor.author

Rodríguez Soto, Manel

dc.contributor.author

López Sánchez, Maite

dc.contributor.author

Rodríguez-Aguilar, Juan A. (Juan Antonio)

dc.date.issued

2025-01-22T09:02:54Z

dc.date.issued

2021

dc.identifier

https://hdl.handle.net/2445/217805

dc.description.abstract

AI research is being challenged with ensuring that autonomous agents learn to behave ethically, namely in alignment with moral values. A common approach, founded on the exploitation of Reinforcement Learning techniques, is to design environments that incentivise agents to behave ethically. However, to the best of our knowledge, current approaches do not theoretically guarantee that an agent will learn to behave ethically. Here, we make headway in this direction by proposing a novel way of designing environments wherein it is formally guaranteed that an agent learns to behave ethically while pursuing its individual objectives. Our theoretical results are developed within the formal framework of Multi-Objective Reinforcement Learning to ease the handling of an agent's individual and ethical objectives. As a further contribution, we leverage our theoretical results to introduce an algorithm that automates the design of ethical environments.

dc.format

7 p.

dc.format

application/pdf

dc.format

application/pdf

dc.language

eng

dc.publisher

International Joint Conferences on Artificial Intelligence

dc.relation

Reproduction of the document available at: https://doi.org/10.24963/ijcai.2021/76

dc.relation

Paper presented at: 30th International Joint Conference on Artificial Intelligence (IJCAI 2021)

dc.relation

https://doi.org/10.24963/ijcai.2021/76

dc.rights

info:eu-repo/semantics/openAccess

dc.source

Comunicacions a congressos (Matemàtiques i Informàtica)

dc.subject

Intel·ligència artificial

dc.subject

Ètica

dc.subject

Aprenentatge per reforç (Intel·ligència artificial)

dc.subject

Artificial intelligence

dc.subject

Ethics

dc.subject

Reinforcement learning

dc.title

Multi-Objective Reinforcement Learning for Designing Ethical Environments

dc.type

info:eu-repo/semantics/conferenceObject

dc.type

info:eu-repo/semantics/publishedVersion

This item appears in the following collection(s)

ISGlobal - Institut de Salut Global de Barcelona [60808]

Matemàtiques i Informàtica [1007]