Safe robot execution in model-based reinforcement learning

To access the full-text documents, please follow this link: http://hdl.handle.net/2117/85331

Title:	Safe robot execution in model-based reinforcement learning
Author(s):	Martínez Martínez, David; Alenyà Ribas, Guillem; Torras, Carme
Other authors:	Institut de Robòtica i Informàtica Industrial; Universitat Politècnica de Catalunya. ROBiri - Grup de Robòtica de l'IRI
Abstract:	Task learning in robotics requires repeatedly executing the same actions in different states to learn the model of the task. However, in real-world domains, there are usually sequences of actions that, if executed, may produce unrecoverable errors (e.g. breaking an object). Robots should avoid repeating such errors when learning, and thus explore the state space in a more intelligent way. This requires identifying dangerous action effects to avoid including such actions in the generated plans, while at the same time enforcing that the learned models are complete enough for the planner not to fall into dead-ends. We thus propose a new learning method that allows a robot to reason about dead-ends and their causes. Some such causes may be dangerous action effects (i.e., leading to unrecoverable errors if the action were executed in the given state) so that the method allows the robot to skip the exploration of risky actions and guarantees the safety of planned actions. If a plan might lead to a dead-end (e.g., one that includes a dangerous action effect), the robot tries to find an alternative safe plan and, if not found, it actively asks a teacher whether the risky action should be executed. This method permits learning safe policies as well as minimizing unrecoverable errors during the learning process. Experimental validation of the approach is provided in two different scenarios: a robotic task and a simulated problem from the international planning competition. Our approach greatly increases success ratios in problems where previous approaches had high probabilities of failing.
Note:	Peer Reviewed
Subject(s):	-Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial -learning (artificial intelligence) -manipulators -planning (artificial intelligence) -Classificació INSPEC::Cybernetics::Artificial intelligence::Learning (artificial intelligence)
Rights:	http://creativecommons.org/licenses/by-nc-nd/3.0/es/
Document type:	Article - Submitted version; Conference object
Publisher:	Institute of Electrical and Electronics Engineers (IEEE)
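
The decision rule summarized in the abstract (prefer a safe plan; if none exists, ask a teacher before executing a risky action) can be illustrated with a minimal sketch. This is not the authors' implementation: the names `select_plan`, `risky_steps`, `dangerous`, and `ask_teacher`, and the representation of a plan as a list of (state, action) pairs, are all assumptions made for illustration only.

```python
from typing import Callable, List, Optional, Sequence, Set, Tuple

# Hypothetical representation: a plan step is a (state label, action name) pair.
Step = Tuple[str, str]


def risky_steps(plan: Sequence[Step], dangerous: Set[Step]) -> List[Step]:
    """Return the steps of `plan` that are known to have dangerous effects."""
    return [step for step in plan if step in dangerous]


def select_plan(
    candidate_plans: Sequence[Sequence[Step]],
    dangerous: Set[Step],
    ask_teacher: Callable[[Step], bool],
) -> Optional[List[Step]]:
    """Pick a plan to execute, or None if no plan may be executed.

    1. Prefer any candidate plan that contains no known dangerous step.
    2. Otherwise, ask the (hypothetical) teacher about every risky step of a
       plan and execute that plan only if all of them are approved.
    """
    plans = [list(plan) for plan in candidate_plans]

    # Step 1: look for an alternative plan that is safe.
    for plan in plans:
        if not risky_steps(plan, dangerous):
            return plan

    # Step 2: no safe alternative was found, so defer to the teacher.
    for plan in plans:
        if all(ask_teacher(step) for step in risky_steps(plan, dangerous)):
            return plan

    return None


if __name__ == "__main__":
    dangerous = {("holding_glass", "drop")}
    plans = [
        [("holding_glass", "drop")],
        [("holding_glass", "place_on_table")],
    ]
    # A safe alternative exists, so the teacher is never consulted.
    print(select_plan(plans, dangerous, ask_teacher=lambda step: False))
```

In this sketch the set of dangerous steps is given up front; in the approach described in the abstract, such effects are identified by the learner while it reasons about dead-ends and their causes.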

Related documents

Other documents by the same author(s)

V-MIN: efficient reinforcement learning through demonstrations and relaxed reward demands

Martínez Martínez, David; Alenyà Ribas, Guillem; Torras, Carme

Learning probabilistic action models from interpretation transitions

Martínez Martínez, David; Ribeiro, Tony; Inoue, Katsumi; Alenyà Ribas, Guillem; Torras, Carme

Planning surface cleaning tasks by learning uncertain drag actions outcomes

Martínez Martínez, David; Alenyà Ribas, Guillem; Torras, Carme

Finding safe policies in model-based active learning

Martínez Martínez, David; Alenyà Ribas, Guillem; Torras, Carme

Active learning of manipulation sequences

Martínez Martínez, David; Alenyà Ribas, Guillem; Jimenez Schlegl, Pablo; Torras, Carme; Rossmann, Jürgen; Wantia, Nils; Eren Erdal, Aksoy; Haller, Simon; Piater, Justus
