Autonomous agent with visual sensing for game environments

Guiró i Meléndez, Manel

Autonomous agent with visual sensing for game environments

dc.contributor

Universitat Politècnica de Catalunya. Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial

dc.contributor

Puig Cayuela, Vicenç

dc.contributor.author

Guiró i Meléndez, Manel

dc.date.accessioned

2026-02-25T04:14:56Z

dc.date.available

2026-02-25T04:14:56Z

dc.date.issued

2026-01-26

dc.identifier

https://hdl.handle.net/2117/456130

dc.identifier

PRISMA-201448

dc.identifier

PRISMA-201450

dc.identifier.uri

https://hdl.handle.net/2117/456130

dc.description.abstract

This paper presents the design and validation of a DRL reinforcement learning pipeline that operates exclusively using logical visual information. Unlike conventional approaches based on convolutional neural networks (CNNs) trained on raw pixels, the proposed system prioritizes the use of explicit Feature Engineering based on computer vision (CV) techniques, with the aim of favoring both almost direct sim-to-real transfer and complete adaptability to any version of the game, both with only slight modifications to the feature engineering without losing the DRLtraining performed, thus reducing computational cost and improving interpretability. The 1978 video game Space Invaders is used as a controlled experimental environment and is deliberately treated as a black box, being observed onlythroughscreenshotsandwithoutaccess to internal source code states. Visual observations are processed in real time to construct a structured state representation that encodesthemainelementsofthegame. Thisrepresentation is encapsulated in a customized Gym environment that standardizes interaction with a Deep Q-Network (DQN)agent. Training is performed under computational constraints, prioritizing functional validation of the pipeline over optimal performance. Beyondtheperformanceachieved,themaincontributionofthisworkistodemonstratethatFeature Engineering (in this case based on computer vision CV), integrated in a modular way with reinforcement learning, constitutes an optimal and efficient alternative to variations for highly complex DRL models, leaving a more viable option to take advantage of simulation training in real environments (sim-to-real).

dc.format

application/zip

dc.format

application/pdf

dc.language

eng

dc.publisher

Universitat Politècnica de Catalunya

dc.rights

Open Access

dc.subject

Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial

dc.subject

Machine learning

dc.subject

Video games

dc.subject

Aprenentatge automàtic

dc.subject

Videojocs

dc.title

Autonomous agent with visual sensing for game environments

dc.type

Master thesis

Ficheros en el ítem

Ficheros	Tamaño	Formato	Ver
No hay ficheros asociados a este ítem.

Este ítem aparece en la(s) siguiente(s) colección(ones)

Treballs acadèmics [82075]