2026-04-17T05:29:45Zhttps://recercat.cat/oai/request

oai:recercat.cat:2117/1270772025-07-17T16:06:05Zcom_2072_1033col_2072_452951

00925njm 22002777a 4500 dc Surís Coll-Vinent, Dídac author 2018-10-17 To be defined at MIT. Deep learning models, and more specifically computer vision systems, have achieved great results in recent years. However, the interpretability and understanding of these models is still in its early stages. Interpretability can be approached from a low-level or filter level perspective, but the representations learned by neural networks encompass a much higher-level knowledge that has to be approached from a semantic point of view, with concepts in mind. The goal of this project is to investigate the concepts neural networks learn implicitly when they are trained in an unsupervised scenario, with a special focus on the multimodal matching of words to visual objects and attributes. We study how we can detect these concepts, as well as how we can force the networks to learn more meaningful ones, both providing analytical insights and getting practical results. Àrees temàtiques de la UPC::Enginyeria de la telecomunicació Neural networks (Computer science) Computer vision Neural networks concepts vision and language sound speech convolutional networks multimodal learning unsupervised learning Xarxes neuronals (Informàtica) Visió per ordinador How concepts emerge in neural networks