Psycholinguistic probing of language models' internal layers

dc.contributor.author
Rivera Hidalgo de Torralba, Paula
dc.date.accessioned
2025-10-07T19:21:34Z
dc.date.available
2025-10-07T19:21:34Z
dc.date.issued
2025-10-06T15:37:57Z
dc.date.issued
2025-10-06T15:37:57Z
dc.date.issued
2025
dc.identifier
http://hdl.handle.net/10230/71399
dc.identifier.uri
http://hdl.handle.net/10230/71399
dc.description.abstract
Treball de fi de màster en Lingüística Teòrica i Aplicada
dc.description.abstract
Directors: : Dra. Dra. Iria de Dios Flores i Dr. Corentin Kervadec
dc.description.abstract
This study investigates how transformer-based large language models (LLMs) resolve subject–verb agreement across syntactic structures of varying complexity and distractor conditions. We hypothesized that LLMs would succeed in simple configurations but struggle under deeper embedding and number mismatches. Using a controlled dataset adapted from psycholinguistic research, we analyze model behavior across six sentence structures, two attractor conditions (mismatch vs. no mismatch), and four lexical variants. Using the Pythia 6.9B model, we apply three evaluation metrics—accuracy, prediction depth, and the Tuned Lens interpretability method—to track how agreement resolution evolves across layers. Results confirm our hypothesis: the model performs reliably in simple structures but fails in deeply embedded object-relative clauses. Prediction depth shows early resolution in simple cases and delayed or failed resolution in complex ones. These findings clarify LLM limitations in syntactic processing and highlight the importance of using linguistically informed evaluation methods to better understand model behavior across structural configurations.
dc.format
application/pdf
dc.language
eng
dc.rights
Llicència CC Reconeixement-NoComercial-SenseObraDerivada 4.0 Internacional (CC BY-NC-ND 4.0)
dc.rights
https://creativecommons.org/licenses/by-nc-nd/4.0/
dc.rights
info:eu-repo/semantics/openAccess
dc.subject
Psicolingüística
dc.title
Psycholinguistic probing of language models' internal layers
dc.type
info:eu-repo/semantics/masterThesis


Ficheros en el ítem

FicherosTamañoFormatoVer

No hay ficheros asociados a este ítem.

Este ítem aparece en la(s) siguiente(s) colección(ones)