Psycholinguistic probing of language models' internal layers

Rivera Hidalgo de Torralba, Paula

dc.contributor.author

Rivera Hidalgo de Torralba, Paula

dc.date.accessioned

2025-10-07T19:21:34Z

dc.date.available

2025-10-07T19:21:34Z

dc.date.issued

2025-10-06T15:37:57Z

dc.date.issued

2025

dc.identifier

http://hdl.handle.net/10230/71399

dc.identifier.uri

http://hdl.handle.net/10230/71399

dc.description.abstract

Master's thesis in Theoretical and Applied Linguistics

dc.description.abstract

Supervisors: Dr. Iria de Dios Flores and Dr. Corentin Kervadec

dc.description.abstract

This study investigates how transformer-based large language models (LLMs) resolve subject–verb agreement across syntactic structures of varying complexity and distractor conditions. We hypothesized that LLMs would succeed in simple configurations but struggle under deeper embedding and number mismatches. Using a controlled dataset adapted from psycholinguistic research, we analyze model behavior across six sentence structures, two attractor conditions (mismatch vs. no mismatch), and four lexical variants. Working with the Pythia 6.9B model, we combine three analyses (accuracy, prediction depth, and the Tuned Lens interpretability method) to track how agreement resolution evolves across layers. Results confirm our hypothesis: the model performs reliably in simple structures but fails in deeply embedded object-relative clauses. Prediction depth shows early resolution in simple cases and delayed or failed resolution in complex ones. These findings clarify LLM limitations in syntactic processing and highlight the importance of linguistically informed evaluation methods for understanding model behavior across structural configurations.

dc.format

application/pdf

dc.language

eng

dc.rights

CC Attribution-NonCommercial-NoDerivatives 4.0 International License (CC BY-NC-ND 4.0)

dc.rights

https://creativecommons.org/licenses/by-nc-nd/4.0/

dc.rights

info:eu-repo/semantics/openAccess

dc.subject

Psycholinguistics

dc.title

Psycholinguistic probing of language models' internal layers

dc.type

info:eu-repo/semantics/masterThesis

This item appears in the following collection(s)

Treballs d'estudiants [4945]