Set covering machine on t-cell receptor LLM representations for lung cancer prediction

dc.contributor.author
Vegas Morales, Neus
dc.date.accessioned
2025-11-05T20:36:24Z
dc.date.available
2025-11-05T20:36:24Z
dc.date.issued
2025-11-04T16:03:40Z
dc.date.issued
2025-11-04T16:03:40Z
dc.date.issued
2025
dc.identifier
http://hdl.handle.net/10230/71765
dc.identifier.uri
http://hdl.handle.net/10230/71765
dc.description.abstract
Treball fi de màster de: Erasmus Mundus joint Master in Artificial Intelligence (EMAI)
dc.description.abstract
Supervisors: Benny Chain and John Shawe-Taylor
dc.description.abstract
Tutor: Massimo Mecella
dc.description.abstract
T-cell receptors (TCRs) provide insights into immune recognition of cancer. We explore whether interpretable rule-based classifiers derived from SCEPTR embeddings of TCR sequences can differentiate cancer repertoires from healthy controls. Using the Set Covering Machine algorithm, we developed models with hyperplane and similarity based rules across alpha and beta chains. Despite strong performance on training data, models failed to generalize to external datasets. Unexpectedly, alpha-chain models often outperformed beta-chain models, and single rules sometimes achieved high training accuracy, suggesting overfitting. Our findings highlight challenges in detecting cancer-specific TCR signatures and indicate current embeddings may capture technical patterns rather than biological signal. We propose future directions including improved rule generation strategies and validation with functionally annotated repertoires.
dc.format
application/pdf
dc.language
eng
dc.rights
Llicència CC Reconeixement-NoComercial-SenseObraDerivada 4.0 Internacional (CC BY-NC-ND 4.0)
dc.rights
https://creativecommons.org/licenses/by-nc-nd/4.0/
dc.rights
info:eu-repo/semantics/openAccess
dc.subject
Càncer
dc.title
Set covering machine on t-cell receptor LLM representations for lung cancer prediction
dc.type
info:eu-repo/semantics/masterThesis


Fitxers en aquest element

FitxersGrandàriaFormatVisualització

No hi ha fitxers associats a aquest element.

Aquest element apareix en la col·lecció o col·leccions següent(s)