Institut Català de la Salut
[Esteban LM] Department of Applied Mathematics, Escuela Universitaria Politécnica de La Almunia, Universidad de Zaragoza, La Almunia de Doña Godina, Spain. Institute for Biocomputation and Physics of Complex Systems (BIFI), Zaragoza, Spain. [Borque-Fernando Á] Department of Urology, Miguel Servet University Hospital, Zaragoza, Spain. Zaragoza, Spain. Health Research Institute of Aragon Foundation, Zaragoza, Spain. [Escorihuela ME] Department of Applied Mathematics, Escuela Universitaria Politécnica de La Almunia, Universidad de Zaragoza, La Almunia de Doña Godina, Spain. [Esteban-Escaño J] Department of Electronic Engineering and Communications, Escuela Universitaria Politécnica de La Almunia, Universidad de Zaragoza, La Almunia de Doña Godina, Spain. [Abascal JM] Department of Urology, Department of Surgery, Parc de Salut Mar, Universitat Pompeu Fabra, Barcelona, Spain. [Servian P] Department of Urology, Hospital Germans Trias i Pujol, Badalona, Spain. [Morote J] Servei d’Urologia, Vall d’Hebron Hospital Universitari, Barcelona, Spain. Department of Surgery, Universitat Autònoma de Barcelona, Bellaterra, Spain. Grup de Recerca Biomèdica en Urologia, Vall d’Hebron Institut de Recerca (VHIR), Barcelona, Spain
Vall d'Hebron Barcelona Hospital Campus
2025-03-21T08:05:16Z
2025-03-21T08:05:16Z
2025-02-04
Clinical utility; Clinically significant prostate cancer; Machine learning
Utilitat clínica; Càncer de pròstata clínicament significatiu; Aprenentatge automàtic
Utilidad clínica; Cáncer de próstata clínicamente significativo; Aprendizaje automático
In prostate cancer (PCa), risk calculators have been proposed, relying on clinical parameters and magnetic resonance imaging (MRI) enable early prediction of clinically significant cancer (CsPCa). The prostate imaging–reporting and data system (PI-RADS) is combined with clinical variables predominantly based on logistic regression models. This study explores modeling using regularization techniques such as ridge regression, LASSO, elastic net, classification tree, tree ensemble models like random forest or XGBoost, and neural networks to predict CsPCa in a dataset of 4799 patients in Catalonia (Spain). An 80–20% split was employed for training and validation. We used predictor variables such as age, prostate-specific antigen (PSA), prostate volume, PSA density (PSAD), digital rectal exam (DRE) findings, family history of PCa, a previous negative biopsy, and PI-RADS categories. When considering a sensitivity of 0.9, in the validation set, the XGBoost model outperforms others with a specificity of 0.640, followed closely by random forest (0.638), neural network (0.634), and logistic regression (0.620). In terms of clinical utility, for a 10% missclassification of CsPCa, XGBoost can avoid 41.77% of unnecessary biopsies, followed closely by random forest (41.67%) and neural networks (41.46%), while logistic regression has a lower rate of 40.62%. Using SHAP values for model explainability, PI-RADS emerges as the most influential risk factor, particularly for individuals with PI-RADS 4 and 5. Additionally, a positive digital rectal examination (DRE) or family history of prostate cancer proves highly influential for certain individuals, while a previous negative biopsy serves as a protective factor for others.
L.M.E., M.E.E., J.E.-E. and A.B.-F. received support by the Government of Aragon [Grant Number T69_23R]; L.M.E. and A.B.-F. received support by the Ministerio de Ciencia e Innovación [Grant Number PID2020-116873GB-I00]; J.M.A, P.S. and J.M. received support by the Instituto de Salut Carlos III and the European Union [Grant Number PI20/01666].
Artículo
Versión publicada
Inglés
Pròstata - Càncer - Imatgeria; Pròstata - Càncer - Imatgeria per ressonància magnètica; Aprenentatge automàtic; DISEASES::Neoplasms::Neoplasms by Site::Urogenital Neoplasms::Genital Neoplasms, Male::Prostatic Neoplasms; Other subheadings::Other subheadings::Other subheadings::/diagnostic imaging; ANALYTICAL, DIAGNOSTIC AND THERAPEUTIC TECHNIQUES, AND EQUIPMENT::Diagnosis::Diagnostic Techniques and Procedures::Diagnostic Imaging::Tomography::Magnetic Resonance Imaging; PHENOMENA AND PROCESSES::Mathematical Concepts::Algorithms::Artificial Intelligence::Machine Learning; ENFERMEDADES::neoplasias::neoplasias por localización::neoplasias urogenitales::neoplasias de los genitales masculinos::neoplasias de la próstata; Otros calificadores::Otros calificadores::Otros calificadores::/diagnóstico por imagen; TÉCNICAS Y EQUIPOS ANALÍTICOS, DIAGNÓSTICOS Y TERAPÉUTICOS::diagnóstico::técnicas y procedimientos diagnósticos::diagnóstico por imagen::tomografía::imagen por resonancia magnética; FENÓMENOS Y PROCESOS::conceptos matemáticos::algoritmos::inteligencia artificial::aprendizaje automático
Nature Portfolio
Scientific Reports;15
https://doi.org/10.1038/s41598-025-88297-6
Attribution-NonCommercial-NoDerivatives 4.0 International
http://creativecommons.org/licenses/by-nc-nd/4.0/
Articles científics - VHIR [1655]