Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review

Other authors

[Carrasco-Ribelles LA] Fundació Institut Universitari per a la recerca a l’Atenció Primària de Salut Jordi Gol I Gurina (IDIAPJGol), Barcelona, 08007, Spain. Department of Signal Theory and Communications, Universitat Politècnica de Catalunya (UPC), Barcelona, 08034, Spain. Unitat de Suport a la Recerca Metropolitana Nord, Fundació Institut Universitari per a la recerca a l’Atencio Primaria de Salut Jordi Gol I Gurina (IDIAPJGol), Mataró, 08303, Spain. [Llanes-Jurado J] Instituto de Investigacion e Innovacion en Bioingenierıa (i3B), Universitat Politècnica de València (UPV), València, 46022, Spain. [Gallego-Moll C] Fundació Institut Universitari per a la recerca a l’Atenció Primària de Salut Jordi Gol I Gurina (IDIAPJGol), Barcelona, 08007, Spain. Unitat de Suport a la Recerca Metropolitana Nord, Fundació Institut Universitari per a la recerca a l’Atenció Primària de Salut Jordi Gol I Gurina (IDIAPJGol), Mataró, 08303, Spain. [Cabrera-Bean M] Department of Signal Theory and Communications, Universitat Politècnica de Catalunya (UPC), Barcelona, 08034, Spain. [Monteagudo-Zaragoza M] Fundació Institut Universitari per a la recerca a l’Atenció Primària de Salut Jordi Gol I Gurina (IDIAPJGol), Barcelona, 08007, Spain. [Violán C] Unitat de Suport a la Recerca Metropolitana Nord, Fundació Institut Universitari per a la recerca a l’Atenció Primària de Salut Jordi Gol I Gurina (IDIAPJGol), Mataró, 08303, Spain. Direcció d’Atenció Primària Metropolitana Nord, Institut Catala de Salut, Badalona, 08915, Spain. Fundació Institut d’Investigació en Ciències de la Salut Germans Trias i Pujol (IGTP), Badalona, 08916, Spain. Fundació UAB, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, 08193, Spain. [Zabaleta-del-Olmo E] Fundació Institut Universitari per a la recerca a l’Atenció Primària de Salut Jordi Gol I Gurina (IDIAPJGol), Barcelona, 08007, Spain. Gerència Territorial de Barcelona, Institut Català de la Salut, Barcelona, 08007, Spain. Nursing Department, Faculty of Nursing, Universitat de Girona, Girona, 17003, Spain

IDIAP Jordi Gol

Publication date

2023-09-26T12:21:44Z

2023-09-26T12:21:44Z

2023-09



Abstract

Intel·ligència artificial; Registres sanitaris electrònics; Model predictiu


Inteligencia artificial; Registros sanitarios electrónicos; Modelo predictivo


Artificial intelligence; Electronic health records; Predictive model


Objective: To describe and appraise the use of artificial intelligence (AI) techniques that can cope with longitudinal data from electronic health records (EHRs) to predict health-related outcomes. Methods: This review included studies in any language that: EHR was at least one of the data sources, collected longitudinal data, used an AI technique capable of handling longitudinal data, and predicted any health-related outcomes. We searched MEDLINE, Scopus, Web of Science, and IEEE Xplorer from inception to January 3, 2022. Information on the dataset, prediction task, data preprocessing, feature selection, method, validation, performance, and implementation was extracted and summarized using descriptive statistics. Risk of bias and completeness of reporting were assessed using a short form of PROBAST and TRIPOD, respectively. Results: Eighty-one studies were included. Follow-up time and number of registers per patient varied greatly, and most predicted disease development or next event based on diagnoses and drug treatments. Architectures generally were based on Recurrent Neural Networks-like layers, though in recent years combining different layers or transformers has become more popular. About half of the included studies performed hyperparameter tuning and used attention mechanisms. Most performed a single train-test partition and could not correctly assess the variability of the model’s performance. Reporting quality was poor, and a third of the studies were at high risk of bias. Conclusions: AI models are increasingly using longitudinal data. However, the heterogeneity in reporting methodology and results, and the lack of public EHR datasets and code sharing, complicate the possibility of replication.

Document Type

Article


Published version

Language

English

Publisher

Oxford

Related items

Journal of the American Medical Informatics Association;

https://doi.org/10.1093/jamia/ocad168

Recommended citation

This citation was generated automatically.

Rights

Attribution-NonCommercial-ShareAlike 4.0 International

http://creativecommons.org/licenses/by-nc-sa/4.0/

This item appears in the following Collection(s)