CCLR-DL: A novel statistics and deep learning hybrid method for feature selection and forecasting healthcare demand

Hernández Guillamet, Guillem; López Seguí, Francesc; Vidal-Alaball, Josep; López Ibáñez, Beatriz; Hernández Guillamet, Guillem; López Seguí, Francesc; Vidal-Alaball, Josep; López Ibáñez, Beatriz

CCLR-DL: A novel statistics and deep learning hybrid method for feature selection and forecasting healthcare demand

Para acceder a los documentos con el texto completo, por favor, siga el siguiente enlace: http://hdl.handle.net/10256/27320

Autor/a

Hernández Guillamet, Guillem

López Seguí, Francesc

Vidal-Alaball, Josep

López Ibáñez, Beatriz

Fecha de publicación

2025-12-01

Resumen

Background and Objective: Hybrid forecasting methods aim to overcome the limitations of classical statistical approaches and deep learning models. While statistical methods provide interpretability, they often lack predictive power. Conversely, deep learning models achieve high accuracy but act as “black boxes.” This study introduces the Comprehensive Cross-Correlation and Lagged Linear Regression Deep Learning (CCLR-DL) framework, combining statistical and deep learning techniques to enhance both forecasting accuracy and interpretability. Unlike existing hybrid methods that combine statistical filtering with deep learning, CCLR-DL integrates causal statistical selection with neural forecasting, producing interpretable predictors and consistently achieving higher accuracy than models without feature selection or other standard baselines. Methods: The CCLR-DL framework integrates cross-correlation analysis, lagged multiple linear regression, and Granger causality testing with advanced deep learning architectures. This dual-phase approach first identifies causally significant predictors and then fits them into a deep learning model for multivariate time series forecasting. The framework was validated using a real-world dataset of clinical visits and diagnoses from 6.3 million individuals collected over 10 years. Results: In the evaluated setting, the CCLR-DL framework outperformed baseline models, achieving an average Root Mean Square Error (RMSE) improvement of 19.8% over univariate models, 60.1% over no feature selection, and 51.9% over random selection. The causality phase ensured that all selected predictors demonstrated a significant Granger-causal (GC) relationship. Simpler recurrent architectures, particularly bidirectional Long Short-Term Memory units (BiLSTM), yielded the most accurate forecasts by effectively capturing nonlinear temporal dependencies. Conclusions: By addressing the challenges of both prediction accuracy and model transparency, the CCLR-DL framework offers a new approach for high-dimensional, multivariate time series forecasting. In healthcare settings, it may enable decision-makers to anticipate demand shifts with greater reliability, allowing earlier staff scheduling, more efficient resource allocation, and reduced waiting times. In our evaluation, it consistently outperformed baseline strategies, delivering measurable improvements that translate into thousands of patient visits being forecasted more accurately across large populations

This work was conducted with the support of the Secretary of Universities and Research of the Department of Business and Knowledge at the Generalitat de Catalunya 2021 SGR 01125, and founded by the Industrial Doctorate Plan 2021 DI 106, provided by the Agència de Gestió d’Ajuts Universitaris i de Recerca (AGAUR). Open Access funding provided thanks to the CRUE-CSIC agreement with Elsevier

Tipo de documento

Artículo

Versión publicada

peer-reviewed

Lengua

Inglés

Materias y palabras clave

Aprenentatge profund; Anàlisi multivariable; Multivariate analysis; Deep learning

Publicado por

Elsevier

Documentos relacionados

info:eu-repo/semantics/altIdentifier/doi/10.1016/j.cmpb.2025.109057

info:eu-repo/semantics/altIdentifier/issn/0169-2607

info:eu-repo/semantics/altIdentifier/eissn/1872-7565

Citación recomendada

Esta citación se ha generado automáticamente.

Exportar

DIDL MARC MARC_CCUC METS OAI_DC ORE QDC RDF

Derechos

Attribution-NonCommercial 4.0 International

http://creativecommons.org/licenses/by-nc/4.0/

Este ítem aparece en la(s) siguiente(s) colección(ones)

Articles publicats Departament d'Enginyeria Elèctrica, Electrònica i Automàtica [181]

CCLR-DL: A novel statistics and deep learning hybrid method for feature selection and forecasting healthcare demand

Autor/a

Fecha de publicación

Compartir

Resumen

Tipo de documento

Lengua

Materias y palabras clave

Publicado por

Documentos relacionados

Citación recomendada

Exportar

Derechos

Este ítem aparece en la(s) siguiente(s) colección(ones)