A regional perspective on the accuracy of machine learning forecasts of tourism demand based on data characteristics

Publication date

2018-04-06T08:40:00Z

2018-04-06T08:40:00Z

2018

2018-04-06T08:40:00Z

Abstract

In this work we assess the role of data characteristics in the accuracy of machine learning (ML) tourism forecasts from a spatial perspective. First, we apply a seasonal-trend decomposition procedure based on non-parametric regression to isolate the different components of the time series of international tourism demand to all Spanish regions. This approach allows us to compute a set of measures to describe the features of the data. Second, we analyse the performance of several ML models in a recursive multiple-step-ahead forecasting experiment. In a third step, we rank all seventeen regions according to their characteristics and the obtained forecasting performance, and use the rankings as the input for a multivariate analysis to evaluate the interactions between time series features and the accuracy of the predictions. By means of dimensionality reduction techniques we summarise all the information into two components and project all Spanish regions into perceptual maps. We find that entropy and dispersion show a negative relation with accuracy, while the effect of other data characteristics on forecast accuracy is heavily dependent on the forecast horizon.

Document Type

Working document

Language

English

Publisher

Universitat de Barcelona. Facultat d'Economia i Empresa

Related items

Reproducció del document publicat a: http://www.ub.edu/irea/working_papers/2018/201805.pdf

IREA – Working Papers, 2018, IR18/05

AQR – Working Papers, 2018, AQR18/02

[WP E-IR18/05]

[WP E-AQR18/02]

Recommended citation

This citation was generated automatically.

Rights

cc-by-nc-nd, (c) Clavería González et al., 2018

http://creativecommons.org/licenses/by-nc-nd/3.0/