Influence of TTS systems performance on reaction times in people with aphasia

Publication date

2022-02-07T18:03:07Z

2021-11-29

Abstract

Text-to-speech (TTS) systems provide fundamental reading support for people with aphasia and reading difficulties. However, artificial voices are more difficult to process than natural voices. The current study extends the analysis of a clinical experiment that investigated which of four voices, three artificial and one digitised human voice, is most suitable for people with aphasia and reading impairments. The results of that experiment show that the voice synthesised with Ogmios TTS, a concatenative speech synthesis system, caused significantly slower reaction times than the other three voices. The present study explores whether, and which, voice quality metrics are linked to the delayed reaction times. For this purpose, the voices were analysed using an automatic assessment of intelligibility, naturalness, and the jitter and shimmer voice quality parameters. This analysis revealed that Ogmios TTS generally performed worse than the other voices on all parameters. These observations could explain the significantly delayed reaction times in people with aphasia and reading impairments when listening to Ogmios TTS, and they suggest that voice analysis of these parameters could inform the choice of TTS for compensatory devices for these patients.

Document type

Article

Published version

Language

English

Published by

MDPI

Related documents

Reproduction of the document published at: https://doi.org/10.3390/app112311320

Applied Sciences, 2021, vol. 11, num. 23, p. 11320

https://doi.org/10.3390/app112311320

Recommended citation

This citation was generated automatically.

Rights

cc-by (c) Cistola, Giorgia et al., 2021

https://creativecommons.org/licenses/by/4.0/
