Influence of TTS systems performance on reaction times in people with aphasia

Publication date

2022-02-07T18:03:07Z

2022-02-07T18:03:07Z

2021-11-29

2022-02-07T18:03:07Z

Abstract

Text-to-speech (TTS) systems provide fundamental reading support for people with aphasia and reading difficulties. However, artificial voices are more difficult to process than natural voices. The current study is an extended analysis of the results of a clinical experiment investigating which, among three artificial voices and a digitised human voice, is more suitable for people with aphasia and reading impairments. Such results show that the voice synthesised with Ogmios TTS, a concatenative speech synthesis system, caused significantly slower reaction times than the other three voices used in the experiment. The present study explores whether and what voice quality metrics are linked to delayed reaction times. For this purpose, the voices were analysed using an automatic assessment of intelligibility, naturalness, and jitter and shimmer voice quality parameters. This analysis revealed that Ogmios TTS, in general, performed worse than the other voices in all parameters. These observations could explain the significantly delayed reaction times in people with aphasia and reading impairments when listening to Ogmios TTS and could open up consideration about which TTS to choose for compensative devices for these patients based on the voice analysis of these parameters.

Document Type

Article


Published version

Language

English

Publisher

MDPI

Related items

Reproducció del document publicat a: https://doi.org/10.3390/app112311320

Applied Sciences, 2021, vol. 11, num. 23, p. 11320

https://doi.org/10.3390/app112311320

Recommended citation

This citation was generated automatically.

Rights

cc-by (c) Cistola, Giorgia et al., 2021

https://creativecommons.org/licenses/by/4.0/

This item appears in the following Collection(s)