The Information Structure-prosody interface in text-to-speech technologies. An empirical perspective

Domínguez, Mónica; Farrús, Mireia; Wanner, Leo; Domínguez, Mónica; Farrús, Mireia; Wanner, Leo

The Information Structure-prosody interface in text-to-speech technologies. An empirical perspective

Author

Domínguez, Mónica

Farrús, Mireia

Wanner, Leo

Publication date

2022-02-11T14:35:31Z

2022

2022-02-11T14:35:32Z

Abstract

The correspondence between the communicative intention of a speaker in terms of Information Structure and the way this speaker reflects communicative aspects by means of prosody have been a fruitful field of study in Linguistics. However, text-to-speech applications still lack the variability and richness found in human speech in terms of how humans display their communication skills. Some attempts were made in the past to model one aspect of Information Structure, namely thematicity for its application to intonation generation in text-to-speech technologies. Yet, these applications suffer from two limitations: (i) they draw upon a small number of made-up simple question-answer pairs rather than on real (spoken or written) corpus material; and (ii) they do not explore whether any other interpretation would better suit a wider range of textual genres beyond dialogs. In this paper, two different interpretations of thematicity in the field of speech technologies are examined: the state-of-art binary (and flat) theme-rheme, and the hierarchical thematicity defined by Igor Mel'čuk within the Meaning-Text Theory. The outcome of the experiments on a corpus of native speakers of US English suggests that the latter interpretation of thematicity has a versatile implementation potential for text-to-speech applications of the Information Structure-prosody interface.

Document Type

Article

Published version

Language

English

Subjects and keywords

Lingüística computacional; Anàlisi prosòdica (Lingüística); Entonació (Fonètica); Tema i rema; Corpus (Lingüística); Computational linguistics; Prosodic analysis (Linguistics); Intonation (Phonetics); Topic and comment; Corpora (Linguistics)

Publisher

De Gruyter Mouton

Related items

Reproducció del document publicat a: https://doi.org/10.1515/cllt-2020-0008

Corpus Linguistics and Linguistic Theory, 2022, vol. 18, num. 2, p. 419-445

https://doi.org/10.1515/cllt-2020-0008

info:eu-repo/grantAgreement/EC/H2020/870930/EU//WELCOME

info:eu-repo/grantAgreement/EC/H2020/645012/EU//KRISTINA

Recommended citation

This citation was generated automatically.

Export

DIDL MARC MARC_CCUC METS OAI_DC ORE QDC RDF

Rights

This item appears in the following Collection(s)

Filologia Catalana i Lingüística General [949]

ISGlobal - Institut de Salut Global de Barcelona [60793]

The Information Structure-prosody interface in text-to-speech technologies. An empirical perspective

Author

Publication date

Share

Abstract

Document Type

Language

Subjects and keywords

Publisher

Related items

Recommended citation

Export

Rights

This item appears in the following Collection(s)