The Information Structure-prosody interface in text-to-speech technologies. An empirical perspective

dc.contributor.author
Domínguez, Mónica
dc.contributor.author
Farrús, Mireia
dc.contributor.author
Wanner, Leo
dc.date.issued
2022-02-11T14:35:31Z
dc.date.issued
2022
dc.date.issued
2022-02-11T14:35:32Z
dc.identifier
1613-7027
dc.identifier
https://hdl.handle.net/2445/183091
dc.identifier
706552
dc.description.abstract
The correspondence between a speaker's communicative intention, expressed in terms of Information Structure, and the way the speaker conveys that intention through prosody has been a fruitful field of study in Linguistics. However, text-to-speech applications still lack the variability and richness with which humans express communicative intent in speech. Some attempts were made in the past to model one aspect of Information Structure, namely thematicity, for intonation generation in text-to-speech technologies. Yet these applications suffer from two limitations: (i) they draw upon a small number of made-up, simple question-answer pairs rather than on real (spoken or written) corpus material; and (ii) they do not explore whether any other interpretation would better suit a wider range of textual genres beyond dialogs. In this paper, two different interpretations of thematicity in the field of speech technologies are examined: the state-of-the-art binary (and flat) theme-rheme, and the hierarchical thematicity defined by Igor Mel'čuk within the Meaning-Text Theory. The outcome of experiments on a corpus of native speakers of US English suggests that the latter interpretation of thematicity has a versatile implementation potential for text-to-speech applications of the Information Structure-prosody interface.
dc.format
application/pdf
dc.language
eng
dc.publisher
De Gruyter Mouton
dc.relation
Reproduction of the document published at: https://doi.org/10.1515/cllt-2020-0008
dc.relation
Corpus Linguistics and Linguistic Theory, 2022, vol. 18, num. 2, p. 419-445
dc.relation
https://doi.org/10.1515/cllt-2020-0008
dc.relation
info:eu-repo/grantAgreement/EC/H2020/870930/EU//WELCOME
dc.relation
info:eu-repo/grantAgreement/EC/H2020/645012/EU//KRISTINA
dc.rights
(c) Domínguez, Mónica et al., 2022
dc.rights
info:eu-repo/semantics/openAccess
dc.source
Articles publicats en revistes (Filologia Catalana i Lingüística General)
dc.subject
Lingüística computacional
dc.subject
Anàlisi prosòdica (Lingüística)
dc.subject
Entonació (Fonètica)
dc.subject
Tema i rema
dc.subject
Corpus (Lingüística)
dc.subject
Computational linguistics
dc.subject
Prosodic analysis (Linguistics)
dc.subject
Intonation (Phonetics)
dc.subject
Topic and comment
dc.subject
Corpora (Linguistics)
dc.title
The Information Structure-prosody interface in text-to-speech technologies. An empirical perspective
dc.type
info:eu-repo/semantics/article
dc.type
info:eu-repo/semantics/publishedVersion