Expressive speech synthesis using sentiment embeddings

Inicio | ¿Qué es? | Contacto

English | Català

Consultar RECERCAT

Por comunidades y
colecciones Por fecha Por autores Por títulos Por temas (CDU)

Consultar departamento

Por fecha Por autores Por títulos Por temas (CDU)

Estadisticas

Del documento Todo RECERCAT

Mi RECERCAT

Entrar Alertas por correo-e

Directorio de otros repositorios

RECERCAT Principal > Universitat Politècnica de Catalunya > Documents de recerca > Visualizar documento

Para acceder a los documentos con el texto completo, por favor, siga el siguiente enlace: http://hdl.handle.net/2117/123860

Título:	Expressive speech synthesis using sentiment embeddings
Autor/a:	Jauk, Igor; Lorenzo Trueba, J.; Yamagishi, J.; Bonafonte Cávez, Antonio
Otros autores:	Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions; Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Abstract:	In this paper we present a DNN based speech synthesis system trained on an audiobook including sentiment features predicted by the Stanford sentiment parser. The baseline system uses DNN to predict acoustic parameters based on conventional linguistic features, as they have been used in statistical parametric speech synthesis. The predicted parameters are transformed into speech using a conventional high-quality vocoder. In this paper, the conventional linguistic features are enriched using sentiment features. Different sentiment representations have been considered, combining sentiment probabilities with hierarchical distance and context. After preliminary analysis a listening experiment is conducted, where participants evaluate the different systems. The results show the usefulness of the proposed features and reveal differences between expert and non-expert TTS user.
Abstract:	Peer Reviewed
Materia(s):	-Àrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic -Automatic speech recognition -DNN -Expressive speech synthesis -Sentiment analysis -TTS Linguistics -Sentiment analysis -Speech synthesis -Acoustic parameters -Baseline systems -Expressive speech synthesis -Linguistic features -Preliminary analysis -Sentiment features -Speech synthesis system -Statistical parametric speech synthesis -Speech communication -Reconeixement automàtic de la parla
Derechos:
Tipo de documento:	Artículo - Versión publicada Objeto de conferencia
Editor:	International Speech Communication Association (ISCA)
Compartir:

Mostrar el registro completo del ítem

Documentos relacionados

Otros documentos del mismo autor/a

Corpus for cyberbullying prevention

Moreno Bilbao, M. Asunción; Bonafonte Cávez, Antonio; Jauk, Igor; Tarrés, Laia; Pereira, Victor

Creating expressive synthetic voices by unsupervised clustering of audiobooks

Jauk, Igor; Bonafonte Cávez, Antonio; López Otero, Paula; Docio Fernández, Laura

Direct expressive voice training based on semantic selection

Jauk, Igor; Bonafonte Cávez, Antonio

Prosodic and spectral iVectors for expressive speech synthesis

Jauk, Igor; Bonafonte Cávez, Antonio

Acoustic feature prediction from semantic features for expressive speech using deep neural networks

Jauk, Igor; Bonafonte Cávez, Antonio; Pascual, Santiago

Accesibilidad | Aviso legal | Política de Cookies | Documentos de uso interno

Coordinación

Patrocinio