i-Vector modeling with deep belief networks for multi-session speaker recognition

Inicio | ¿Qué es? | Contacto

English | Català

Consultar RECERCAT

Por comunidades y
colecciones Por fecha Por autores Por títulos Por temas (CDU)

Consultar departamento

Por fecha Por autores Por títulos Por temas (CDU)

Estadisticas

Del documento Todo RECERCAT

Mi RECERCAT

Entrar Alertas por correo-e

Directorio de otros repositorios

RECERCAT Principal > Universitat Politècnica de Catalunya > Documents de recerca > Visualizar documento

Para acceder a los documentos con el texto completo, por favor, siga el siguiente enlace: http://hdl.handle.net/2117/27035

Título:	i-Vector modeling with deep belief networks for multi-session speaker recognition
Autor/a:	Ghahabi Esfahani, Omid; Hernando Pericás, Francisco Javier
Otros autores:	Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions; Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Abstract:	In this paper we propose an impostor selection method for a Deep Belief Network (DBN) based system which models i-vectors in a multi-session speaker verification task. In the proposed method, instead of choosing a fixed number of most informative impostors, a threshold is defined according to the frequencies of impostors. The selected impostors are then clustered and the centroids are considered as the final impostors for target speakers. The system first trains each target speaker unsupervisingly by an adaptation method and then models discriminatively each target speaker using the impostor centroids and target i-vectors. The evaluation is performed on the NIST 2014 i-vector challenge database and it is shown that the proposed DBN-based system achieves 23% relative improvement of minDCF over the baseline system in the challenge
Materia(s):	-Àrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic -Speech processing systems -Automatic speech recognition -Reconeixement automàtic de la parla -Processament de la parla
Derechos:
Tipo de documento:	Artículo - Versión publicada Objeto de conferencia
Compartir:

Mostrar el registro completo del ítem

Documentos relacionados

Otros documentos del mismo autor/a

Global impostor selection for DBNs in multi-session i-vector speaker recognition

Ghahabi Esfahani, Omid; Hernando Pericás, Francisco Javier

Deep belief networks for i-vector based speaker recognition

Ghahabi Esfahani, Omid; Hernando Pericás, Francisco Javier

Deep neural networks for i-vector language identification of short utterances in cars

Ghahabi Esfahani, Omid; Bonafonte Cávez, Antonio; Hernando Pericás, Francisco Javier; Moreno Bilbao, M. Asunción

On the acoustic environment of a neonatal intensive care unit: initial description, and detection of equipment alarms

Raboshchuk, Ganna; Nadeu Camprubí, Climent; Ghahabi Esfahani, Omid; Solvez, Sergi; Muñoz Mahamud, Blanca; Riverola de Veciana, Ana; Navarro Hervas, Santiago

On the improvement of speaker diarization by detecting overlapped speech

Hernando Pericás, Francisco Javier; Hernando Pericás, Francisco Javier

Accesibilidad | Aviso legal | Política de Cookies | Documentos de uso interno

Coordinación

Patrocinio