Título:
|
Feature classification by means of Deep Belief Networks for speaker recognition
|
Autor/a:
|
Safari, Pooyan; Ghahabi, Omid; Hernando Pericás, Francisco Javier
|
Otros autores:
|
Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions; Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla |
Abstract:
|
In this paper, we propose to discriminatively model target
and impostor spectral features using Deep Belief Networks
(DBNs) for speaker recognition. In the feature level, the number
of impostor samples is considerably large compared to
previous works based on i-vectors. Therefore, those i-vector
based impostor selection algorithms are not computationally
practical. On the other hand, the number of samples for each
target speaker is different from one speaker to another which
makes the training process more difficult. In this work, we
take advantage of DBN unsupervised learning to train a global
model, which will be referred to as Universal DBN (UDBN).
Then we adapt this UDBN to the data of each target speaker.
The evaluation is performed on the core test condition of the
NIST SRE 2006 database and it is shown that the proposed
architecture achieves more than 8% relative improvement in
comparison to the conventional Multilayer Perceptron (MLP). |
Abstract:
|
Peer Reviewed |
Materia(s):
|
-Àrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic -Automatic speech recognition -Speaker recognition -Deep Belief Network -Restricted Boltzmann Machine -Feature classification -Reconeixement automàtic de la parla |
Derechos:
|
|
Tipo de documento:
|
Artículo - Versión publicada Objeto de conferencia |
Editor:
|
Institute of Electrical and Electronics Engineers (IEEE)
|
Compartir:
|
|