To access the full text documents, please follow this link: http://hdl.handle.net/10854/2106

A non-linear VAD for noisy environments
Solé-Casals, Jordi; Zaiats, Vladimir
Universitat de Vic. Escola Politècnica Superior; Universitat de Vic. Grup de Recerca en Tecnologies Digitals
This paper deals with non-linear transformations for improving the performance of an entropy-based voice activity detector (VAD). The idea of using a non-linear transformation has already been applied in speech linear prediction, or linear predictive coding (LPC), based on source separation techniques, where a score function is added to the classical equations in order to take into account the true distribution of the signal. We explore the possibility of estimating the entropy of frames after applying their score function, instead of using the original frames. We observe that if the signal is clean, the estimated entropy is essentially the same; if the signal is noisy, however, the frames transformed using the score function may give an entropy that differs between voiced and non-voiced frames. Experimental evidence is given to show that this fact enables voice activity detection under high noise, where the simple entropy method fails.
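The idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the entropy or the score function is estimated, so the histogram entropy estimator, the Gaussian kernel density estimate of the score function psi(x) = -p'(x)/p(x), the bandwidth rule, and all function names and parameters below are assumptions made for illustration only.

```python
import numpy as np

def split_into_frames(signal, frame_len=256, hop=128):
    """Cut a 1-D signal into overlapping frames (assumes len(signal) >= frame_len)."""
    signal = np.asarray(signal, dtype=float)
    n_frames = 1 + (len(signal) - frame_len) // hop
    return np.stack([signal[i * hop : i * hop + frame_len] for i in range(n_frames)])

def histogram_entropy(frame, bins=50):
    """Shannon entropy (in nats) of the amplitude histogram of one frame."""
    counts, _ = np.histogram(frame, bins=bins)
    p = counts[counts > 0] / counts.sum()
    return float(-np.sum(p * np.log(p)))

def score_function(frame):
    """Estimate psi(x) = -p'(x)/p(x) at each sample of the frame.

    Assumption: p is a Gaussian kernel density estimate built from the frame
    itself, with a normal-reference rule-of-thumb bandwidth.
    """
    x = np.asarray(frame, dtype=float)
    h = max(1.06 * x.std() * x.size ** (-0.2), 1e-12)  # floor avoids division by zero
    u = (x[:, None] - x[None, :]) / h                  # pairwise scaled differences
    k = np.exp(-0.5 * u ** 2)                          # unnormalised Gaussian kernel
    # p(x) is proportional to sum(k) and p'(x) to -sum(u * k) / h,
    # so the normalising constants cancel in the ratio.
    return (u * k).sum(axis=1) / (h * k.sum(axis=1))

def frame_entropies(signal, use_score=False, frame_len=256, hop=128, bins=50):
    """Per-frame entropy, optionally computed on score-transformed frames."""
    frames = split_into_frames(signal, frame_len, hop)
    if use_score:
        frames = np.stack([score_function(f) for f in frames])
    return np.array([histogram_entropy(f, bins) for f in frames])

if __name__ == "__main__":
    # Synthetic stand-in for speech in noise: a tone burst buried in white noise.
    rng = np.random.default_rng(0)
    t = np.arange(4096)
    burst = np.where((t > 1500) & (t < 2500), np.sin(2 * np.pi * 0.05 * t), 0.0)
    noisy = burst + rng.standard_normal(t.size)
    print("plain entropy:", np.round(frame_entropies(noisy), 2))
    print("score entropy:", np.round(frame_entropies(noisy, use_score=True), 2))
```

A detector would then compare the per-frame entropy against a threshold; how that threshold is chosen is not stated in the abstract and is left out of the sketch.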
Speech processing
All rights reserved
(c) Springer (The original publication is available at www.springerlink.com)
Article
info:eu-repo/acceptedVersion
Springer
         
