The demiphone: an efficient subword unit for Continuous Speech Recognition

Mariño Acebal, José Bernardo; Nogueiras Rodríguez, Albino; Bonafonte Cávez, Antonio

Author

Mariño Acebal, José Bernardo

Nogueiras Rodríguez, Albino

Bonafonte Cávez, Antonio

Other authors

Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions

Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla

Publication date

1997

Abstract

In this paper we introduce the demiphone as a contextual phonetic unit for continuous speech recognition. A phone is divided into two parts: a left demiphone that accounts for left-side coarticulation and a right demiphone that accounts for the right-side context. This new unit ignores any dependence between the effects of the two contexts, but allows better training of the transitions between phones. The demiphone can be seen as a heuristic clustering of states that allows smoother training of hidden Markov models and also provides a simple way to build unseen triphones. We report experimental evidence that demiphones outperform the usual combination of triphones, right-side and left-side biphones, and monophones.
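
The decomposition described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the `l-p` and `p+r` labels, the `Triphone` type, and the function names are assumptions made here for illustration only. It shows how a triphone maps to a left and a right demiphone, and why a triphone never seen in training can still be composed when both of its halves were seen in other triphones.

```python
from typing import Iterable, NamedTuple, Set, Tuple


class Triphone(NamedTuple):
    """A phone with its left and right phonetic context (hypothetical type)."""
    left: str
    phone: str
    right: str


def to_demiphones(t: Triphone) -> Tuple[str, str]:
    """Split a triphone into a left and a right demiphone.

    The label format ("l-p" and "p+r") is an assumed notation.
    Each half depends on only one side of the context.
    """
    left_demi = f"{t.left}-{t.phone}"
    right_demi = f"{t.phone}+{t.right}"
    return left_demi, right_demi


def demiphone_inventory(triphones: Iterable[Triphone]) -> Set[str]:
    """Collect every demiphone that occurs in the given triphones."""
    inventory: Set[str] = set()
    for t in triphones:
        inventory.update(to_demiphones(t))
    return inventory


def can_compose(t: Triphone, inventory: Set[str]) -> bool:
    """A triphone can be built if both of its demiphones are available."""
    left_demi, right_demi = to_demiphones(t)
    return left_demi in inventory and right_demi in inventory


# Toy training set: two triphones of the phone "a".
seen = [Triphone("k", "a", "s"), Triphone("t", "a", "n")]
inventory = demiphone_inventory(seen)

# "k-a+n" was never observed, but "k-a" and "a+n" were.
unseen = Triphone("k", "a", "n")
print(can_compose(unseen, inventory))
```

In this toy setting the unseen triphone is assembled from halves trained on different triphones, which is the sense in which the two context effects are treated as independent.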

Peer Reviewed

Postprint (published version)

Document Type

Conference report

Language

English

Subjects and keywords

UPC subject areas::Telecommunications engineering; Telecommunication

Publisher

Editors: G. Kokkinakis, N. Fakotakis, E. Dermatas; Publisher: WCL, University of Patras, Greece

Related items

http://www.isca-speech.org/archive/eurospeech_1997/e97_1215.html

Rights

http://creativecommons.org/licenses/by-nc-nd/3.0/es/

Open Access

Attribution-NonCommercial-NoDerivs 3.0 Spain

This item appears in the following Collection(s)

E-prints [73012]
