Audio segmentation of broadcast news in the Albayzin-2010 evaluation: overview, results, and discussion
Butko, Taras; Nadeu Camprubí, Climent
Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions; Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Audio segmentation has recently attracted research interest because of its usefulness in applications such as audio indexing and retrieval, subtitling, and monitoring of acoustic scenes. Moreover, a preceding audio segmentation stage may help improve the robustness of speech technologies such as automatic speech recognition and speaker diarization. In this article, we present the evaluation of broadcast news audio segmentation systems carried out in the context of the Albayzín-2010 evaluation campaign. The evaluation consisted of segmenting audio from the 3/24 Catalan TV channel into five acoustic classes: music, speech, speech over music, speech over noise, and other. The evaluation results showed the difficulty of this segmentation task. After presenting the database and metric, as well as the feature extraction methods and segmentation techniques used by the submitted systems, we analyze and compare the experimental results, with the aim of gaining insight into the proposed solutions and identifying promising directions.
Peer Reviewed
UPC subject areas::Electronic engineering and telecommunications::Signal processing::Speech and acoustic signal processing
Audio segmentation
Broadcast news
Albayzin 2010
Sound -- Data processing
