Audio clip classification using social tags and the effect of tag expansion

To access the full-text document, please follow this link: http://hdl.handle.net/10230/35014

Title:	Audio clip classification using social tags and the effect of tag expansion
Author:	Font Corbera, Frederic; Serrà Julià, Joan; Serra, Xavier
Note:	Paper presented at the 53rd International Conference: Semantic Audio, held from 27 to 29 January 2014 in London, United Kingdom.
Abstract:	Methods for automatic sound and music classification are of great value when trying to organise the large amounts of unstructured, user-contributed audio content uploaded to online sharing platforms. Currently, most of these methods are based on the audio signal, leaving the exploitation of users’ annotations or other contextual data rather unexplored. In this paper, we describe a method for the automatic classification of audio clips which is solely based on user-supplied tags. As a novelty, the method includes a tag expansion step for increasing classification accuracy when audio clips are scarcely tagged. Our results suggest that very high accuracies can be achieved in tag-based audio classification (even for poorly or badly annotated clips), and that the proposed tag expansion step can, in some cases, significantly increase classification performance. We are interested in the use of the described classification method as a first step for tailoring assistive tagging systems to the particularities of different audio categories, and as a way to improve the overall quality of online user annotations.
Funding:	This work has been supported by BES-2010-037309 FPI from the Spanish Ministry of Science and Innovation (TIN2009-14247-C02-01; F.F.), 2009-SGR-1434 from Generalitat de Catalunya (J.S.), JAEDOC069/2010 from Consejo Superior de Investigaciones Científicas (J.S.), ICT-2011-8-318770 from the European Commission (J.S.), and FP7-2007-2013 / ERC grant agreement 267583 (CompMusic).
Subject(s):	Automatic classification; Musical forms
Rights:	© Audio Engineering Society 2014. This paper was presented at the 53rd Conference of the Audio Engineering Society, as paper number 2-3. The full published version can be found at http://www.aes.org/e-lib/browse.cfm?elib=17091
Document type:	Conference Object Article - Accepted version
Published by:	Audio Engineering Society
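
The idea summarised in the abstract, classifying clips from their tags alone and expanding the tag set of scarcely tagged clips first, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the abstract does not say which classifier or expansion technique the authors use, so this sketch swaps in co-occurrence-based expansion and a multinomial naive Bayes classifier, and the toy data, category names, and function names (`cooccurrence`, `expand`, `train_nb`, `classify`) are hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations
import math

# Hypothetical toy training set of (tags, category) pairs; not data from the paper.
TRAIN = [
    ({"guitar", "chord", "acoustic"}, "music"),
    ({"guitar", "loop", "120bpm"}, "music"),
    ({"piano", "chord", "loop"}, "music"),
    ({"rain", "field-recording", "nature"}, "soundscape"),
    ({"birds", "nature", "forest"}, "soundscape"),
    ({"rain", "forest", "ambient"}, "soundscape"),
]


def cooccurrence(clips):
    """Count how often each pair of tags appears on the same clip."""
    counts = defaultdict(Counter)
    for tags, _ in clips:
        for a, b in combinations(sorted(tags), 2):
            counts[a][b] += 1
            counts[b][a] += 1
    return counts


def expand(tags, counts, k=2):
    """Assumed expansion step: add up to k tags that co-occur most with the input tags."""
    scores = Counter()
    for t in tags:
        for other, c in counts.get(t, {}).items():
            if other not in tags:
                scores[other] += c
    # Sort by descending score, then alphabetically, so ties are deterministic.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return set(tags) | {t for t, _ in ranked[:k]}


def train_nb(clips):
    """Collect per-category tag counts for a multinomial naive Bayes model."""
    tag_counts = defaultdict(Counter)
    class_counts = Counter()
    vocab = set()
    for tags, label in clips:
        class_counts[label] += 1
        tag_counts[label].update(tags)
        vocab |= tags
    return tag_counts, class_counts, vocab


def classify(tags, model):
    """Return the category with the highest log-posterior (add-one smoothing)."""
    tag_counts, class_counts, vocab = model
    total = sum(class_counts.values())
    best, best_score = None, -math.inf
    for label in sorted(class_counts):
        n = sum(tag_counts[label].values())
        score = math.log(class_counts[label] / total)
        for t in tags:
            if t in vocab:  # tags never seen in training are ignored
                score += math.log((tag_counts[label][t] + 1) / (n + len(vocab)))
        if score > best_score:
            best, best_score = label, score
    return best


model = train_nb(TRAIN)
counts = cooccurrence(TRAIN)
sparse = {"ambient"}  # a scarcely tagged clip with a single tag
print(classify(expand(sparse, counts), model))  # prints "soundscape"
```

In this toy setting the single tag `ambient` is expanded with `forest` and `rain` before classification, which gives the classifier more evidence to work with. Whether expansion helps on real data depends on the quality of the added tags, which is consistent with the abstract's claim that the expansion step improves performance only in some cases.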