On the use of the descriptive variable for enhancing the aggregation of crowdsourced labels

dc.contributor.author
Beñaran-Muñoz, Iker
dc.contributor.author
Hernández-González, Jerónimo
dc.contributor.author
Pérez, Aritz
dc.date.issued
2022-10-03T09:02:56Z
dc.date.issued
2022-10-03T09:02:56Z
dc.date.issued
2022-09-30
dc.date.issued
2022-10-03T09:02:57Z
dc.identifier
0219-1377
dc.identifier
https://hdl.handle.net/2445/189541
dc.identifier
725388
dc.description.abstract
The use of crowdsourcing for annotating data has become a popular and cheap alternative to expert labelling. As a consequence, an aggregation task is required to combine the different labels provided and agree on a single one per example. Most aggregation techniques, including the simple and robust majority voting¿to select the label with the largest number of votes¿disregard the descriptive information provided by the explanatory variable. In this paper, we propose domain-aware voting, an extension of majority voting which incorporates the descriptive variable and the rest of the instances of the dataset for aggregating the label of every instance. The experimental results with simulated and real-world crowdsourced data suggest that domain-aware voting is a competitive alternative to majority voting, especially when a part of the dataset is unlabelled. We elaborate on practical criteria for the use of domain-aware voting.
dc.format
application/pdf
dc.format
application/pdf
dc.language
eng
dc.publisher
Springer Verlag
dc.relation
Reproducció del document publicat a: https://doi.org/10.1007/s10115-022-01743-z
dc.relation
Knowledge and Information Systems, 2022
dc.relation
https://doi.org/10.1007/s10115-022-01743-z
dc.rights
cc by (c) Iker Beñaran-Muñoz, et al., 2022
dc.rights
http://creativecommons.org/licenses/by/3.0/es/
dc.rights
info:eu-repo/semantics/openAccess
dc.source
Articles publicats en revistes (Matemàtiques i Informàtica)
dc.subject
Aprenentatge automàtic
dc.subject
Cultura participativa
dc.subject
Dades massives
dc.subject
Machine learning
dc.subject
Participatory culture
dc.subject
Big data
dc.title
On the use of the descriptive variable for enhancing the aggregation of crowdsourced labels
dc.type
info:eu-repo/semantics/article
dc.type
info:eu-repo/semantics/publishedVersion


Ficheros en el ítem

FicherosTamañoFormatoVer

No hay ficheros asociados a este ítem.

Este ítem aparece en la(s) siguiente(s) colección(ones)