On the use of the descriptive variable for enhancing the aggregation of crowdsourced labels

Beñaran-Muñoz, Iker; Hernández-González, Jerónimo; Pérez, Aritz; Beñaran-Muñoz, Iker; Hernández-González, Jerónimo; Pérez, Aritz

On the use of the descriptive variable for enhancing the aggregation of crowdsourced labels

Autor/a

Beñaran-Muñoz, Iker

Hernández-González, Jerónimo

Pérez, Aritz

Fecha de publicación

2022-10-03T09:02:56Z

2022-09-30

2022-10-03T09:02:57Z

Resumen

The use of crowdsourcing for annotating data has become a popular and cheap alternative to expert labelling. As a consequence, an aggregation task is required to combine the different labels provided and agree on a single one per example. Most aggregation techniques, including the simple and robust majority voting¿to select the label with the largest number of votes¿disregard the descriptive information provided by the explanatory variable. In this paper, we propose domain-aware voting, an extension of majority voting which incorporates the descriptive variable and the rest of the instances of the dataset for aggregating the label of every instance. The experimental results with simulated and real-world crowdsourced data suggest that domain-aware voting is a competitive alternative to majority voting, especially when a part of the dataset is unlabelled. We elaborate on practical criteria for the use of domain-aware voting.

Tipo de documento

Artículo

Versión publicada

Lengua

Inglés

Materias y palabras clave

Aprenentatge automàtic; Cultura participativa; Dades massives; Machine learning; Participatory culture; Big data

Publicado por

Springer Verlag

Documentos relacionados

Reproducció del document publicat a: https://doi.org/10.1007/s10115-022-01743-z

Knowledge and Information Systems, 2022

https://doi.org/10.1007/s10115-022-01743-z

Citación recomendada

Esta citación se ha generado automáticamente.

Exportar

DIDL MARC MARC_CCUC METS OAI_DC ORE QDC RDF

Derechos

cc by (c) Iker Beñaran-Muñoz, et al., 2022

http://creativecommons.org/licenses/by/3.0/es/

Este ítem aparece en la(s) siguiente(s) colección(ones)

ISGlobal - Institut de Salut Global de Barcelona [60808]

Matemàtiques i Informàtica [1007]

On the use of the descriptive variable for enhancing the aggregation of crowdsourced labels

Autor/a

Fecha de publicación

Compartir

Resumen

Tipo de documento

Lengua

Materias y palabras clave

Publicado por

Documentos relacionados

Citación recomendada

Exportar

Derechos

Este ítem aparece en la(s) siguiente(s) colección(ones)