Title:
|
UPC system for the 2016 MediaEval multimodal person discovery in broadcast TV task
|
Author(s):
|
India Massana, Miquel Àngel; Martí Juan, Gerard; Sayrol Clols, Elisa; Morros Rubió, Josep Ramon; Hernando Pericás, Francisco Javier; Cortillas, Carla; Bouritsas, Giorgos
|
Other authors:
|
Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions; Universitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo; Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla |
Abstract:
|
The UPC system extracts monomodal signal segments (face tracks and speech segments) that overlap with the person names overlaid on the video signal. Each of these segments is assigned the overlaid name directly and then serves as a reference against which the non-overlapping (unassigned) segments are compared. This process is performed independently on the speech and video signals. A simple fusion scheme then combines the two monomodal annotations into a single one. |
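The pipeline described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the `Segment` and `Overlay` types, the use of cosine similarity over a generic feature vector, the `threshold` value, and the union-style `fuse` function are all hypothetical stand-ins for whatever features, comparison measure, and fusion rule the authors actually used.

```python
from dataclasses import dataclass
from math import sqrt
from typing import List, Optional, Tuple


@dataclass
class Segment:
    """A monomodal segment (face track or speech segment). Hypothetical type."""
    start: float
    end: float
    feature: List[float]          # assumed embedding; the real features are not specified here
    name: Optional[str] = None


@dataclass
class Overlay:
    """A person name overlaid on the video during [start, end]. Hypothetical type."""
    start: float
    end: float
    name: str


def overlaps(a: Segment, b: Overlay) -> bool:
    # Two intervals overlap when each one starts before the other ends.
    return a.start < b.end and b.start < a.end


def cosine(u: List[float], v: List[float]) -> float:
    dot = sum(x * y for x, y in zip(u, v))
    norm = sqrt(sum(x * x for x in u)) * sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


def annotate(segments: List[Segment], overlays: List[Overlay],
             threshold: float = 0.8) -> List[Segment]:
    # Step 1: segments that overlap an overlaid name take that name directly.
    for seg in segments:
        for ov in overlays:
            if overlaps(seg, ov):
                seg.name = ov.name
                break
    references = [s for s in segments if s.name is not None]

    # Step 2: each unassigned segment is compared against the references and
    # takes the name of the closest one, if it is similar enough (assumed rule).
    for seg in segments:
        if seg.name is not None or not references:
            continue
        best = max(references, key=lambda r: cosine(seg.feature, r.feature))
        if cosine(seg.feature, best.feature) >= threshold:
            seg.name = best.name
    return segments


def fuse(faces: List[Segment],
         speech: List[Segment]) -> List[Tuple[float, float, str, str]]:
    # Assumed fusion rule: the union of named segments from both modalities.
    merged = [(s.start, s.end, s.name, "face") for s in faces if s.name]
    merged += [(s.start, s.end, s.name, "speech") for s in speech if s.name]
    return sorted(merged, key=lambda t: t[0])
```

In this sketch, `annotate` would be called once on the face tracks and once on the speech segments with the same list of overlays, and `fuse` would then merge the two results, mirroring the independent monomodal processing followed by a single combined annotation.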
Subject(s):
|
Speech processing; Video processing; Broadcasting; Television; Video signal |
Rights:
|
|
Document type:
|
Article - Published version; Conference object |
Publisher:
|
CEUR-WS.org
|