dc.contributor |
Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions |
dc.contributor |
Universitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo |
dc.contributor.author |
Lin, Xiao |
dc.contributor.author |
Casas Pla, Josep Ramon |
dc.contributor.author |
Pardàs Feliu, Montse |
dc.date |
2018-03-02 |
dc.identifier.citation |
Lin, X., Casas, J., Pardas, M. Temporally coherent 3D point cloud video segmentation in generic scenes. "IEEE transactions on image processing", 2 Març 2018, vol. 27, núm. 6, p. 3087-3099. |
dc.identifier.citation |
1057-7149 |
dc.identifier.citation |
10.1109/TIP.2018.2811541 |
dc.identifier.uri |
http://hdl.handle.net/2117/120434 |
dc.language.iso |
eng |
dc.relation |
https://ieeexplore.ieee.org/document/8306148/ |
dc.relation |
info:eu-repo/grantAgreement/ES/1PE/TEC2013-43935-R |
dc.relation |
info:eu-repo/grantAgreement/ES/1PE/TEC2016-75976-R |
dc.rights |
info:eu-repo/semantics/openAccess |
dc.subject |
Àrees temàtiques de la UPC::So, imatge i multimèdia::Creació multimèdia::Imatge digital |
dc.subject |
Image processing -- Digital techniques |
dc.subject |
Video segmentation |
dc.subject |
RGBD data |
dc.subject |
Point clouds |
dc.subject |
3D connectivity |
dc.subject |
Hierarchical segmentation |
dc.subject |
Imatges -- Processament -- Tècniques digitals |
dc.title |
Temporally coherent 3D point cloud video segmentation in generic scenes |
dc.type |
info:eu-repo/semantics/submittedVersion |
dc.type |
info:eu-repo/semantics/article |
dc.description.abstract |
© 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes,creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
dc.description.abstract |
Video segmentation is an important building block for high level applications, such as scene understanding and interaction analysis. While outstanding results are achieved in this field by the state-of-the-art learning and model-based methods, they are restricted to certain types of scenes or require a large amount of annotated training data to achieve object segmentation in generic scenes. On the other hand, RGBD data, widely available with the introduction of consumer depth sensors, provide actual world 3D geometry compared with 2D images. The explicit geometry in RGBD data greatly help in computer vision tasks, but the lack of annotations in this type of data may also hinder the extension of learning-based methods to RGBD. In this paper, we present a novel generic segmentation approach for 3D point cloud video (stream data) thoroughly exploiting the explicit geometry in RGBD. Our proposal is only based on low level features, such as connectivity and compactness. We exploit temporal coherence by representing the rough estimation of objects in a single frame with a hierarchical structure and propagating this hierarchy along time. The hierarchical structure provides an efficient way to establish temporal correspondences at different scales of object-connectivity and to temporally manage the splits and merges of objects. This allows updating the segmentation according to the evidence observed in the history. The proposed method is evaluated on several challenging data sets, with promising results for the presented approach. |
dc.description.abstract |
Peer Reviewed |