Bridging realities: training visuo-haptic object recognition models for robots using 3D virtual simulations

This paper proposes an approach for training visuo-haptic object recognition models for robots using synthetic datasets generated by 3D virtual simulations. In robotics, where visual object recognition has witnessed considerable progress due to an abundance of image datasets, the scarcity of diverse haptic samples has resulted in a noticeable gap in research on machine learning incorporating the haptic sense. Our proposed methodology addresses this challenge by utilizing 3D virtual simulations to create realistic synthetic datasets, offering a scalable and cost-effective solution to integrate haptic and visual cues for object recognition seamlessly. Acknowledging the importance of multimodal perception, particularly in robotic applications, our research not only closes the existing gap but envisions a future where intelligent agents possess a holistic understanding of their environment derived from both visual and haptic senses. Our experiments show that synthetic datasets can be used for training object recognition in haptic and visual modes by incorporating noise, performing some preprocessing, data augmentation, or domain adaptation. This work contributes to the advancement of multimodal machine learning toward a more nuanced and comprehensive robotic perception.

Tipus de document

Article

Versió del document

Versió publicada

Llengua

Anglès

Matèries CDU

004 - Informàtica; 61 - Medicina; 62 - Enginyeria. Tecnologia; 68 - Indústries, oficis i comerç d'articles acabats. Tecnologia cibernètica i automàtica

Matèries i paraules clau

Object recognition; Rehabilitation robotics; Robotic engineering; Robotics; Sensorimotor processing; Virtual and augmented reality; Synthetic data generation for computer vision applications

Pàgines

13 p.

Publicat per

Springer

Publicat a

The Visual Computer, 2024. Vol. 40

Citació recomanada

Aquesta citació s'ha generat automàticament.

Exportar

DIDL MARC MARC_CCUC METS OAI_DC ORE QDC RDF

Drets

Attribution 4.0 International

Aquest element apareix en la col·lecció o col·leccions següent(s)

La Salle [1096]