Bridging realities: training visuo-haptic object recognition models for robots using 3D virtual simulations

Other authors

Universitat Ramon Llull. La Salle

De La Salle University

Publication date

2024-07



Abstract

This paper proposes an approach for training visuo-haptic object recognition models for robots using synthetic datasets generated by 3D virtual simulations. In robotics, where visual object recognition has witnessed considerable progress due to an abundance of image datasets, the scarcity of diverse haptic samples has resulted in a noticeable gap in research on machine learning incorporating the haptic sense. Our proposed methodology addresses this challenge by utilizing 3D virtual simulations to create realistic synthetic datasets, offering a scalable and cost-effective solution to integrate haptic and visual cues for object recognition seamlessly. Acknowledging the importance of multimodal perception, particularly in robotic applications, our research not only closes the existing gap but envisions a future where intelligent agents possess a holistic understanding of their environment derived from both visual and haptic senses. Our experiments show that synthetic datasets can be used for training object recognition in haptic and visual modes by incorporating noise, performing some preprocessing, data augmentation, or domain adaptation. This work contributes to the advancement of multimodal machine learning toward a more nuanced and comprehensive robotic perception.

Document type

Article

Document version

Published version

Language

English

Pages

13 p.

Published by

Springer

Published in

The Visual Computer, 2024. Vol. 40

Rights

© The author(s)

Attribution 4.0 International

This item appears in the following collection(s)

La Salle [1096]