A Generative Framework for Multimodal Learning of Spatial Concepts and Object Categories: An Unsupervised Part-of-Speech Tagging and 3D Visual Perception Based Approach

Amir Aly; Akira Taniguchi; Tadahiro Taniguchi

Communication Dans Un Congrès Année : 2017

A Generative Framework for Multimodal Learning of Spatial Concepts and Object Categories: An Unsupervised Part-of-Speech Tagging and 3D Visual Perception Based Approach

(1, 2) , (2) , (2)

1
2

Amir Aly

Fonction : Auteur
PersonId : 3554
IdHAL : amir-aly
ORCID : 0000-0001-5169-0679
IdRef : 177803185

Robotique et Vision

Ritsumeikan University

Akira Taniguchi

Fonction : Auteur

Ritsumeikan University

Tadahiro Taniguchi

Fonction : Auteur

Ritsumeikan University

Résumé

Future human-robot collaboration employs language in instructing a robot about specific tasks to perform in its surroundings. This requires the robot to be able to associate spatial knowledge with language to understand the details of an assigned task so as to behave appropriately in the context of interaction. In this paper, we propose a probabilistic framework for learning the meaning of language spatial concepts (spatial prepositions) and object categories based on visual cues representing spatial layouts and geometric characteristics of objects in a tabletop scene. The model investigates unsupervised Part-of-Speech (POS) tagging through a Hidden Markov Model (HMM) that infers the corresponding hidden tags to words. Spatial configurations and geometric characteristics of objects on the tabletop are described through 3D point cloud information that encodes spatial semantics and categories of referents and landmarks in the environment. The proposed model is evaluated through human user interaction with Toyota HSR robot, where the obtained results show the significant effect of the model in making the robot able to successfully engage in interaction with the user in space.

Domaines

Robotique [cs.RO]

Fichier principal

ICDL-Epirob-2017.pdf (646.29 Ko)

Amir ALY : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01953470

Soumis le : mercredi 2 janvier 2019-18:11:57

Dernière modification le : mercredi 11 mai 2022-15:20:03

Archivage à long terme le : mercredi 3 avril 2019-12:11:23

Dates et versions

hal-01953470 , version 1 (02-01-2019)

Identifiants

HAL Id : hal-01953470 , version 1

Citer

Amir Aly, Akira Taniguchi, Tadahiro Taniguchi. A Generative Framework for Multimodal Learning of Spatial Concepts and Object Categories: An Unsupervised Part-of-Speech Tagging and 3D Visual Perception Based Approach. IEEE International Joint Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob), Sep 2017, Lisbon, Portugal. ⟨hal-01953470⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENSTA ENSTA_U2IS UNIV-PARIS-SACLAY

36 Consultations

127 Téléchargements

A Generative Framework for Multimodal Learning of Spatial Concepts and Object Categories: An Unsupervised Part-of-Speech Tagging and 3D Visual Perception Based Approach

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager