Decoupling feature extraction from policy learning: assessing benefits of state representation learning in goal based robotics

Antonin Raffin; Ashley Hill; René Traoré; Timothée Lesort; Natalia Díaz-Rodríguez; David Filliat

Conference paper, Year: 2019

Antonin Raffin

Role: Author
PersonId: 1039438

Unité d'Informatique et d'Ingénierie des Systèmes

Ashley Hill

Role: Author

Unité d'Informatique et d'Ingénierie des Systèmes

René Traoré

Role: Author
PersonId: 1039440

Unité d'Informatique et d'Ingénierie des Systèmes

Timothée Lesort

Role: Author

Unité d'Informatique et d'Ingénierie des Systèmes

Natalia Díaz-Rodríguez

Role: Author
PersonId: 170998
IdHAL: natalia-diaz-rodriguez
ORCID: 0000-0003-3362-9326
IdRef: 261850032

Flowing Epigenetic Robots and Systems

Unité d'Informatique et d'Ingénierie des Systèmes

David Filliat

Role: Author
PersonId: 45
IdHAL: david-filliat
ORCID: 0000-0002-5739-1618
IdRef: 070072337

Unité d'Informatique et d'Ingénierie des Systèmes

Flowing Epigenetic Robots and Systems

Abstract

Scaling end-to-end reinforcement learning to control real robots from vision presents a series of challenges, particularly in terms of sample efficiency. In contrast to end-to-end learning, state representation learning can help learn a compact, efficient, and relevant representation of states that speeds up policy learning, reduces the number of samples needed, and is easier to interpret. We evaluate several state representation learning methods on goal-based robotics tasks and propose a new unsupervised model that stacks representations and combines the strengths of several of these approaches. This method encodes all the relevant features, performs on par with or better than end-to-end learning while offering better sample efficiency, and is robust to hyperparameter changes.

Domains

Machine Learning [cs.LG]

Contributor: Timothée Lesort

https://hal.science/hal-02285831

Submitted on: Friday, September 13, 2019, 10:30:18

Last modified on: Wednesday, March 15, 2023, 08:50:07

Dates and versions

hal-02285831, version 1 (13-09-2019)

Identifiers

HAL Id: hal-02285831, version 1
arXiv: 1901.08651

Cite

Antonin Raffin, Ashley Hill, René Traoré, Timothée Lesort, Natalia Díaz-Rodríguez, et al. Decoupling feature extraction from policy learning: assessing benefits of state representation learning in goal based robotics. SPiRL 2019: Workshop on Structure and Priors in Reinforcement Learning at ICLR 2019, May 2019, New Orleans, United States. ⟨hal-02285831⟩

Collections

ENSTA INRIA ENSTA_U2IS INRIA2 UNIV-PARIS-SACLAY
