Decoupling feature extraction from policy learning: assessing benefits of state representation learning in goal based robotics - ENSTA Paris - École nationale supérieure de techniques avancées Paris Accéder directement au contenu
Communication Dans Un Congrès Année : 2019

Decoupling feature extraction from policy learning: assessing benefits of state representation learning in goal based robotics

Résumé

Scaling end-to-end reinforcement learning to control real robots from vision presents a series of challenges, in particular in terms of sample efficiency. Against end-to-end learning, state representation learning can help learn a compact, efficient and relevant representation of states that speeds up policy learning, reducing the number of samples needed, and that is easier to interpret. We evaluate several state representation learning methods on goal based robotics tasks and propose a new unsupervised model that stacks representations and combines strengths of several of these approaches. This method encodes all the relevant features, performs on par or better than end-to-end learning with better sample efficiency, and is robust to hyper-parameters change.

Dates et versions

hal-02285831 , version 1 (13-09-2019)

Identifiants

Citer

Antonin Raffin, Ashley Hill, René Traoré, Timothée Lesort, Natalia Díaz-Rodríguez, et al.. Decoupling feature extraction from policy learning: assessing benefits of state representation learning in goal based robotics. SPiRL 2019 : Workshop on Structure and Priors in Reinforcement Learning at ICLR 2019, May 2019, Nouvelle Orléans, United States. ⟨hal-02285831⟩
78 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More