Intrinsically Motivated Goal Exploration Processes with Automatic Curriculum Learning

Sébastien Forestier; Yoan Mollard; Pierre-Yves Oudeyer

Preprint, Working Paper. Year: 2017

Sébastien Forestier

Role: Author
PersonId: 974282

Université de Bordeaux

Flowing Epigenetic Robots and Systems

Yoan Mollard

Role: Author
PersonId: 970528

Flowing Epigenetic Robots and Systems

Pierre-Yves Oudeyer

Role: Author
PersonId: 6675
IdHAL: pyoudeyer
ORCID: 0000-0002-9404-7613
IdRef: 081674481

Flowing Epigenetic Robots and Systems

Abstract

Intrinsically motivated spontaneous exploration is a key enabler of autonomous lifelong learning in human children. It allows them to discover and acquire large repertoires of skills through self-generation, self-selection, self-ordering and self-experimentation of learning goals. We present a formal framework for unsupervised multi-goal reinforcement learning, as well as an algorithmic approach called intrinsically motivated goal exploration processes (IMGEP), to enable similar properties of autonomous learning in machines. The IMGEP algorithmic architecture relies on several principles: 1) self-generation of goals as parameterized reinforcement learning problems; 2) selection of goals based on intrinsic rewards; 3) exploration with parameterized time-bounded policies and fast incremental goal-parameterized policy search; 4) systematic reuse of information acquired when targeting a goal for improving other goals. We present a particularly efficient form of IMGEP that uses a modular representation of goal spaces as well as intrinsic rewards based on learning progress. We show how IMGEPs automatically generate a learning curriculum within an experimental setup where a real humanoid robot can explore multiple spaces of goals with several hundred continuous dimensions. While no particular target goal is provided to the system beforehand, this curriculum allows the discovery of skills of increasing complexity that act as stepping stones for learning more complex skills (like nested tool use). We show that learning several spaces of diverse problems can be more efficient for learning complex skills than only trying to directly learn these complex skills. We illustrate the computational efficiency of IMGEPs, as these robotic experiments use a simple memory-based low-level policy representation and search algorithm, enabling the whole system to learn online and incrementally on a Raspberry Pi 3.
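
The four principles listed in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the environment, the function names (toy_environment, run_imgep), the window size, the exploration noise and the epsilon value are all assumptions chosen for illustration, and learning progress is approximated crudely as the absolute change in mean goal-reaching error between two recent windows.

```python
import random


def toy_environment(theta, rng):
    """Hypothetical environment: returns one scalar outcome per goal space."""
    hand = 0.5 * theta[0] + 0.5 * theta[1]   # space 0: depends on the policy parameters
    distractor = rng.uniform(-1.0, 1.0)      # space 1: pure noise, not controllable
    return [hand, distractor]


def run_imgep(n_iterations=400, n_bootstrap=20, seed=0):
    rng = random.Random(seed)
    n_spaces = 2
    memory = []                              # all (theta, outcomes) pairs observed so far
    errors = [[] for _ in range(n_spaces)]   # goal-reaching errors, per goal space
    picks = [0] * n_spaces                   # how often each goal space was chosen

    def progress(space, window=10):
        """Crude learning-progress proxy (an assumption of this sketch)."""
        e = errors[space]
        if len(e) < 2 * window:
            return 1.0                       # optimistic start: try every space
        older = sum(e[-2 * window:-window]) / window
        recent = sum(e[-window:]) / window
        return abs(older - recent)

    for it in range(n_iterations):
        if it < n_bootstrap:
            # Random policy parameters to seed the memory.
            theta = [rng.uniform(-1.0, 1.0), rng.uniform(-1.0, 1.0)]
            memory.append((theta, toy_environment(theta, rng)))
            continue

        # Principle 2: select a goal space from intrinsic rewards (with some
        # uniform random selection to keep estimates fresh).
        if rng.random() < 0.2:
            space = rng.randrange(n_spaces)
        else:
            weights = [progress(s) + 1e-6 for s in range(n_spaces)]
            space = rng.choices(list(range(n_spaces)), weights=weights)[0]

        # Principle 1: self-generate a goal inside the chosen space.
        goal = rng.uniform(-1.0, 1.0)

        # Principle 3: memory-based policy search. Take the stored parameters
        # whose outcome was closest to the goal, then perturb them.
        best_theta, _ = min(memory, key=lambda m: abs(m[1][space] - goal))
        theta = [max(-1.0, min(1.0, t + rng.gauss(0.0, 0.05))) for t in best_theta]
        outcomes = toy_environment(theta, rng)

        # Principle 4: store the outcomes in every space, so a rollout made
        # for one goal can later be reused for goals in other spaces.
        memory.append((theta, outcomes))
        errors[space].append(abs(outcomes[space] - goal))
        picks[space] += 1

    return picks, errors


if __name__ == "__main__":
    picks, errors = run_imgep()
    for s in range(2):
        print("space", s, "picked", picks[s], "times, mean error",
              sum(errors[s]) / len(errors[s]))
```

In this sketch the policy parameters theta stand in for a time-bounded parameterized policy, and nearest-neighbour lookup plus Gaussian perturbation stands in for the fast incremental policy search. A windowed proxy like this one is sensitive to noise, so the sketch should be read as an outline of the loop structure rather than as a reproduction of the curriculum reported in the paper.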

Keywords

intrinsically motivated exploration; unsupervised multi-goal reinforcement learning; intrinsic motivation; curiosity-driven learning; automatic generation of goals; curriculum learning; learning progress; robotics; modular representations

Domains

Artificial Intelligence [cs.AI]; Robotics [cs.RO]; Cognitive Sciences

Main file

main.pdf (3.68 MB)

Origin: Files produced by the author(s)

Contributor: Sébastien Forestier

https://hal.science/hal-01651233

Submitted on: Tuesday, November 28, 2017, 18:05:08

Last modified on: Wednesday, March 15, 2023, 08:50:07

Dates and versions

hal-01651233, version 1 (2017-11-28)

hal-01651233, version 2 (2021-11-24)

hal-01651233, version 3 (2022-12-09)

Identifiers

HAL Id: hal-01651233, version 1

Cite

Sébastien Forestier, Yoan Mollard, Pierre-Yves Oudeyer. Intrinsically Motivated Goal Exploration Processes with Automatic Curriculum Learning. 2017. ⟨hal-01651233v1⟩

Collections

ENSTA

345 views

412 downloads
