Exploiting Additive Structure in Factored MDPs for Reinforcement Learning - Université Pierre et Marie Curie Accéder directement au contenu
Communication Dans Un Congrès Année : 2008

Exploiting Additive Structure in Factored MDPs for Reinforcement Learning

Thomas Degris
  • Fonction : Auteur
Olivier Sigaud
Pierre-Henri Wuillemin

Résumé

sdyna is a framework able to address large, discrete and stochastic reinforcement learning problems. It incrementally learns a fmdp representing the problem to solve while using fmdp planning techniques to build an efficient policy. spiti, an instantiation of sdyna, uses a planning method based on dynamic programming which cannot exploit the additive structure of a fmdp. In this paper, we present two new instantiations of sdyna, namely ulp and unatlp, using a linear programming based planning method that can exploit the additive structure of a fmdp and address problems out of reach of spiti.

Dates et versions

hal-01302178 , version 1 (13-04-2016)

Identifiants

Citer

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuillemin. Exploiting Additive Structure in Factored MDPs for Reinforcement Learning. European Workshop on Reinforcement Learning, Jun 2008, Villeneuve d’Ascq, France. pp.15-26, ⟨10.1007/978-3-540-89722-4_2⟩. ⟨hal-01302178⟩
81 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More