Temporal difference learning with kernels for pricing american-style options

Kengy Barty; Jean-Sébastien Roy; Cyrille Strugarek

Journal article, Optimization Online, Year: 2005

Kengy Barty, Author (Optimisation et commande)

Jean-Sébastien Roy, Author

Cyrille Strugarek, Author (Optimisation et commande)

Abstract

We study the problem of estimating the cost-to-go function for an infinite-horizon discounted Markov chain with a possibly continuous state space. For implementation purposes, the state space is typically discretized. As soon as the dimension of the state space becomes large, the computation is no longer practicable, a phenomenon referred to as the curse of dimensionality. Approximation methods for dynamic programming problems are therefore of major importance. A powerful method for dynamic programming, often referred to as neuro-dynamic programming, consists in representing the Bellman function as a linear combination of a priori defined functions, called neurons. In this article, we propose an alternative approach, closely related to temporal differences, based on functional gradient descent and using an infinite kernel basis. Furthermore, our algorithm, though aimed at infinite-dimensional problems, is implementable in practice. We prove the convergence of this algorithm and show applications, for example to Bermudan option pricing.
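
The idea of a temporal-difference update expressed as a functional gradient step over a kernel basis can be illustrated with a minimal sketch. It is not the authors' algorithm as stated in the paper: the Gaussian kernel, the autoregressive dynamics, the quadratic cost, the step-size schedule, and the names `gaussian_kernel`, `KernelTD`, and `simulate` are all assumptions chosen for illustration. The estimate is stored as a sum of kernels centered at the visited states, and each sample adds one kernel weighted by the step size times the temporal difference.

```python
import math
import random


def gaussian_kernel(x, y, bandwidth=0.5):
    """Gaussian kernel on the real line (an assumed choice); K(x, x) = 1."""
    return math.exp(-((x - y) ** 2) / (2.0 * bandwidth ** 2))


class KernelTD:
    """Hypothetical helper: cost-to-go estimate J(x) = sum_i w_i * K(c_i, x)."""

    def __init__(self, discount, kernel=gaussian_kernel):
        self.discount = discount
        self.kernel = kernel
        self.centers = []  # visited states c_i
        self.weights = []  # coefficients w_i

    def value(self, x):
        """Evaluate the current estimate J(x)."""
        return sum(w * self.kernel(c, x)
                   for c, w in zip(self.centers, self.weights))

    def update(self, x, cost, x_next, step):
        """One step J <- J + step * td * K(x, .), with td the temporal difference."""
        td = cost + self.discount * self.value(x_next) - self.value(x)
        self.centers.append(x)
        self.weights.append(step * td)
        return td


def simulate(n_steps=500, discount=0.5, seed=0):
    """Run the sketch along one trajectory of an assumed AR(1) chain."""
    rng = random.Random(seed)
    model = KernelTD(discount)
    x = 0.0
    for k in range(n_steps):
        x_next = 0.5 * x + rng.gauss(0.0, 0.5)  # assumed dynamics
        cost = x * x                            # assumed per-stage cost
        step = 1.0 / (1.0 + k) ** 0.75          # assumed decreasing step size
        model.update(x, cost, x_next, step)
        x = x_next
    return model


if __name__ == "__main__":
    model = simulate()
    print("kernels stored:", len(model.centers))
    print("estimated J(0.0):", model.value(0.0))
    print("estimated J(1.0):", model.value(1.0))
```

Each call to `value` sums over every stored kernel, so the cost per update grows with the number of samples seen. A practical implementation would likely need to bound or sparsify the kernel expansion; the sketch leaves that out.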

Contributor: Aurélien Arnoux

https://ensta-paris.hal.science/hal-00983326

Submitted on: Friday, 25 April 2014, 10:37:16

Last modified on: Wednesday, 11 May 2022, 12:06:05

Dates and versions

hal-00983326 , version 1 (25-04-2014)

Identifiers

HAL Id: hal-00983326, version 1

Cite

Kengy Barty, Jean-Sébastien Roy, Cyrille Strugarek. Temporal difference learning with kernels for pricing american-style options. Optimization Online, 2005. ⟨hal-00983326⟩

Collections

ENSTA UMA_ENSTA
