Computing Reviews, the leading online review service for computing literature.

Search

A new learning algorithm for optimal stopping
Borkar V., Pinto J., Prabhu T. Discrete Event Dynamic Systems19 (1):91-113,2009.Type:Article

Date Reviewed: Sep 8 2010

The finite horizon optimal stopping model’s various applications, mainly in American-type option pricing, required research in order to construct a solution for these problems in exact form. Dynkin’s original formulation and solution [1] uses dynamic programming methods, but rarely provides the exact analytical and numerical solution. The alternative is to approximate the model by an optimization problem with a known algorithm. Some of these techniques are obtained from iterative schemes for Markov decision processes. In this paper, Borkar et al.’s main idea is based on the formulation of the optimal stopping problem: “The ‘primal’ formulation of this is a linear program (LP for short) over the so-called ‘occupation measures.’ Its ‘dual’ is an LP over functions” [2]. Next, the learning scheme is constructed for the linear approximation of the LP. The main part of the paper is devoted to the derivation and justification of the algorithm. The work is theoretical, with descriptions of numerical experiments related to the option pricing model. This well-executed application of known techniques offers new, effective methods for solving optimal stopping problems. The examples presented are appealing. Although the paper lists other methods of approximation, it fails to provide a direct comparison of the proposed approach to other approaches.

Reviewer: Krzysztof Szajowski	Review #: CR138366 (1102-0214)

1)	Dynkin, E.B. The optimum choice of the instant of stopping a Markov process. Soviet Mathematics Doklady 4, (1963), 627–629.

2)	Cho, M.J. ; Stockbridge, R.H. Linear programming formulation for optimal stopping problems. SIAM Journal on Control and Optimization 40, 6(2001), 1965–1982.

Learning (I.2.6 )

Discrete event (I.6.8 ... )

Linear Programming (G.1.6 ... )

Optimization (G.1.6 )

Types Of Simulation (I.6.8 )

Would you recommend this review?

yes

Other reviews under "Learning":	Date

Learning in parallel networks: simulating learning in a probabilistic system Hinton G. (ed) BYTE 10(4): 265-273, 1985. Type: Article	Nov 1 1985

Macro-operators: a weak method for learning Korf R. Artificial Intelligence 26(1): 35-77, 1985. Type: Article	Feb 1 1986

Inferring (mal) rules from pupils’ protocols Sleeman D. Progress in artificial intelligence (, Orsay, France,391985. Type: Proceedings	Dec 1 1985

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy