Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
A new learning algorithm for optimal stopping
Borkar V., Pinto J., Prabhu T. Discrete Event Dynamic Systems19 (1):91-113,2009.Type:Article
Date Reviewed: Sep 8 2010

The finite horizon optimal stopping model’s various applications, mainly in American-type option pricing, required research in order to construct a solution for these problems in exact form. Dynkin’s original formulation and solution [1] uses dynamic programming methods, but rarely provides the exact analytical and numerical solution. The alternative is to approximate the model by an optimization problem with a known algorithm. Some of these techniques are obtained from iterative schemes for Markov decision processes.

In this paper, Borkar et al.’s main idea is based on the formulation of the optimal stopping problem: “The ‘primal’ formulation of this is a linear program (LP for short) over the so-called ‘occupation measures.’ Its ‘dual’ is an LP over functions” [2].

Next, the learning scheme is constructed for the linear approximation of the LP. The main part of the paper is devoted to the derivation and justification of the algorithm. The work is theoretical, with descriptions of numerical experiments related to the option pricing model.

This well-executed application of known techniques offers new, effective methods for solving optimal stopping problems. The examples presented are appealing. Although the paper lists other methods of approximation, it fails to provide a direct comparison of the proposed approach to other approaches.

Reviewer:  Krzysztof Szajowski Review #: CR138366 (1102-0214)
1) Dynkin, E.B. The optimum choice of the instant of stopping a Markov process. Soviet Mathematics Doklady 4, (1963), 627–629.
2) Cho, M.J. ; Stockbridge, R.H. Linear programming formulation for optimal stopping problems. SIAM Journal on Control and Optimization 40, 6(2001), 1965–1982.
Bookmark and Share
  Reviewer Selected
 
 
Learning (I.2.6 )
 
 
Discrete event (I.6.8 ... )
 
 
Linear Programming (G.1.6 ... )
 
 
Optimization (G.1.6 )
 
 
Types Of Simulation (I.6.8 )
 
Would you recommend this review?
yes
no
Other reviews under "Learning": Date
Learning in parallel networks: simulating learning in a probabilistic system
Hinton G. (ed) BYTE 10(4): 265-273, 1985. Type: Article
Nov 1 1985
Macro-operators: a weak method for learning
Korf R. Artificial Intelligence 26(1): 35-77, 1985. Type: Article
Feb 1 1986
Inferring (mal) rules from pupils’ protocols
Sleeman D.  Progress in artificial intelligence (, Orsay, France,391985. Type: Proceedings
Dec 1 1985
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy