Abstract
In this paper, we introduce an approach for obtaining probabilistically guaranteed upper and lower bounds on the true optimal value of stopping problems. Bounds of existing simulation-and-regression approaches, such as those based on least squares Monte Carlo and information relaxation, are stochastic in nature and therefore do not come with a finite sample guarantee. Our data-driven approach is fundamentally different as it allows replacing the sampling error with a pre-specified confidence level. The key to this approach is to use high- and low-biased estimates that are guaranteed to over- and underestimate, respectively, the conditional expected continuation value that appears in the stopping problem’s dynamic programming formulation with a pre-specified confidence level. By incorporating these guaranteed over- and underestimates into a backward recursive procedure, we obtain probabilistically guaranteed bounds on the problem’s true optimal value. As a byproduct we present novel kernel-based non-asymptotic uniform confidence bands for regression functions from a reproducing kernel Hilbert space. We derive closed-form formulas for the cases where the data-generating distribution is either known or unknown, which makes our data-driven approach readily applicable in a range of practical situations including simulation. We illustrate the applicability of the proposed bounding procedure by valuing a Bermudan put option.
| Translated title of the contribution | Garantierte Schranken für Optimales Stoppen mit Hilfe von kernbasierten nicht-asymptotischen gleichmäßigen Konfidenzbändern |
|---|---|
| Original language | English |
| Pages (from-to) | 162-173 |
| Number of pages | 12 |
| Journal | European Journal of Operational Research |
| Volume | 327 |
| Issue number | 1 |
| DOIs | |
| Publication status | Published - 2025 |
Austrian Fields of Science 2012
- 101015 Operations research
- 101016 Optimisation
Keywords
- Finite sample guarantees
- Optimal stopping
- Approximate dynamic programming
- Reproducing kernel Hilbert spaces
- Stochastic programming
Fingerprint
Dive into the research topics of 'Guaranteed bounds for optimal stopping problems using kernel-based non-asymptotic uniform confidence bands'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver