Stochastic Control
Contents |
Introduction
Referring to Stochastic Differential Equations we see that there is a forward equation (for value functions) and a backward equation (for the evolution of measures), the Kolmogorov backward equation.
Controlled Diffusions
Consider a controlled diffusion process
where the μ = μa,σ = σa are parameterized by the control parameter a.
Here the objective is to minimized the expected cost, which is a combination of the running cost, and a terminal cost
Then the value function is the expected value, starting from (x,t) of the cost, assuming the optimal choice of controls is used
The value function satisfied the Hamilton-Jacobi-Bellman equation
,
- u(x,T) = g(x)
where La are the corresponding operators as in the linear case
The HJB equation can be regarded at the stochastic control version of the Kolmogorov backward equation.
Derivation of HJB equation
Uses
- Ito
- Dynamic Programming Principle
- Taylor Series
Refer to Fleming Soner Controlled Markov processes and viscosity solutions
Least time problem
The least time problem is the expected time to exit a domain. Here we have zero Dirichlet boundary conditions on the inside of a bounded domain, and a running cost of 1. Instead of the fixed time T, the process runs until the stopping time, which is the first arrival on the boundary.