Thesis icon

Thesis

Adaptive robust control with statistical learning

Abstract:

In stochastic control problems the agent chooses the optimal strategy to maximise or minimise the performance criterion. The performance criterion can be either the expectation of a reward function for the standard control problem or the non-linear expectation for the robust control problem. In parameterised stochastic control problems, the agent needs to know the value of the model parameters in the stochastic system to specify the optimal strategy correctly. However, it is hardly the case that the agent knows the values of the model parameters.

In this thesis, we aim to study a robust stochastic control problem where the agent does not know the values of the parameters of the underlying process. Therefore, we frame the stochastic control problem where we assume that the agent does not know the values of the model parameters. However, the agent uses the observable processes to estimate the values of the model parameters while simultaneously solving the stochastic control problem in a robust framework.

There are two key components in this new stochastic control problem. The first component is the parameter estimation part where the agent uses the realisation of the underlying process to estimate the unknown parameters in the stochastic system. We particularly focus on online parameter estimation. The online estimator is an important ingredient for our stochastic control problem because this type of estimator allows the agent to obtain the optimal strategy in feedback form. The second component is the stochastic control part which is the question of how to design a time-consistent stochastic control problem that allows the agent to also estimate the parameters and optimise her strategy simultaneously. In this thesis, we address each component of the problem above in the continuous-time setting and then the utility maximisation problem under this framework is studied carefully.

Actions


Access Document


Files:

Authors


More by this author
Division:
MPLS
Department:
Mathematical Institute
Role:
Author

Contributors

Institution:
University of Oxford
Division:
MPLS
Department:
Mathematical Institute
Role:
Supervisor
ORCID:
0000-0002-7426-4645


DOI:
Type of award:
DPhil
Level of award:
Doctoral
Awarding institution:
University of Oxford


Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP