Thesis
Stochastic control approach to the multi-armed bandit problems
- Abstract:
- The multi-armed bandit is the simplest problem in which to study learning under uncertainty when decisions affect information. The standard approach to the multi-armed bandit often gives a heuristic construction of an algorithm and proves a regret bound for it. With a constructive approach, however, it is often possible to find a scenario in which a heuristic approach leads to a poor decision.
In this thesis, we consider solving the multi-armed bandit problem from first principles, in terms of stoch...
Funding
The Royal Thai Government
Bibliographic Details
- Type of award:
- DPhil
- Level of award:
- Doctoral
- Awarding institution:
- University of Oxford
Item Description
- Language:
- English
- Deposit date:
- 2021-07-01
Terms of use
- Copyright holder:
- Treetanthiploet, T
- Copyright date:
- 2021