Thesis
Towards efficient and robust reinforcement learning via synthetic environments and offline data
- Abstract:
-
Over the past decade, Deep Reinforcement Learning (RL) has driven many advances in sequential decision-making, including remarkable applications in superhuman Go-playing, robotic control, and automated algorithm discovery. However, despite these successes, deep RL is also notoriously sample-inefficient, usually generalizes poorly to settings beyond the original environment, and can be unstable during training. Moreover, the conventional RL setting still relies on exploring and learning tab...
Expand abstract
Actions
Authors
Contributors
+ Osborne, M
- Institution:
- University of Oxford
- Division:
- MPLS
- Department:
- Engineering Science
- Sub department:
- Engineering Science
- Role:
- Supervisor
+ Teh, Y
- Institution:
- University of Oxford
- Division:
- MPLS
- Department:
- Statistics
- Sub department:
- Statistics
- Role:
- Supervisor
+ Foerster, J
- Institution:
- University of Oxford
- Division:
- MPLS
- Department:
- Engineering Science
- Sub department:
- Engineering Science
- Role:
- Examiner
+ Rocktäschel, T
- Institution:
- University College London
- Role:
- Examiner
+ Engineering and Physical Sciences Research Council
More from this funder
- Programme:
- Autonomous Intelligent Machines and Systems (CDT)
- DOI:
- Type of award:
- DPhil
- Level of award:
- Doctoral
- Awarding institution:
- University of Oxford
- Language:
-
English
- Keywords:
- Subjects:
- Deposit date:
-
2023-12-23
Terms of use
- Copyright holder:
- Lu, C
- Copyright date:
- 2023
If you are the owner of this record, you can report an update to it here: Report update to this record