
Thesis

Deep Reinforcement Learning in complex environments

Abstract:
Deep Reinforcement Learning (DRL) is becoming a popular and mature framework for learning to solve sequential decision-making problems. The application of Deep Neural Networks, flexible and powerful function approximators, to learning policies has enabled RL to tackle applications that were previously thought too difficult: from beating professional human players at hard games such as Go, to becoming the foundation for flexible embodied control. We explore what happens when one attempts to learn policies in environments that present complex dynamics and hard, structured tasks. Because these environments pose challenges at the forefront of what most state-of-the-art Reinforcement Learning methods can tackle, they offer a general view of existing weaknesses, while also providing opportunities for improving both the general framework and particular algorithms.

Firstly, we study and develop methods for Deep Multi-Agent Reinforcement Learning, a setting in which multiple agents interact with an (often complex) environment and with each other. The presence of multiple agents breaks some of the key assumptions that give standard learning methods their stability, creating unique and interesting problems. We test these methods on a multi-agent formulation of the StarCraft micromanagement problem, an extremely complex real-time control and planning problem based on one of the hardest environments currently available in the literature.

Secondly, in a single-agent version of the same problem, we investigate how DRL can be used to develop a set of parameter-efficient differentiable planning modules that solve path-planning tasks with complex environment dynamics and variable map sizes. We show that these modules enable learning to plan even when the environment includes stochastic elements, providing a cost-efficient learning system for building low-level, size-invariant planners for a variety of interactive, hard navigation problems.

Thirdly, and lastly, we present a novel RL benchmark based on one of the oldest and most complex video games ever developed: the NetHack Learning Environment (NLE). NLE provides an environment that is scalable, rich, and challenging for state-of-the-art RL, while maintaining familiarity with standard grid-worlds and dramatically decreasing the computational requirements compared to existing environments of similar complexity and scope. We believe that this particular intersection of properties will enable the community to employ a single environment both as a debugging tool for increasingly complicated RL agents and as a target for the next decade of RL research.
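The NetHack Learning Environment mentioned in the abstract is distributed with a standard Gym interface. As a rough illustration, not taken from the thesis itself and assuming the nle pip package, its NetHackScore-v0 task, and the original Gym step/reset API, a minimal interaction loop looks roughly like this:

    # Minimal sketch (assumptions: the "nle" package is installed and the
    # original Gym API is in use, where step() returns obs, reward, done, info).
    import gym
    import nle  # importing nle registers tasks such as NetHackScore-v0

    env = gym.make("NetHackScore-v0")
    obs = env.reset()  # dict observation: dungeon glyphs, agent stats, in-game message, ...
    done = False
    while not done:
        action = env.action_space.sample()  # random policy, purely illustrative
        obs, reward, done, info = env.step(action)
    env.close()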

Authors


Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Sub department:
Engineering Science
Research group:
Torr Vision Group
Oxford college:
St Catherine's College
Role:
Author
ORCID:
0000-0001-8491-8166

Contributors

Role:
Supervisor


DOI:
Type of award:
DPhil
Level of award:
Doctoral
Awarding institution:
University of Oxford
