Conference item
Snowflake: Scaling GNNs to high-dimensional continuous control via parameter freezing
- Abstract:
- Recent research has shown that graph neural networks (GNNs) can learn policies for locomotion control that are as effective as a typical multi-layer perceptron (MLP), with superior transfer and multi-task performance. However, results have so far been limited to training on small agents, with the performance of GNNs deteriorating rapidly as the number of sensors and actuators grows. A key motivation for the use of GNNs in the supervised learning setting is their applicability to large graphs, but this benefit has not yet been realised for locomotion control. We show that poor scaling in GNNs is a result of increasingly unstable policy updates, caused by overfitting in parts of the network during training. To combat this, we introduce Snowflake, a GNN training method for high-dimensional continuous control that freezes parameters in selected parts of the network. Snowflake significantly boosts the performance of GNNs for locomotion control on large agents, now matching the performance of MLPs while offering superior transfer properties.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Supplementary materials, 1.6MB, Terms of use)
-
(Preview, Accepted manuscript, 3.3MB, Terms of use)
-
Authors
- Publisher:
- Curran Associates
- Host title:
- Advances in Neural Information Processing Systems 34 (NeurIPS 2021)
- Volume:
- 34
- Pages:
- 23983-23992
- Publication date:
- 2021-12-06
- Acceptance date:
- 2021-11-01
- Event title:
- 35th Conference on Neural Information Processing Systems (NeurIPS 2021)
- Event location:
- Virtual event
- Event website:
- https://nips.cc/Conferences/2021/
- Event start date:
- 2021-12-06
- Event end date:
- 2021-12-14
- ISBN:
- 9781713845393
- Language:
-
English
- Keywords:
- Pubs id:
-
1211846
- Local pid:
-
pubs:1211846
- Deposit date:
-
2021-11-23
Terms of use
- Copyright holder:
- Blake et al and Neural Information Processing Systems Foundation Inc
- Copyright date:
- 2021
- Rights statement:
- © (2021) by individual authors and Neural Information Processing Systems Foundation Inc
- Notes:
- This is the accepted manuscript version of the conference paper. The final version is available from the NIPS Foundation at https://proceedings.neurips.cc/paper/2021/file/c952ce98517ac529c60744ac28364b03-Paper.pdf
If you are the owner of this record, you can report an update to it here: Report update to this record