Journal article
Training neural networks with end-to-end optical backpropagation
- Abstract:
- Optics is an exciting route for the next generation of computing hardware for machine learning, promising several orders of magnitude enhancement in both computational speed and energy efficiency. However, reaching the full capacity of an optical neural network necessitates the computing be implemented optically not only for inference, but also for training. The primary algorithm for network training is backpropagation, in which the calculation is performed in the order opposite to the information flow for inference. While straightforward in a digital computer, optical implementation of backpropagation has remained elusive, particularly because of the conflicting requirements for the optical element that implements the nonlinear activation function. In this work, we address this challenge for the first time with a surprisingly simple scheme, employing saturable absorbers for the role of activation units. Our approach is adaptable to various analog platforms and materials, and demonstrates the possibility of constructing neural networks entirely reliant on analog optical processes for both training and inference tasks.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 3.4MB, Terms of use)
-
- Publisher copy:
- 10.1117/1.AP.7.1.016004
Authors
- Publisher:
- Society of Photo-Optical Instrumentation Engineers
- Journal:
- Advanced Photonics More from this journal
- Volume:
- 7
- Issue:
- 1
- Article number:
- 016004
- Publication date:
- 2025-02-04
- Acceptance date:
- 2024-12-11
- DOI:
- ISSN:
-
2577-5421
- Language:
-
English
- Keywords:
- Pubs id:
-
2080267
- Local pid:
-
pubs:2080267
- Deposit date:
-
2025-01-23
Terms of use
- Copyright holder:
- Spall et al.
- Copyright date:
- 2025
- Rights statement:
- © The Authors. Published by SPIE and CLP under a Creative Commons Attribution 4.0 International License. Distribution or reproduction of this work in whole or in part requires full attribution of the original publication, including its DOI.
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record