Conference item
Trusting SVM for piecewise linear CNNs
- Abstract:
- We present a novel layerwise optimization algorithm for the learning objective of Piecewise-Linear Convolutional Neural Networks (PL-CNNs), a large class of convolutional neural networks. Specifically, PL-CNNs employ piecewise linear non-linearities such as the commonly used ReLU and max-pool, and an SVM classifier as the final layer. The key observation of our approach is that the problem corresponding to the parameter estimation of a layer can be formulated as a difference-of-convex (DC) program, which happens to be a latent structured SVM. We optimize the DC program using the concave-convex procedure, which requires us to iteratively solve a structured SVM problem. This allows to design an optimization algorithm with an optimal learning rate that does not require any tuning. Using the MNIST, CIFAR and ImageNet data sets, we show that our approach always improves over the state of the art variants of backpropagation and scales to large data and large network settings.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 650.1KB, Terms of use)
-
- Publication website:
- https://www.proceedings.com/68832.html
Authors
+ Engineering and Physical Sciences Research Council
More from this funder
- Grant:
- EP/M013774/1/EPSRC Programme Grant Seebibyte
- Publisher:
- Curran Associates
- Host title:
- 5th International Conference on Learning Representations (ICLR 2017)
- Volume:
- 4
- Pages:
- 2727-2749
- Publication date:
- 2017-04-26
- Acceptance date:
- 2017-02-06
- Event title:
- 5th International Conference on Learning Representations (ICLR 2017)
- Event location:
- Toulon, France
- Event website:
- https://iclr.cc/archive/www/2017.html
- Event start date:
- 2017-04-24
- Event end date:
- 2017-04-26
- ISBN:
- 9781713872719
- Language:
-
English
- Pubs id:
-
pubs:688986
- UUID:
-
uuid:08f4a60f-b5f7-40a5-aff9-7ce4bce85b5b
- Local pid:
-
pubs:688986
- Source identifiers:
-
688986
- Deposit date:
-
2017-04-11
- ARK identifier:
Terms of use
- Copyright holder:
- ICLR
- Copyright date:
- 2017
- Rights statement:
- © (2017) by International Conference on Learning Representations. All rights reserved.
- Notes:
- This paper was presented at the 5th International Conference on Learning Representations (ICLR 2017), 24-26 April 2017, Toulon, France. This is the accepted manuscript version of the article. The final version is available online from Curran Associates at https://www.proceedings.com/68832.html
If you are the owner of this record, you can report an update to it here: Report update to this record