Conference item

A tutorial on stochastic approximation algorithms for training Restricted Boltzmann Machines and Deep Belief Nets

Abstract:
In this study, we provide a direct comparison of the Stochastic Maximum Likelihood algorithm and Contrastive Divergence for training Restricted Boltzmann Machines using the MNIST data set. We demonstrate that Stochastic Maximum Likelihood is superior when using the Restricted Boltzmann Machine as a classifier, and that the algorithm can be greatly improved using the technique of iterate averaging from the field of stochastic approximation. We further show that training with optimal parameters for classification does not necessarily lead to optimal results when Restricted Boltzmann Machines are stacked to form a Deep Belief Network. In our experiments we observe that fine tuning a Deep Belief Network significantly changes the distribution of the latent data, even though the parameter changes are negligible.

Publisher copy:
10.1109/ITA.2010.5454138

Host title:
Information Theory and Applications Workshop (ITA)
Publication date:
2010-01-01
UUID:
uuid:998bc49a-d82e-4a5c-80eb-e16ed222434f
Local pid:
cs:7469
Deposit date:
2015-03-31
