Collapsed variational inference for computational linguistics

Wang, P

Thesis

Collapsed variational inference for computational linguistics

Abstract:: Bayesian modelling is a natural fit for tasks in computational linguistics, since it can provide interpretable structures, useful prior controls, and coherent management of uncertainty. However, exact Bayesian inference is intractable for many models of practical interest. Developing both accurate and efficient approximate Bayesian inference algorithms remains a fundamental challenge, especially for the field of computational linguistics where datasets are large and growing and models consist of complex latent structures.

Collapsed variational inference (CVI) is an important milestone that combines the efficiency of variational inference (VI) and the accuracy of Markov chain Monte Carlo (MCMC) (Teh et al., 2006). However, its previous applications were limited to bag-of-words models whose hidden variables are conditionally independent given the parameters, whereas in computational linguistics, the hidden variable dependencies are crucial for modelling the underlying syntactic and semantic relations. To enlarge the application domain of CVI as well as to address the above Bayesian inference challenge, we investigate the applications of collapsed variational inference to computational linguistics.

In this thesis, our contributions are three-fold. First, we solve a number of inference challenges arising from the hidden variable dependencies and derive a set of new CVI algorithms for the two ubiquitous and foundational models in computational linguistics, namely hidden Markov models (HMMs) and probabilistic context free grammars. We also propose CVI for hierarchical Dirichlet process (HDP) HMMs that are Bayesian nonparametric extensions of HMMs.

Second, along the way we propose a set of novel algorithmic techniques, which are generally applicable to a wide variety of probabilistic graphical models in the conjugate exponential family and computational linguistic models using non-conjugate HDP constructions. Therefore, our work represents one step in bridging the gap between increasingly richer Bayesian models in computational linguistics and recent advances in approximate Bayesian inference.

Third, we empirically evaluate our proposed CVI algorithms and their stochastic versions in a range of computational linguistic tasks, such as part-of-speech induction, grammar induction and many others. Experimental results consistently demonstrate that, using our techniques for handling the hidden variable dependencies, the empirical advantages of both VI and MCMC can be combined in a much larger domain of CVI applications.

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Wang, P. (2016). Collapsed variational inference for computational linguistics [PhD thesis]. University of Oxford.

MLA Style

Wang, P. Collapsed Variational Inference for Computational Linguistics. 2016. University of Oxford, PhD thesis.

Chicago Style

Wang, P. 2016. “Collapsed Variational Inference for Computational Linguistics.” PhD thesis, University of Oxford.
Print

Access Document

Files:: thesis_final_version.pdf

(Preview, pdf, 2.1MB, Terms of use)

Authors

+ Wang, P More by this author

Division:: MPLS
Department:: Computer Science
Department:: Computer Science
Role:: Author

Contributors

+ Blunsom, P

Department:: Computer Science
Role:: Supervisor

DOI:: 10.5287/ora-vqwbqovoj
Type of award:: DPhil
Level of award:: Doctoral
Awarding institution:: University of Oxford

UUID:: uuid:13c08f60-1441-4ea5-b52f-7ffd0d7a744f
Deposit date:: 2017-06-23
ARK identifier:: ark:/29072/ora_13c08f6014414ea5b52f7ffd0d7a744f

Terms of use

Copyright holder:: Wang, P

Licence:: Terms and Conditions of Use for Oxford University Research Archive

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

Thesis

Collapsed variational inference for computational linguistics

Actions

Access Document

Authors

Contributors

Terms of use

Views and Downloads

Altmetrics

Dimensions

Thesis

Collapsed variational inference for computational linguistics

Actions

Access Document

Authors

Contributors

Bibliographic Details

Item Description

Terms of use

Metrics

Views and Downloads

Altmetrics

Dimensions