Journal article
Modelling genetic variations with fragmentation-coagulation processes
- Abstract:
- We propose a novel class of Bayesian nonparametric models for sequential data called fragmentation-coagulation processes (FCPs). FCPs model a set of sequences using a partition-valued Markov process which evolves by splitting and merging clusters. An FCP is exchangeable, projective, stationary and reversible, and its equilibrium distributions are given by the Chinese restaurant process. As opposed to hidden Markov models, FCPs allow for flexible modelling of the number of clusters, and they avoid label switching non-identifiability problems. We develop an efficient Gibbs sampler for FCPs which uses uniformization and the forward-backward algorithm. Our development of FCPs is motivated by applications in population genetics, and we demonstrate the utility of FCPs on problems of genotype imputation with phased and unphased SNP data.
Actions
Authors
- Journal:
- Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011 More from this journal
- Publication date:
- 2011-01-01
- Language:
-
English
- Pubs id:
-
pubs:353218
- UUID:
-
uuid:aa1409f2-644a-4e75-b3fd-6176cd9cbeaf
- Local pid:
-
pubs:353218
- Source identifiers:
-
353218
- Deposit date:
-
2013-11-16
Terms of use
- Copyright date:
- 2011
If you are the owner of this record, you can report an update to it here: Report update to this record