Thesis

Application of the Neutral Indel Model to genome sequences for diverse metazoans

Abstract:
The Neutral Indel Model accurately predicts the distribution of indel events in alignments of neutrally evolving genomic sequence. Here, I apply this model to a diverse range of metazoan species pairs with several aims. First, I apply the Neutral Indel Model to alignments of genome sequences for species within the mammalian clade in order to estimate the quantity of functional DNA shared between species pairs. I demonstrate that as the evolutionary divergence between species pairs increases, estimates of functional sequence drop off dramatically. This pattern is not replicated in extensive simulations of genome sequence alignments, suggesting that functional (and mostly non-coding) sequence is turning over at a rapid rate. I also estimate that between 200 and 300 Mb (6.5-10%) of the human genome is under evolutionary constraint, a considerably higher quantity of sequence than has been estimated by previous whole-genome analyses. Second, extending my analyses to more diverse metazoan species, I provide estimates of functional bases within organisms’ genomes that appear to mirror our conceptions of organismal complexity. Third, I develop the Neutral Indel Model as a method for assessing genome sequence quality by quantifying indel errors within alignments of closely related (dS < 0.1) species pairs. Applying this method to six primate genome sequence assemblies, I demonstrate that the frequency of indel error events per base varies up to six-fold. Furthermore, I show that second-generation sequencing technologies can be used to create high-quality genome sequence assemblies and to ameliorate errors in pre-existing assemblies. Finally, I analyse patterns of indel mutations in primate transposable elements and show that indels are not randomly distributed within these sequences, owing to regularly spaced homo-nucleotide motifs.

Authors


Institution:
University of Oxford
Division:
MSD
Department:
Physiology Anatomy & Genetics
Research group:
Ponting Group
Oxford college:
Linacre College
Role:
Author

Contributors

Division:
MSD
Department:
Physiology Anatomy & Genetics
Role:
Supervisor



Publication date:
2010
DOI:
Type of award:
DPhil
Level of award:
Doctoral
Awarding institution:
Oxford University, UK


Language:
English
Keywords:
Subjects:
UUID:
uuid:18f8c5fc-28f2-4d5e-aa87-c1086582213c
Local pid:
ora:5526
Deposit date:
2011-07-05
