Journal article icon

Journal article

Occurrence and consequences of coding sequence insertions and deletions in Mammalian genomes.

Abstract:
Nucleotide insertion and deletion (indel) events, together with substitutions, represent the major mutational processes of gene evolution. Through the alignment of 8148 orthologous genes from human, mouse, and rat, we have identified 1743 indel events within rodent protein-coding sequences. Using human as an out-group, we reconstructed the mutational event underlying each of these indels. Overall, we found an excess of deletions over insertions, particularly for the rat lineage (70% excess). Sequence slippage accounts for at least 52% of insertions and 38% of deletions. We have also evaluated the selective tolerance of identifiable protein structures to indels. Transmembrane domains are the least, and low complexity regions, the most tolerant. Mapping of indels onto known protein structures demonstrated that structural cores are markedly less tolerant to indels than are loop regions. There is a specific enrichment of CpG dinucleotides in close proximity to insertion events, and both insertions and deletions are more common in higher G+C content sequences.
Publication status:
Published

Actions


Access Document


Publisher copy:
10.1101/gr.1977804

Authors


More by this author
Institution:
University of Oxford
Division:
MSD
Department:
Physiology Anatomy & Genetics
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MSD
Department:
NDM
Sub department:
Human Genetics Wt Centre
Role:
Author


Journal:
Genome research More from this journal
Volume:
14
Issue:
4
Pages:
555-566
Publication date:
2004-04-01
DOI:
EISSN:
1549-5469
ISSN:
1088-9051


Language:
English
Keywords:
Pubs id:
pubs:54034
UUID:
uuid:25cdf4fe-f757-449b-8cba-c10d77d0730d
Local pid:
pubs:54034
Source identifiers:
54034
Deposit date:
2012-12-19

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP