Journal article icon

Journal article

Compositional Variability and Mutation Spectra of Monophyletic SARS-CoV-2 Clades

Abstract:
COVID-19 and its causative pathogen SARS-CoV-2 have rushed the world into a staggering pandemic in a few months, and a global fight against both has been intensifying. Here, we describe an analysis procedure where genome composition and its variables are related, through the genetic code to molecular mechanisms, based on understanding of RNA replication and its feedback loop from mutation to viral proteome sequence fraternity including effective sites on the replicase-transcriptase complex. Our analysis starts with primary sequence information, identity-based phylogeny based on 22,051 SARS-CoV-2 sequences, and evaluation of sequence variation patterns as mutation spectra and its 12 permutations among organized clades. All are tailored to two key mechanisms: strand-biased and function-associated mutations. Our findings are listed as follows: 1) The most dominant mutation is C-to-U permutation, whose abundant second-codon-position counts alter amino acid composition toward higher molecular weight and lower hydrophobicity, albeit assumed most slightly deleterious. 2) The second abundance group includes three negative-strand mutations (U-to-C, A-to-G, and G-to-A) and a positive-strand mutation (G-to-U) due to DNA repair mechanisms after cellular abasic events. 3) A clade-associated biased mutation trend is found attributable to elevated level of negative-sense strand synthesis. 4) Within-clade permutation variation is very informative for associating non-synonymous mutations and viral proteome changes. These findings demand a platform where emerging mutations are mapped onto mostly subtle but fast-adjusting viral proteomes and transcriptomes, to provide biological and clinical information after logical convergence for effective pharmaceutical and diagnostic applications. Such actions are in desperate need, especially in the middle of the War against COVID-19.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Publisher copy:
10.1016/j.gpb.2020.10.003

Authors

More by this author
Role:
Author
ORCID:
0000-0001-9282-4282
More by this author
Institution:
University of Oxford
Role:
Author
ORCID:
0000-0001-6984-5499
More by this author
Role:
Author
ORCID:
0000-0001-7374-3348
More by this author
Role:
Author
ORCID:
0000-0001-6876-4611
More by this author
Role:
Author
ORCID:
0000-0002-8010-8817


Publisher:
Oxford University Press
Journal:
Genomics, Proteomics and Bioinformatics More from this journal
Volume:
18
Issue:
6
Pages:
648-663
Publication date:
2021-02-11
DOI:
EISSN:
2210-3244
ISSN:
1672-0229


Language:
English
Keywords:
Pubs id:
2373823
Local pid:
pubs:2373823
Source identifiers:
W3128751190
Deposit date:
2026-02-22
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP