Journal article
GREEN-DB: a framework for the annotation and prioritization of non-coding regulatory variants from whole-genome sequencing data
- Abstract:
- Non-coding variants have long been recognized as important contributors to common disease risks, but with the expansion of clinical whole genome sequencing, examples of rare, high-impact non-coding variants are also accumulating. Despite recent advances in the study of regulatory elements and the availability of specialized data collections, the systematic annotation of non-coding variants from genome sequencing remains challenging. Here, we propose a new framework for the prioritization of non-coding regulatory variants that integrates information about regulatory regions with prediction scores and HPO-based prioritization. Firstly, we created a comprehensive collection of annotations for regulatory regions including a database of 2.4 million regulatory elements (GREEN-DB) annotated with controlled gene(s), tissue(s) and associated phenotype(s) where available. Secondly, we calculated a variation constraint metric and showed that constrained regulatory regions associate with disease-associated genes and essential genes from mouse knock-outs. Thirdly, we compared 19 non-coding impact prediction scores providing suggestions for variant prioritization. Finally, we developed a VCF annotation tool (GREEN-VARAN) that can integrate all these elements to annotate variants for their potential regulatory impact. In our evaluation, we show that GREEN-DB can capture previously published disease-associated non-coding variants as well as identify additional candidate disease genes in trio analyses.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 2.1MB, Terms of use)
-
- Publisher copy:
- 10.1093/nar/gkac130
Authors
+ National Institute for Health Research
More from this funder
- Funder identifier:
- https://ror.org/0187kwz08
- Publisher:
- Oxford University Press
- Journal:
- Nucleic Acids Research More from this journal
- Volume:
- 50
- Issue:
- 5
- Pages:
- 2522-2535
- Publication date:
- 2022-03-02
- Acceptance date:
- 2022-02-14
- DOI:
- EISSN:
-
1362-4962
- ISSN:
-
0305-1048
- Language:
-
English
- Source identifiers:
-
2461739
- Deposit date:
-
2024-11-29
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.
If you are the owner of this record, you can report an update to it here: Report update to this record