
Journal article

Using reference-free compressed data structures to analyse sequencing reads from thousands of human genomes.

Abstract:

We are rapidly approaching the point where we have sequenced millions of human genomes. There is a pressing need for new data structures to store raw sequencing data and for efficient algorithms for population-scale analysis. Current reference-based data formats neither fully exploit the redundancy in population sequencing nor take advantage of shared genetic variation. In recent years, the Burrows-Wheeler transform (BWT) and FM-index have been widely employed as a full text search...

Publication status:
Published
Peer review status:
Peer reviewed

Files:
Publisher copy:
10.1101/gr.211748.116

Authors


Institution:
University of Oxford
Division:
MSD
Department:
NDM
Sub department:
Human Genetics Wt Centre
Role:
Author
Funder:
Wellcome Trust
Publisher:
Cold Spring Harbor Laboratory Press
Journal:
Genome Research
Volume:
27
Pages:
300-309
Publication date:
2016-12-01
Acceptance date:
2016-12-14
DOI:
10.1101/gr.211748.116
ISSN:
1549-5469
Source identifiers:
667795
Language:
English
Keywords:
Pubs id:
pubs:667795
UUID:
uuid:cfe47bc5-9ab0-4684-ad12-f6c49f9a63c2
Local pid:
pubs:667795
Deposit date:
2017-01-31
