Conference item
A natural encoding of genetic variation in a Burrows-Wheeler Transform to enable mapping and genome inference
- Abstract:
- We show how positional markers can be used to encode genetic variation within a Burrows-Wheeler Transform (BWT), and use this to construct a generalisation of the traditional “reference genome”, incorporating known variation within a species. Our goal is to support the inference of the closest mosaic of previously known sequences to the genome(s) under analysis. Our scheme results in an increased alphabet size, and by using a wavelet tree encoding of the BWT we reduce the performance impact on rank operations. We give a specialised form of the backward search that allows variation-aware exact matching. We implement this, and demonstrate the cost of constructing an index of the whole human genome with 8 million genetic variants is 25GB of RAM. We also show that inferring a closer reference can close large kilobase-scale coverage gaps in P. falciparum.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 923.6KB, Terms of use)
-
- Publisher copy:
- 10.1007/978-3-319-43681-4_18
Authors
- Publisher:
- Springer Verlag
- Journal:
- WABI 2016: 16th Workshop on Algorithms in Bioinformatics More from this journal
- Publication date:
- 2016-08-01
- Acceptance date:
- 2016-05-31
- DOI:
- ISSN:
-
0302-9743
- Keywords:
- Pubs id:
-
pubs:630012
- UUID:
-
uuid:14d555a2-5f01-48f6-8ae0-7adebf6918a7
- Local pid:
-
pubs:630012
- Source identifiers:
-
630012
- Deposit date:
-
2016-06-27
- ARK identifier:
Terms of use
- Copyright holder:
- Springer
- Copyright date:
- 2016
- Notes:
- © Springer International Publishing Switzerland 2016
If you are the owner of this record, you can report an update to it here: Report update to this record