Journal article
Pangenome-guided sequence assembly via binary optimization
- Abstract:
- De novo genome assembly is challenging in highly repetitive regions; however, reference-guided assemblers often suffer from bias. We propose a framework for pangenome-guided sequence assembly that can resolve short-read data in complex regions without bias towards a single reference genome. Our primary contribution is to frame the assembly as a graph traversal optimization problem, which can be implemented classically or on a quantum computer. The workflow involves first annotating pangenome graphs with estimated copy numbers for each node, then finding a path on the graph that best explains those copy numbers. On simulated data, our approach significantly reduces the number of contigs compared with de novo assemblers. While they introduce a small increase in inaccuracies, such as false joins, our optimization-based methods are competitive with current exhaustive search techniques. They are also designed to scale more efficiently as the problem size grows and will run effectively on future quantum computers; a small experiment on a real quantum device showcases this behaviour. Moreover, they are more resilient to noise in copy number estimation inherent in short-read-based assembly. We also develop novel tools for creating realistic synthetic pangenomes, aligning reads to pangenomes and for evaluating assembly quality.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 407.6KB, Terms of use)
-
(Preview, Version of record, pdf, 1.7MB, Terms of use)
-
- Publisher copy:
- 10.1093/bib/bbag084
Authors
+ Wellcome Trust
More from this funder
- Funder identifier:
- https://ror.org/029chgv08
- Grant:
- 220540/Z/20/A
+ Engineering and Physical Sciences Research Council
More from this funder
- Funder identifier:
- https://ror.org/0439y7842
- Publisher:
- Oxford University Press
- Journal:
- Briefings in Bioinformatics More from this journal
- Volume:
- 27
- Issue:
- 1
- Article number:
- bbag084
- Publication date:
- 2026-02-26
- Acceptance date:
- 2026-01-24
- DOI:
- EISSN:
-
1477-4054
- ISSN:
-
1467-5463
- Language:
-
English
- Keywords:
- Pubs id:
-
2382059
- Local pid:
-
pubs:2382059
- Source identifiers:
-
3802808
- Deposit date:
-
2026-02-26
- ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.
Terms of use
- Copyright date:
- 2026
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record