Journal article
TransRate: reference-free quality assessment of de novo transcriptome assemblies
- Abstract:
- TransRate is a tool for reference-free quality assessment of de novo transcriptome assemblies. Using only the sequenced reads and the assembly as input, we show that multiple common artifacts of de novo transcriptome assembly can be readily detected. These include chimeras, structural errors, incomplete assembly, and base errors. TransRate evaluates these errors to produce a diagnostic quality score for each contig, and these contig scores are integrated to evaluate whole assemblies. Thus, TransRate can be used for de novo assembly filtering and optimization as well as comparison of assemblies generated using different methods from the same input reads. Applying the method to a data set of 155 published de novo transcriptome assemblies, we deconstruct the contribution that assembly method, read length, read quantity, and read quality make to the accuracy of de novo transcriptome assemblies and reveal that variance in the quality of the input data explains 43% of the variance in the quality of published de novo transcriptome assemblies. Because TransRate is reference-free, it is suitable for assessment of assemblies of all types of RNA, including assemblies of long noncoding RNA, rRNA, mRNA, and mixed RNA samples.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 1.3MB, Terms of use)
-
(Supplementary materials, zip, 5.2MB, Terms of use)
-
- Publisher copy:
- 10.1101/gr.196469.115
Authors
+ Biotechnology and Biological Sciences Research Council
More from this funder
- Funder identifier:
- https://ror.org/00cwqg982
- Publisher:
- Cold Spring Harbor Laboratory Press
- Journal:
- Genome Research More from this journal
- Volume:
- 26
- Issue:
- 8
- Pages:
- 1134-1144
- Publication date:
- 2016-06-01
- Acceptance date:
- 2016-05-27
- DOI:
- EISSN:
-
1549-5469
- ISSN:
-
1088-9051
- Language:
-
English
- Keywords:
- Pubs id:
-
pubs:628076
- UUID:
-
uuid:dfcb0f24-06fa-4510-90ff-082a8ce61ed9
- Local pid:
-
pubs:628076
- Source identifiers:
-
628076
- Deposit date:
-
2016-07-04
Terms of use
- Copyright holder:
- Smith-Unna et al
- Copyright date:
- 2016
- Rights statement:
- © 2016 Smith-Unna et al.; Published by Cold Spring Harbor Laboratory Press This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record