Telomere-to-telomere (T2T) phased assemblies are emerging as a benchmark for reference-quality genomes1,17, though they remain technically and financially demanding, particularly at scale. Generating such assemblies for diploid and polyploid genomes typically involves combining high-accuracy long reads, such as PacBio HiFi16 or the now-deprecated ONT Duplex2 reads, with ultra-long ONT Simplex reads. Using multiple platforms or methods increases the cost and the required amount of genomic DNA. Here, we show that comparable results are possible using error correction of ultra-long ONT Simplex reads and then assembling them using state-of-the-art de novo assembly methods. To achieve this, we have developed the deep learning-based HERRO (Haplotype-aware ERRor cOrrection) framework, which corrects ONT Simplex reads while carefully preserving differences in related genomic sequences. Taking into account informative positions that differentiate the haplotypes or genomic repeat copies, HERRO achieves an increase of read accuracy of up to 100-fold for diploid human genomes. By combining HERRO with the Verkko17 assembler, we reconstruct up to 32 chromosomes telomere-to-telomere, including chromosomes X and Y, and consistently achieve NGA50 values of 100 Mb or higher across several human genomes. HERRO supports both R9.4.1 and R10.4.1 ONT Simplex reads and generalizes well to other species. These results show that error-corrected ONT reads can lower sequencing costs and improve the quality of genomic analyses.
Telomere-to-Telomere Assembly Using HERRO-Corrected Simplex Nanopore Reads
Why This Matters
This breakthrough demonstrates that error correction of ultra-long ONT Simplex reads using the HERRO framework enables high-quality, cost-effective telomere-to-telomere genome assemblies. This advancement reduces reliance on multiple sequencing platforms and lowers overall costs, making high-quality genome assembly more accessible for research and clinical applications. It paves the way for more comprehensive and affordable genomic analyses across diverse species.
Key Takeaways
- HERRO improves ONT read accuracy up to 100-fold.
- The method enables telomere-to-telomere assemblies of human genomes.
- This approach reduces costs and simplifies the genome assembly process.
Get alerts for these topics