Genome Sequencing and Assembly by Long Reads in Plants
植物基因组的长读测序与组装
Abstract:Plant genomes generated by Sanger and Next Generation Sequencing (NGS) have provided
insight into species diversity and evolution. However, Sanger sequencing is limited in its applications
due to high cost, labor intensity, and low throughput, while NGS reads are too short to resolve
abundant repeats and polyploidy, leading to incomplete or ambiguous assemblies. The advent and
improvement of long-read sequencing by Third Generation Sequencing (TGS) methods such as PacBio
and Nanopore have shown promise in producing high-quality assemblies for complex genomes.
Here, we review the development of sequencing, introducing the application as well as considerations
of experimental design in TGS of plant genomes. We also introduce recent revolutionary scaffolding
technologies including BioNano, Hi-C, and 10 Genomics. We expect that the informative guidance
for genome sequencing and assembly by long reads will benefit the initiation of scientists’ projects.
Keywords:genome assembly; long reads; Sanger sequencing; Next Generation Sequencing; Third
Generation Sequencing
文摘:
由桑格和下一代测序(NGS)产生的植物基因组为物种多样性和进化提供了深入的研究。
然而,由于Sanger测序成本高、劳动强度大、吞吐量低,其应用受到限制,而NGS读取时间太短,无法解析丰富的重复和多倍体,导致组装不完整或不明确。
通过第三代测序(TGS)方法的出现和改进,如PacBio和Nanopore,在为复杂基因组生产高质量组装物方面显示出了希望。
本文综述了测序技术的发展,介绍了实验设计在植物基因组测序中的应用和注意事项。
我们还推出了最新的革命性支架技术,包括BioNano、Hi-C和10 Genomics。
我们期望,通过长reads对基因组测序和组装的信息指导将有利于科学家项目的启动。
关键词:基因组组装;长阅读;桑格测序;下一代测序;第三代测序