Sequence coverage required for accurate genotyping by sequencing in polyploid species

Lin Wang, Jixuan Yang, Hong Zhang, Qin Tao, Yuxin Zhang, Zhenyu Dang, Fengjun Zhang, Zewei Luo

Research output: Contribution to journalArticlepeer-review

56 Downloads (Pure)

Abstract

Polyploidy plays an important role in the evolution of eukaryotes, especially for flower-ing plants. Many of ecologically or agronomically important plant or crop species are polyploids, including sycamore maple (tetraploid), the world second and third largest food crops wheat (hexaploid) and potato (tetraploid) as well as economically important aquaculture animals such as Atlantic salmon and trout. The next generation sequencing data enables to allocate genotype at a sequence variant site, known as genotyping by sequencing (GBS). GBS has stimulated enormous interests in population based genom-ics studies in almost all diploid and many polyploid organisms. DNA sequence polymor-phisms are codominant and thus fully informative about the underlying genotype at the polymorphic site, making GBS a straightforward task in diploids. However, sequence data may usually be uninformative in polyploid species, making GBS a far more challeng-ing task in polyploids. This paper presents novel and rigorous statistical methods for pre-dicting the number of sequence reads needed to ensure accurate GBS at a polymorphic site bared by the reads in polyploids and shows that a dozen of reads can ensure a prob-ability of 95% to recover all constituent alleles of any tetraploid genotype but several hundreds of reads are needed to accurately uncover the genotype with probability con-fidence of 90%, subverting the proposition of GBS using low coverage sequence data in the literature. The theoretical prediction was tested by use of RAD-seq data from tetraploid potato cultivars. The paper provides polyploid experimentalists with theo-retical guides and methods for designing and conducting their sequence- based studies.
Original languageEnglish
Number of pages10
JournalMolecular ecology resources
Early online date26 Nov 2021
DOIs
Publication statusE-pub ahead of print - 26 Nov 2021

Keywords

  • genotyping by sequencing
  • polyploids
  • RAD-seq data
  • sequence coverage
  • Solanum tuberosum L

Fingerprint

Dive into the research topics of 'Sequence coverage required for accurate genotyping by sequencing in polyploid species'. Together they form a unique fingerprint.

Cite this