Plant Diversity ›› 2025, Vol. 47 ›› Issue (01): 115-126.DOI: 10.1016/j.pld.2024.10.002

• Articles • Previous Articles    

Assembly-free reads accurate identification (AFRAID) approach outperforms other methods of DNA barcoding in the walnut family (Juglandaceae)

Yanlei Liua, Kai Chena, Lihu Wanga, Xinqiang Yua, Chao Xub, Zhili Suob, Shiliang Zhoub, Shuo Shic, Wenpan Dongd   

  1. a. School of Landscape and Ecological Engineering, Hebei University of Engineering, Handan 056038, China;
    b. State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China;
    c. College of Life Science, Hebei Normal University, Shijiazhuang 050024, China;
    d. School of Ecology and Nature Conservation, Beijing Forestry University, Beijing 100083, China
  • Received:2024-02-16 Revised:2024-09-30 Published:2025-02-15
  • Contact: Shiliang Zhou,E-mail:slzhou@ibcas.ac.cn;Shuo Shi,E-mail:shishuo@hebtu.edu.cn;Wenpan Dong,E-mail:wpdong@bjfu.edu.cn
  • Supported by:
    This study was partly supported by the funds from Natural Science Foundation of Hebei Province (C2022402017).

Abstract: DNA barcoding has been extensively used for species identification. However, species identification of mixed samples or degraded DNA is limited by current DNA barcoding methods. In this study, we use plant species in Juglandaceae to evaluate an assembly-free reads accurate identification (AFRAID) method of species identification, a novel approach for precise species identification in plants. Specifically, we determined (1) the accuracy of DNA barcoding approaches in delimiting species in Juglandaceae, (2) the minimum size of chloroplast dataset for species discrimination, and (3) minimum amount of next generation sequencing (NGS) data required for species identification. We found that species identification rates were highest when whole chloroplast genomes were used, followed by taxon-specific DNA barcodes, and then universal DNA barcodes. Species identification of 100% was achieved when chloroplast genome sequence coverage reached 20% and the original sequencing data reached 500,000 reads. AFRAID accurately identified species for all samples tested after 500,000 clean reads, with far less computing time than common approaches. These results provide a new approach to accurately identify species, overcoming limitations of traditional DNA barcodes. Our method, which uses next generation sequencing to generate partial chloroplast genomes, reveals that DNA barcode regions are not necessarily fixed, accelerating the process of species identification.

Key words: DNA barcode, Species identification, Random DNA barcode, Juglandaceae, Assembly-free