Initiation of the Primate Genome Project

2023-01-07 20:42Dong-DongWu,Xiao-GuangQi,LiYu
Zoological Research 2022年2期

A crucial step for understanding human evolution is to identify the genomic changes that occurred during primate evolution,thus allowing investigators to reconstruct the ancestral states preceding the human condition.In the past several decades,the primate clade has been a research focus in genome sequencing due to its unique phylogenetic position and key importance.Comparative genomic analyses of several primate lineages have radically expanded our knowledge on the tempo and mode of different features in primate genome evolution,revealing many genomic innovations contributing to the development and evolution of human phenotypes.However,with less than 10% of primate species currently sequenced, a considerable gap remains regarding the evolutionary history of every base pair in human and non-human primate (NHP)genomes.To fill this gap, we propose to organize and establish the Primate Genome Project (PGP) to scale up the number of high-quality reference genome assemblies for primate species using cutting-edge sequencing technologies.We outline here the possible paths going forward and some of the major questions to be addressed within this ambitious project.We anticipate that genomic comparisons, including broader taxon sampling of extant primate species, will significantly contribute to our understanding of the evolution of human phenotypes and diseases and the genomic mechanisms of primate speciation and adaptation, which will ultimately assist in primate conservation efforts.

Worldwide, there are currently more than 500 primate species from 80 genera and 16 families, with new primate species still being discovered in recent years (Fan et al., 2017;Nater et al., 2017; Roos et al., 2020).As our closest biological relatives, NHPs hold many clues for understanding the origin and evolution of human complex traits, behaviors, and diseases.NHPs are widely used as biomedical research models for studying the genetic basis of human diseases (e.g.,neurodegeneration, diabetes, cancer, and cardiovascular disease), as well as models for reproduction, transplantation,and pharmacology (Rogers & Gibbs, 2014).Furthermore, over

This is an open-access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits unrestricted non-commercial use, distribution, and reproduction in any medium,provided the original work is properly cited.

Copyright ©2022 Editorial Office of Zoological Research, Kunming

This is an open-access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits unrestricted non-commercial use, distribution, and reproduction in any medium,provided the original work is properly cited.

Copyright ©2022 Editorial Office of Zoological Research, Kunming the last two years, NHPs have played critical roles in the study of SAR-CoV-2 infection mechanisms and vaccine development (Chandrashekar et al., 2020; Mercado et al.,2020).Therefore, current primate genome sequencing efforts have prioritized the great ape lineage, i.e., taxa closest to humans, and other biomedically relevant species (Carbone et al., 2014; Gibbs et al., 2007; Gordon et al., 2016; Locke et al.,2011; Scally et al., 2012; The Chimpanzee Sequencing and Analysis Consortium, 2005; The Marmoset Genome Sequencing and Analysis Consortium, 2014; Warren et al.,2020), with 90% of primate species yet to be characterized.Due to anthropogenic interference, climate change, and other factors, some 60% of the world’s primate species are threatened with extinction and 75% are experiencing population declines (Estrada et al., 2017), thus highlighting an impending extinction crisis.Therefore, the phylogenomic study of primates is critical for broadening our knowledge on the evolutionary and adaptive history of each species, and for providing relevant information for conservation decisions going forward.

Currently, the genomes of a dozen representative NHP species from 22 genera have been published (Ensembl v103),with 72% of genera not yet sequenced.The PGP aims to close this gap and generate long-read-based high-quality reference genomes for at least one representative species in each primate genus in the next few years.Ultimately, we wish to determine the genome sequences for all primate species.These high-quality genomes will provide rapid information for comparisons in the context of primate phylogeny to reconstruct the primate tree of life and clarify the genomic changes underlying the speciation and adaptation processes of major primate lineages.We anticipate that a detailed evolutionary landscape will be disclosed for all genomic variations across primate lineages from chromosomal rearrangements to single base-pair substitutions.This landscape will inform the evolutionary patterns of structural variations, segmental duplications, protein-coding genes, and non-coding regulatory elements.By integrating the metarecords of trait data, genomic information will enable us to

Received: 04 January 2022; Accepted: 06 January 2022; Online: 06 January 2022

Foundation items: This work was supported by the National Natural Science Foundation of China (31822048) and Strategic Priority

Received: 04 January 2022; Accepted: 06 January 2022; Online: 06 January 2022

Foundation items: This work was supported by the National Natural Science Foundation of China (31822048) and Strategic Priorityinvestigate phylogeny, incomplete lineage sorting, and chromosomal evolution, and to map genetic changes associated with human-specific phenotypes (such as brain and cognitive ability), and phenotypic diversity among primate lineages that may contribute to their adaptive evolution.

Due to the difficulties in collecting samples of all primate species, it may not be feasible to sequence all individual primate species within a short time.Nevertheless, depending on sample availability, we will select certain genera as primary sequencing targets and generate reference genomes and population sequences for all recorded species in those genera.This will allow us to investigate the genetic basis of speciation and characterize the history and genomic outcomes of hybridization, as observed in major primate lineages(Ayoola et al., 2021; de Manuel et al., 2016; Svardal et al.,2017).Conservation genomic analyses of population data will be highly informative regarding genomic diversity and recent demographic history, which are crucial parameters for evaluating the conservation status of species (Orkin et al.,2021).

We expect that the PGP will generate a rich resource for primate lineage genomics over the next few years.Given the advances in sequencing technologies, it is important to initiate interdisciplinary cooperation and joint data analysis by integrating expertise in primatology, conservation,biomedicine, genomics, and evolutionary biology.Therefore,we welcome researchers to join the consortium and contribute to this necessary and timely research.

COMPETING INTERESTS

The authors declare that they have no competing interests.

AUTHORS' CONTRIBUTIONS

D.D.W.and G.Z.conceived the review and prepared the draft.All authors contributed to the discussions.All authors read and approved the final version of the manuscript.

Dong-Dong Wu1,2,3,4,*, Xiao-Guang Qi5, Li Yu6, Ming Li4,7,Zhi-Jin Liu7, Anne D.Yoder8, Christian Roos9,Takashi Hayakawa10,11, Jeffrey Rogers12, Tomas Marques-Bonet13,14,15,16, Bing Su1,4, Yong-Gang Yao2,17,18,Ya-Ping Zhang1,4,18, Guojie Zhang1,4,19,20,*1State Key Laboratory of Genetic Resources and Evolution,Kunming Institute of Zoology,Chinese Academy of Sciences,Kunming,Yunnan650223,China

2National Resource Center for Non-Human Primates,Kunming Primate Research Center,and National Research Facility for Phenotypic&Genetic Analysis of Model Animals (Primate Facility),Kunming Institute of Zoology,Chinese Academy of Sciences,Kunming,Yunnan650107,China

3Kunming Natural History Museum of Zoology,Kunming Institute of Zoology,Chinese Academy of Sciences,Kunming,Yunnan650223,China

4Center for Excellence in Animal Evolution and Genetics,Chinese Academy of Sciences,Kunming,Yunnan650223,China

5Shaanxi Key Laboratory for Animal Conservation,College of Life Sciences,Northwest University,Xi’an,Shaanxi710069,China

6State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan,Yunnan University,Kunming,Yunnan650091,China

7CAS Key Laboratory of Animal Ecology and Conservation Biology,Institute of Zoology,Chinese Academy of Sciences,Beijing100101,China

8Department of Biology,Duke University,Durham,NC27708,USA

9Gene Bank of Primates and Primate Genetics Laboratory,German Primate Center,Leibniz Institute for Primate Research,Göttingen37077,Germany

10Faculty of Environmental Earth Science,Hokkaido University,Sapporo,Hokkaido060-0810,Japan

11Japan Monkey Centre,Inuyama,Aichi484-0081,Japan

12Human Genome Sequencing Center,Department of Molecular and Human Genetics,Baylor College of Medicine,Houston,TX77030,USA

13Institute of Evolutionary Biology (UPF-CSIC),PRBB,Dr.Aiguader 88,Barcelona08003,Spain

14Catalan Institution of Research and Advanced Studies (ICREA),Passeig de Lluís Companys,23,Barcelona08010,Spain

15CNAG-CRG,Centre for Genomic Regulation (CRG),Barcelona Institute of Science and Technology (BIST),Baldiri i Reixac 4,Barcelona08028,Spain

16Institut Català de Paleontologia Miquel Crusafont,Universitat Autònoma de Barcelona,Edifici ICTA-ICP,c/ Columnes s/n,Cerdanyola del Vallès,Barcelona08193,Spain

17Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences&Yunnan Province,and KIZ-CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases,Kunming Institute of Zoology,Chinese Academy of Sciences,Kunming,Yunnan650201,China

18Kunming College of Life Science,University of Chinese Academy of Sciences,Kunming,Yunnan650204,China

19Villum Centre for Biodiversity Genomics,Section for Ecology and Evolution,Department of Biology,University of Copenhagen,Copenhagen DK-2100,Denmark

20China National GeneBank,BGI-Shenzhen,Shenzhen,Guangdong518083,China

*Corresponding authors, E-mail: wudongdong@mail.kiz.ac.cn;guojie.zhang@bio.ku.dk