The celery genome sequence reveals sequential paleo‐polyploidizations, karyotype evolution and resistance gene reduction in apiales
Open Access
- 23 October 2020
- journal article
- research article
- Published by Wiley in Plant Biotechnology Journal
- Vol. 19 (4), 731-744
- https://doi.org/10.1111/pbi.13499
Abstract
Celery (Apium graveolens L. 2n = 2x = 22), a member of the Apiaceae family, is among the most important and globally grown vegetables. Here, we report a high‐quality genome sequence assembly, anchored to 11 chromosomes, with total length of 3.33 Gb and N50 scaffold length of 289.78 Mb. Most (92.91%) of the genome is composed of repetitive sequences, with 62.12% of 31,326 annotated genes confined to the terminal 20% of chromosomes. Simultaneous bursts of shared long‐terminal repeats (LTRs) in different Apiaceae plants suggest inter‐specific exchanges. Two ancestral polyploidizations were inferred, one shared by Apiales taxa and the other confined to Apiaceae. We reconstructed 8 Apiales proto‐chromosomes, inferring their evolutionary trajectories from the eudicot‐common ancestor to extant plants. Transcriptome sequencing in three tissues (roots, leaves, and petioles), and varieties with different‐coloured petioles, revealed 4 and 2 key genes in pathways regulating anthocyanin and coumarin biosynthesis, respectively. A remarkable paucity of NBS disease‐resistance genes in celery (62) and other Apiales was explained by extensive loss and limited production of these genes during the last ~10 million years, raising questions about their biotic defense mechanisms and motivating research into effects of chemicals, e.g. coumarins, that give off distinctive odors. Celery genome sequencing and annotation facilitates further research into important gene functions and breeding, and comparative genomic analyses in Apiales.Keywords
Funding Information
- China Postdoctoral Science Foundation (2020M673188)
- National Natural Science Foundation of China (31801856 to X.S, 31371282, 2016YFD0101001 to X.W)
This publication has 88 references indexed in Scilit:
- Infernal 1.1: 100-fold faster RNA homology searchesBioinformatics, 2013
- MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearityNucleic Acids Research, 2012
- A fast, lock-free approach for efficient parallel counting of occurrences of k-mersBioinformatics, 2011
- Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiationNature Biotechnology, 2010
- The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phylaNature, 2007
- LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposonsNucleic Acids Research, 2007
- PAML 4: Phylogenetic Analysis by Maximum LikelihoodMolecular Biology and Evolution, 2007
- De novo identification of repeat families in large genomesBioinformatics, 2005
- Improving the Arabidopsis genome annotation using maximal transcript alignment assembliesNucleic Acids Research, 2003
- BLAT—The BLAST-Like Alignment ToolGenome Research, 2002