An 8.22 Mb Assembly and Annotation of the Alpaca (Vicugna pacos) Y Chromosome
Open Access
- 16 January 2021
- Vol. 12 (1), 105
- https://doi.org/10.3390/genes12010105
Abstract
The unique evolutionary dynamics and complex structure make the Y chromosome the most diverse and least understood region in the mammalian genome, despite its undisputable role in sex determination, development, and male fertility. Here we present the first contig-level annotated draft assembly for the alpaca (Vicugna pacos) Y chromosome based on hybrid assembly of short- and long-read sequence data of flow-sorted Y. The latter was also used for cDNA selection providing Y-enriched testis transcriptome for annotation. The final assembly of 8.22 Mb comprised 4.5 Mb of male specific Y (MSY) and 3.7 Mb of the pseudoautosomal region. In MSY, we annotated 15 X-degenerate genes and two novel transcripts, but no transposed sequences. Two MSY genes, HSFY and RBMY, are multicopy. The pseudoautosomal boundary is located between SHROOM2 and HSFY. Comparative analysis shows that the small and cytogenetically distinct alpaca Y shares most of MSY sequences with the larger dromedary and Bactrian camel Y chromosomes. Most of alpaca X-degenerate genes are also shared with other mammalian MSYs, though WWC3Y is Y-specific only in alpaca/camels and the horse. The partial alpaca Y assembly is a starting point for further expansion and will have applications in the study of camelid populations and male biology.This publication has 85 references indexed in Scilit:
- Strict evolutionary conservation followed rapid gene loss on human and rhesus Y chromosomesNature, 2012
- Genome sequences of wild and domestic bactrian camelsNature Communications, 2012
- Chimpanzee and human Y chromosomes are remarkably divergent in structure and gene contentNature, 2010
- The Sequence Alignment/Map format and SAMtoolsBioinformatics, 2009
- Gene discovery and comparative analysis of X-degenerate genes from the domestic cat Y chromosomeGenomics, 2008
- MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomesGenome Research, 2007
- RNAmmer: consistent and rapid annotation of ribosomal RNA genesNucleic Acids Research, 2007
- Locating proteins in the cell using TargetP, SignalP and related toolsNature Protocols, 2007
- The male-specific region of the human Y chromosome is a mosaic of discrete sequence classesNature, 2003
- Predicting transmembrane protein topology with a hidden markov model: application to complete genomesJournal of Molecular Biology, 2001