Using RNA-Seq to Profile Soybean Seed Development from Fertilization to Maturity
Open Access
- 15 March 2013
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 8 (3), e59270
- https://doi.org/10.1371/journal.pone.0059270
Abstract
To understand gene expression networks leading to functional properties and compositional traits of the soybean seed, we have undertaken a detailed examination of soybean seed development from a few days post-fertilization to the mature seed using Illumina high-throughput transcriptome sequencing (RNA-Seq). RNA was sequenced from seven different stages of seed development, yielding between 12 million and 78 million sequenced transcripts. These have been aligned to the 79,000 gene models predicted from the soybean genome recently sequenced by the Department of Energy Joint Genome Institute. Over one hundred gene models were identified with high expression exclusively in young seed stages, starting at just four days after fertilization. These were annotated as being related to many basic components and processes such as histones and proline-rich proteins. Genes encoding storage proteins such as glycinin and beta-conglycinin had their highest expression levels at the stages of largest fresh weight, confirming previous knowledge that these storage products are being rapidly accumulated before the seed begins the desiccation process. Other gene models showed high expression in the dry, mature seeds, perhaps indicating the preparation of pathways needed later, in the early stages of imbibition. Many highly-expressed gene models at the dry seed stage are, as expected, annotated as hydrophilic proteins associated with low water conditions, such as late embryogenesis abundant (LEA) proteins and dehydrins, which help preserve the cellular structures and nutrients within the seed during desiccation. More significantly, the power of RNA-Seq to detect genes expressed at low levels revealed hundreds of transcription factors with notable expression in at least one stage of seed development. Results from a second biological replicate demonstrate high reproducibility of these data revealing a comprehensive view of the transciptome of seed development in the cultivar Williams, the reference cultivar for the first soybean genome sequence.Keywords
This publication has 39 references indexed in Scilit:
- Identification of soybean seed developmental stage-specific and tissue-specific miRNA targets by degradome sequencingBMC Genomics, 2012
- Phytozome: a comparative platform for green plant genomicsNucleic Acids Research, 2011
- Genome-Wide Survey and Expression Analysis of the Plant-Specific NAC Transcription Factor Family in Soybean During Development and Dehydration StressDNA Research, 2011
- Gene coexpression clusters and putative regulatory elements underlying seed storage reserve accumulation in ArabidopsisBMC Genomics, 2011
- Flux of transcript patterns during soybean seed developmentBMC Genomics, 2010
- Endogenous, Tissue-Specific Short Interfering RNAs Silence the Chalcone Synthase Gene Family inGlycine maxSeed CoatsPlant Cell, 2009
- The Arabidopsis NFYA5 Transcription Factor Is Regulated Transcriptionally and Posttranscriptionally to Promote Drought ResistancePlant Cell, 2008
- Mapping and quantifying mammalian transcriptomes by RNA-SeqNature Methods, 2008
- Using Genomics to Study Legume Seed DevelopmentPlant Physiology, 2007
- Specific elements of the glyoxylate pathway play a significant role in the functional transition of the soybean cotyledon during seedling developmentBMC Genomics, 2007