ArrayExpress update—simplifying data submissions
Top Cited Papers
Open Access
- 31 October 2014
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 43 (D1), D1113-D1116
- https://doi.org/10.1093/nar/gku1057
Abstract
The ArrayExpress Archive of Functional Genomics Data (http://www.ebi.ac.uk/arrayexpress) is an international functional genomics database at the European Bioinformatics Institute (EMBL-EBI) recommended by most journals as a repository for data supporting peer-reviewed publications. It contains data from over 7000 public sequencing and 42 000 array-based studies comprising over 1.5 million assays in total. The proportion of sequencing-based submissions has grown significantly over the last few years and has doubled in the last 18 months, whilst the rate of microarray submissions is growing slightly. All data in ArrayExpress are available in the MAGE-TAB format, which allows robust linking to data analysis and visualization tools and standardized analysis. The main development over the last two years has been the release of a new data submission tool Annotare, which has reduced the average submission time almost 3-fold. In the near future, Annotare will become the only submission route into ArrayExpress, alongside MAGE-TAB format-based pipelines. ArrayExpress is a stable and highly accessed resource. Our future tasks include automation of data flows and further integration with other EMBL-EBI resources for the representation of multi-omics data.Keywords
This publication has 14 references indexed in Scilit:
- Expression Atlas update—a database of gene and transcript expression from microarray- and sequencing-based functional genomics experimentsNucleic Acids Research, 2013
- Database Citation in Full Text Biomedical ArticlesPLOS ONE, 2013
- Reuse of public genome-wide gene expression dataNature Reviews Genetics, 2012
- NCBI GEO: archive for functional genomics data sets—updateNucleic Acids Research, 2012
- ArrayExpress update—trends in database growth and links to data analysis toolsNucleic Acids Research, 2012
- Annotare—a tool for annotating high-throughput biomedical investigations and resulting dataBioinformatics, 2010
- Modeling sample variables with an Experimental Factor OntologyBioinformatics, 2010
- Repeatability of published microarray gene expression analysesNature Genetics, 2009
- ArrayExpress--a public repository for microarray gene expression data at the EBINucleic Acids Research, 2003
- Minimum information about a microarray experiment (MIAME)—toward standards for microarray dataNature Genetics, 2001