How many human genes can be defined as housekeeping with current expression data?
Open Access
- 16 April 2008
- journal article
- Published by Springer Science and Business Media LLC in BMC Genomics
- Vol. 9 (1), 172
- https://doi.org/10.1186/1471-2164-9-172
Abstract
Background Housekeeping (HK) genes are ubiquitously expressed in all tissue/cell types and constitute a basal transcriptome for the maintenance of basic cellular functions. Partitioning transcriptomes into HK and tissue-specific (TS) genes relatively is fundamental for studying gene expression and cellular differentiation. Although many studies have aimed at large-scale and thorough categorization of human HK genes, a meaningful consensus has yet to be reached. Results We collected two latest gene expression datasets (both EST and microarray data) from public databases and analyzed the gene expression profiles in 18 human tissues that have been well-documented by both two data types. Benchmarked by a manually-curated HK gene collection (HK408), we demonstrated that present data from EST sampling was far from saturated, and the inadequacy has limited the gene detectability and our understanding of TS expressions. Due to a likely over-stringent threshold, microarray data showed higher false negative rate compared with EST data, leading to a significant underestimation of HK genes. Based on EST data, we found that 40.0% of the currently annotated human genes were universally expressed in at least 16 of 18 tissues, as compared to only 5.1% specifically expressed in a single tissue. Our current EST-based estimate on human HK genes ranged from 3,140 to 6,909 in number, a ten-fold increase in comparison with previous microarray-based estimates. Conclusion We concluded that a significant fraction of human genes, at least in the currently annotated data depositories, was broadly expressed. Our understanding of tissue-specific expression was still preliminary and required much more large-scale and high-quality transcriptomic data in future studies. The new HK gene list categorized in this study will be useful for genome-wide analyses on structural and functional features of HK genes.This publication has 48 references indexed in Scilit:
- Housekeeping genes tend to show reduced upstream sequence conservationGenome Biology, 2007
- Genome-wide transcription and the implications for genomic organizationNature Reviews Genetics, 2007
- Repetitive sequence environment distinguishes housekeeping genesGene, 2007
- Reactome: a knowledge base of biologic pathways and processesGenome Biology, 2007
- The UCSC genome browser database: update 2007Nucleic Acids Research, 2006
- NCBI GEO: mining tens of millions of expression profiles--database and tools updateNucleic Acids Research, 2006
- Multiplex sequencing of paired-end ditags (MS-PET): a strategy for the ultra-high-throughput analysis of transcriptomes and genomesNucleic Acids Research, 2006
- A gene atlas of the mouse and human protein-encoding transcriptomesProceedings of the National Academy of Sciences of the United States of America, 2004
- BLAT—The BLAST-Like Alignment ToolGenome Research, 2002
- Control Genes and Variability: Absence of Ubiquitous Reference Transcripts in Diverse Mammalian Expression StudiesGenome Research, 2002