Automated Statistical Analysis of Protein Abundance Ratios from Data Generated by Stable-Isotope Dilution and Tandem Mass Spectrometry
- 28 October 2003
- journal article
- research article
- Published by American Chemical Society (ACS) in Analytical Chemistry
- Vol. 75 (23), 6648-6657
- https://doi.org/10.1021/ac034633i
Abstract
We describe an algorithm for the automated statistical analysis of protein abundance ratios (ASAPRatio) of proteins contained in two samples. Proteins are labeled with distinct stable-isotope tags and fragmented, and the tagged peptide fragments are separated by liquid chromatography (LC) and analyzed by electrospray ionization (ESI) tandem mass spectrometry (MS/MS). The algorithm utilizes the signals recorded for the different isotopic forms of peptides of identical sequence and numerical and statistical methods, such as Savitzky−Golay smoothing filters, statistics for weighted samples, and Dixon's test for outliers, to evaluate protein abundance ratios and their associated errors. The algorithm also provides a statistical assessment to distinguish proteins of significant abundance changes from a population of proteins of unchanged abundance. To evaluate its performance, two sets of LC-ESI-MS/MS data were analyzed by the ASAPRatio algorithm without human intervention, and the data were related to the expected and manually validated values. The utility of the ASAPRatio program was clearly demonstrated by its speed and the accuracy of the generated protein abundance ratios and by its capability to identify specific core components of the RNA polymerase II transcription complex within a high background of copurifying proteins.Keywords
This publication has 19 references indexed in Scilit:
- The Application of New Software Tools to Quantitative Protein Profiling Via Isotope-coded Affinity Tag (ICAT) and Tandem Mass SpectrometryMolecular & Cellular Proteomics, 2003
- Proteomics: the first decade and beyondNature Genetics, 2003
- A proteomics strategy to elucidate functional protein-protein interactions applied to EGF signalingNature Biotechnology, 2003
- Empirical Statistical Model To Estimate the Accuracy of Peptide Identifications Made by MS/MS and Database SearchAnalytical Chemistry, 2002
- De novo peptide sequencing and quantitative profiling of complex protein mixtures using mass-coded abundance taggingNature Biotechnology, 2002
- Integrated Genomic and Proteomic Analyses of a Systematically Perturbed Metabolic NetworkScience, 2001
- Probability-based protein identification by searching sequence databases using mass spectrometry dataElectrophoresis, 1999
- High Throughput Proteome-Wide Precision Measurements of Protein Expression Using Mass SpectrometryJournal of the American Chemical Society, 1999
- Frequency-domain spectroscopic study of the effect of n-propanol on the internal viscosity of sodium dodecyl sulfate micellesAnalytical Chemistry, 1991
- Some Developments in Nuclear Magnetic Resonance of SolidsScience, 1989