Formalization, Annotation and Analysis of Diverse Drug and Probe Screening Assay Datasets Using the BioAssay Ontology (BAO)
Open Access
- 14 November 2012
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 7 (11), e49198
- https://doi.org/10.1371/journal.pone.0049198
Abstract
Huge amounts of high-throughput screening (HTS) data for probe and drug development projects are being generated in the pharmaceutical industry and more recently in the public sector. The resulting experimental datasets are increasingly being disseminated via publically accessible repositories. However, existing repositories lack sufficient metadata to describe the experiments and are often difficult to navigate by non-experts. The lack of standardized descriptions and semantics of biological assays and screening results hinder targeted data retrieval, integration, aggregation, and analyses across different HTS datasets, for example to infer mechanisms of action of small molecule perturbagens. To address these limitations, we created the BioAssay Ontology (BAO). BAO has been developed with a focus on data integration and analysis enabling the classification of assays and screening results by concepts that relate to format, assay design, technology, target, and endpoint. Previously, we reported on the higher-level design of BAO and on the semantic querying capabilities offered by the ontology-indexed triple store of HTS data. Here, we report on our detailed design, annotation pipeline, substantially enlarged annotation knowledgebase, and analysis results. We used BAO to annotate assays from the largest public HTS data repository, PubChem, and demonstrate its utility to categorize and analyze diverse HTS results from numerous experiments. BAO is publically available from the NCBO BioPortal at http://bioportal.bioontology.org/ontologies/1533. BAO provides controlled terminology and uniform scope to report probe and drug discovery screening assays and results. BAO leverages description logic to formalize the domain knowledge and facilitate the semantic integration with diverse other resources. As a consequence, BAO offers the potential to infer new knowledge from a corpus of assay results, for example molecular mechanisms of action of perturbagens.Keywords
This publication has 41 references indexed in Scilit:
- Identification of Small-Molecule Inhibitors of the Colorectal Cancer Oncogene Krüppel-like Factor 5 Expression by Ultrahigh-Throughput ScreeningMolecular Cancer Therapeutics, 2011
- BioAssay Ontology Annotations Facilitate Cross-Analysis of Diverse High-Throughput Screening Data SetsSLAS Discovery, 2011
- PubChem as a public resource for drug discoveryDrug Discovery Today, 2010
- PubChem: a public information system for analyzing bioactivities of small moleculesNucleic Acids Research, 2009
- From disease ontology to disease-ontology lite: statistical methods to adapt a general-purpose ontology for the test of gene-ontology associationsBioinformatics, 2009
- Promiscuous Aggregate-Based Inhibitors Promote Enzyme UnfoldingJournal of Medicinal Chemistry, 2009
- A bioinformatics analysis of the cell line nomenclatureBioinformatics, 2008
- Comprehensive Mechanistic Analysis of Hits from High-Throughput and Docking Screens against β-LactamaseJournal of Medicinal Chemistry, 2008
- The OBO Foundry: coordinated evolution of ontologies to support biomedical data integrationNature Biotechnology, 2007
- Krüppel-like factor 5 mediates the transforming activity of oncogenic H-RasOncogene, 2004