SR4GN: A Species Recognition Software Tool for Gene Normalization
Open Access
- 5 June 2012
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 7 (6), e38460
- https://doi.org/10.1371/journal.pone.0038460
Abstract
As suggested in recent studies, species recognition and disambiguation is one of the most critical and challenging steps in many downstream text-mining applications such as the gene normalization task and protein-protein interaction extraction. We report SR4GN: an open source tool for species recognition and disambiguation in biomedical text. In addition to the species detection function in existing tools, SR4GN is optimized for the Gene Normalization task. As such it is developed to link detected species with corresponding gene mentions in a document. SR4GN achieves 85.42% in accuracy and compares favorably to the other state-of-the-art techniques in benchmark experiments. Finally, SR4GN is implemented as a standalone software tool, thus making it convenient and robust for use in many text-mining applications. SR4GN can be downloaded at: http://www.ncbi.nlm.nih.gov/CBBresearch/Lu/downloads/SR4GN
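To make the task concrete, the following is a minimal sketch of what linking detected species to gene mentions can look like. It is not SR4GN's actual method, which the abstract does not describe. The toy lexicon, the function names find_species and assign_species, the nearest-mention heuristic, and the fallback to human when no species is found are all assumptions made for illustration.

```python
import re

# Hypothetical toy lexicon (assumption): surface form -> NCBI Taxonomy ID.
SPECIES_LEXICON = {
    "human": 9606,
    "homo sapiens": 9606,
    "mouse": 10090,
    "mus musculus": 10090,
    "yeast": 4932,  # Saccharomyces cerevisiae
}


def find_species(text):
    """Return (start, end, taxonomy_id) for each lexicon hit, sorted by offset."""
    hits = []
    lowered = text.lower()
    for name, tax_id in SPECIES_LEXICON.items():
        pattern = r"\b" + re.escape(name) + r"\b"
        for match in re.finditer(pattern, lowered):
            hits.append((match.start(), match.end(), tax_id))
    return sorted(hits)


def assign_species(text, gene_spans, default_tax_id=9606):
    """Link each gene span (start, end) to the nearest species mention.

    The nearest-mention rule and the default taxonomy ID are illustrative
    assumptions, not rules taken from the SR4GN paper.
    """
    species = find_species(text)
    assignments = {}
    for g_start, g_end in gene_spans:
        if not species:
            # No species mentioned anywhere: fall back to the assumed default.
            assignments[(g_start, g_end)] = default_tax_id
            continue
        # Character gap between the gene span and each species span.
        nearest = min(
            species,
            key=lambda s: min(abs(s[0] - g_end), abs(g_start - s[1])),
        )
        assignments[(g_start, g_end)] = nearest[2]
    return assignments


if __name__ == "__main__":
    sentence = "Mouse Trp53 is mutated, whereas human BRCA1 is not."
    genes = [(6, 11), (38, 43)]  # character spans of "Trp53" and "BRCA1"
    print(assign_species(sentence, genes))
```

In this sketch each gene mention inherits the taxonomy identifier of the closest species mention in the text. A real system would need a far larger lexicon and better disambiguation than simple character distance.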