Mining chemical information from open patents
Open Access
- 14 October 2011
- journal article
- Published by Springer Science and Business Media LLC in Journal of Cheminformatics
- Vol. 3 (1), 1-17
- https://doi.org/10.1186/1758-2946-3-40
Abstract
Linked Open Data presents an opportunity to vastly improve the quality of science in all fields by increasing the availability and usability of the data upon which it is based. In the chemical field, there is a huge amount of information available in the published literature, the vast majority of which is not available in machine-understandable formats. PatentEye, a prototype system for the extraction and semantification of chemical reactions from the patent literature, has been implemented and is discussed. A total of 4444 reactions were extracted from 667 patent documents that comprised 10 weeks' worth of publications from the European Patent Office (EPO), with a precision of 78% and recall of 64% with regard to determining the identity and amount of reactants employed, and an accuracy of 92% with regard to product identification. NMR spectra reported as product characterisation data are additionally captured.
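For readers unfamiliar with the evaluation terms in the abstract, the following is a minimal sketch of how precision and recall are conventionally defined. The counts used are hypothetical, chosen only so that the ratios match the reported 78% and 64%; they are not the evaluation counts from the paper. The F1 score is likewise derived here for illustration and is not a figure reported in the abstract.

```python
def precision(tp: int, fp: int) -> float:
    """Fraction of extracted items that are correct."""
    return tp / (tp + fp)


def recall(tp: int, fn: int) -> float:
    """Fraction of items that should have been extracted and were."""
    return tp / (tp + fn)


def f1(p: float, r: float) -> float:
    """Harmonic mean of precision and recall."""
    return 2 * p * r / (p + r)


# Hypothetical counts (assumption, not from the paper):
# 64 correct extractions, 18 spurious extractions, 36 missed items.
tp, fp, fn = 64, 18, 36

p = precision(tp, fp)  # 64 / 82
r = recall(tp, fn)     # 64 / 100

print(f"precision = {p:.2f}, recall = {r:.2f}")
print(f"F1 from the reported 0.78 and 0.64 = {f1(0.78, 0.64):.2f}")
```

A precision higher than recall, as reported here, indicates that the reactant assignments the system does make are more often right than not, while a larger share of the reactants present in the text goes uncaptured.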