Identifying Patient Smoking Status from Medical Discharge Records
Open Access
- 1 January 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Journal of the American Medical Informatics Association
- Vol. 15 (1), 14-24
- https://doi.org/10.1197/jamia.m2408
Abstract
Clinical narrative records contain much useful information. However, most clinical narratives are in the form of fragmented English free text, showing the characteristics of a clinical sublanguage. This makes their linguistic processing, search, and retrieval challenging.1 Traditional natural language processing (NLP) tools are not designed for the fragmented free text found in narrative clinical records; therefore, they do not perform well on this type of data.2 Limited access to clinical records has been a barrier to the widespread development of medical language processing (MLP) technologies. In the absence of a standardized, publicly available ground truth that encourages the development of MLP systems and allows their head-to-head comparison, successful MLP efforts have been limited, e.g., MedLEE3 and Symtxt.4 A few MLP systems have been developed,5 and such efforts have successfully shown the usefulness of MLP in clinical settings.6–8Keywords
This publication has 38 references indexed in Scilit:
- Identifying Smokers with a Medical Extraction SystemJournal of the American Medical Informatics Association, 2008
- Using Implicit Information to Identify Smoking Status in Smoke-blind Medical Discharge SummariesJournal of the American Medical Informatics Association, 2008
- Medical i2b2 NLP Smoking Challenge: The A-Life System Architecture and MethodologyJournal of the American Medical Informatics Association, 2008
- Mayo Clinic NLP System for Patient Smoking Status IdentificationJournal of the American Medical Informatics Association, 2008
- Five-way Smoking Status Classification Using Text Hot-Spot Identification and Error-correcting Output CodesJournal of the American Medical Informatics Association, 2008
- Evaluating the State-of-the-Art in Automatic De-identificationJournal of the American Medical Informatics Association, 2007
- Human and Automated Coding of Rehabilitation Discharge Summaries According to the International Classification of Functioning, Disability, and HealthJournal of the American Medical Informatics Association, 2006
- Advancing Biomedical Image Retrieval: Development and Analysis of a Test CollectionJournal of the American Medical Informatics Association, 2006
- Classification of Emergency Department Chief Complaints Into 7 Syndromes: A Retrospective Analysis of 527,228 PatientsAnnals of Emergency Medicine, 2005
- A Coefficient of Agreement for Nominal ScalesEducational and Psychological Measurement, 1960