Text fragment extraction using incremental evolving fuzzy grammar fragments learner
- 1 July 2010
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Additional structure within free texts can be utilized to assist in identification of matching items and can benefit many intelligent text pattern recognition applications. This paper presents an incremental evolving fuzzy grammar (IEFG) method that focuses on the learning of underlying text fragment patterns and provides an efficient fuzzy grammar representation that exploits both syntactic and semantic properties. This notion is quantified via (i) fuzzy membership which measures the degree of membership for a text fragment in a semantic grammar class and (ii) fuzzy grammar similarity which estimates the similarity between two grammars (iii) grammar combination which combines and generalizes the grammar at a minimal generalization. Terrorism incidents data from the United States World Incidents Tracking System (WITS) are used in experiments and presented throughout the paper. A comparison with regular expression methods is made in identification of text fragments representing times. The application of text fragment extraction using IEFG is demonstrated in event type, victim type, dead count and wounded count detection with WITS XML-tagged data used as golden standard. Results have shown the efficiency and practicality of IEFG.Keywords
This publication has 19 references indexed in Scilit:
- Order Independent Incremental Evolving Fuzzy Grammar Fragment LearnerPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2009
- Automatic generation of concept hierarchies using WordNetExpert Systems with Applications, 2008
- Incremental Evolution of Fuzzy Grammar Fragments to Enhance Instance Matching and Text MiningIEEE Transactions on Fuzzy Systems, 2008
- Structure-based inference of xml similarity for fuzzy duplicate detectionPublished by Association for Computing Machinery (ACM) ,2007
- A Semantic Approach to Discovering Schema Mapping ExpressionsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2007
- Automated ontology construction for unstructured text documentsData & Knowledge Engineering, 2007
- Evolution of Fuzzy Grammars to aid Instance MatchingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2006
- Probabilistic Data Generation for Deduplication and Data LinkageLecture Notes in Computer Science, 2005
- Learning Probabilistic Tree Grammars for Genetic ProgrammingLecture Notes in Computer Science, 2004
- Can a parser be generated from examples?Published by Association for Computing Machinery (ACM) ,2003