Unlocking Social Media and User Generated Content as a Data Source for Knowledge Management
- 1 January 2020
- journal article
- research article
- Published by IGI Global in International Journal of Knowledge Management
- Vol. 16 (1), 101-122
- https://doi.org/10.4018/ijkm.2020010105
Abstract
The pervasiveness of social media and user-generated content has triggered an exponential increase in global data. However, due to collection and extraction challenges, data in embedded comments, reviews and testimonials are largely inaccessible to a knowledge management system. This article describes a KM framework for the end-to-end knowledge management and value extraction from such content. This framework embodies solutions to unlock the potential of UGC as a rich, real-time data source. Three contributions are described in this article. First, a method for automatically navigating webpages to expose UGC for collection is presented. This is evaluated using browser emulation integrated with automated collection. Second, a method for collecting data without any a priori knowledge of the sites is introduced. Finally, a new testbed is developed to reflect the current state of internet sites and shared publicly to encourage future research. The discussion benchmarks the new algorithm alongside existing techniques, providing evidence of the increased amount of UGC data extracted.
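The abstract does not spell out how content is collected without a priori knowledge of a site, so the following is only a minimal sketch of one generic heuristic from the web data extraction literature, not the article's algorithm: look for the parent element whose direct children repeat the same tag and class most often, on the assumption that comments and reviews are rendered as repeated sibling blocks. The names `RepeatFinder` and `most_repeated_block` are hypothetical, and only the Python standard library is used.

```python
from collections import Counter
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not be pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class RepeatFinder(HTMLParser):
    """Counts, for every element, the (tag, class) signatures of its direct children."""

    def __init__(self):
        super().__init__()
        self.stack = []   # (tag, Counter of child signatures) for each open element
        self.groups = []  # (repeat count, signature) for each finished element

    def _record(self, children):
        # Keep only the most frequent child signature of a finished element.
        if children:
            signature, count = children.most_common(1)[0]
            self.groups.append((count, signature))

    def handle_starttag(self, tag, attrs):
        signature = (tag, dict(attrs).get("class") or "")
        if self.stack:
            self.stack[-1][1][signature] += 1
        if tag not in VOID_TAGS:
            self.stack.append((tag, Counter()))

    def handle_endtag(self, tag):
        if not any(open_tag == tag for open_tag, _ in self.stack):
            return  # stray closing tag: ignore it
        # Pop until the matching element, tolerating unclosed inner tags.
        while self.stack:
            open_tag, children = self.stack.pop()
            self._record(children)
            if open_tag == tag:
                break


def most_repeated_block(html):
    """Return (count, (tag, class)) for the most repeated sibling pattern, or None."""
    finder = RepeatFinder()
    finder.feed(html)
    finder.close()
    while finder.stack:  # flush elements that were never closed
        finder._record(finder.stack.pop()[1])
    return max(finder.groups, default=None)
```

Calling `most_repeated_block(page_source)` on a page whose reviews are sibling `<div class="review">` elements would point at that pattern without any site-specific selector. A real collector would also need the browser-emulation step the abstract mentions to expose content loaded by scripts, which this sketch does not attempt.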