KoRASA: Pipeline Optimization for Open-Source Korean Natural Language Understanding Framework Based on Deep Learning
Open Access
- 24 June 2021
- journal article
- research article
- Published by Hindawi Limited in Mobile Information Systems
- Vol. 2021, 1-9
- https://doi.org/10.1155/2021/9987462
Abstract
Since the emergence of deep learning-based chatbots for knowledge services, numerous research and development projects have been conducted in various industries. A high demand for chatbots has drastically increased the global market size; however, the limited functional scalability of open-domain chatbots is a challenge to their application to industries. Moreover, as most chatbot frameworks employ English, it is necessary to create chatbots customized for other languages. To address this problem, this paper proposes KoRASA as a pipeline-optimization method, which uses a deep learning-based open-source chatbot framework to understand the Korean language. KoRASA is a closed-domain chatbot that is applicable across a wide range of industries in Korea. KoRASAs operation consists of four stages: tokenization, featurization, intent classification, and entity extraction. The accuracy and F1-score of KoRASA were measured based on datasets taken from common tasks carried out in most industrial fields. The algorithm for intent classification and entity extraction was optimized. The accuracy and F1-score were 98.2 and 98.4 for intent classification and 97.4 and 94.7 for entity extraction, respectively. Furthermore, these results are better than those achieved by existing models. Accordingly, KoRASA can be applied to various industries, including mobile services based on closed-domain chatbots using Korean, robotic process automation (RPA), edge computing, and Internet of Energy (IoE) services.Keywords
Funding Information
- Korea Electric Power Corporation
This publication has 14 references indexed in Scilit:
- Chatbots: History, technology, and applicationsMachine Learning with Applications, 2020
- KBot: A Knowledge Graph Based ChatBot for Natural Language Understanding Over Linked DataIEEE Access, 2020
- Toward effective mobile encrypted traffic classification through deep learningNeurocomputing, 2020
- Xatkit: A Multimodal Low-Code Chatbot Development FrameworkIEEE Access, 2020
- Mobile Encrypted Traffic Classification Using Deep Learning: Experimental Evaluation, Lessons Learned, and ChallengesIEEE Transactions on Network and Service Management, 2019
- An Efficient Framework for Development of Task-Oriented Dialog Systems in a Smart Home EnvironmentSensors, 2018
- EMPOWERING CHATBOTS WITH BUSINESS INTELLIGENCE BY BIG DATA INTEGRATIONInternational Journal of Advanced Research in Computer Science, 2018
- Deep Learning in Natural Language ProcessingPublished by Springer Science and Business Media LLC ,2018
- Programming challenges of chatbot: Current and future prospectivePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2017
- An Introduction to Deep Learning for the Physical LayerIEEE Transactions on Cognitive Communications and Networking, 2017