KoRASA: Pipeline Optimization for Open-Source Korean Natural Language Understanding Framework Based on Deep Learning

Open Access

24 June 2021

journal article
research article
Published by Hindawi Limited in Mobile Information Systems

Vol. 2021, 1-9
https://doi.org/10.1155/2021/9987462

Abstract

Since the emergence of deep learning-based chatbots for knowledge services, numerous research and development projects have been conducted across industries. High demand for chatbots has sharply increased the global market size; however, the limited functional scalability of open-domain chatbots hinders their industrial application. Moreover, as most chatbot frameworks target English, chatbots customized for other languages are needed. To address this problem, this paper proposes KoRASA, a pipeline-optimization method that uses a deep learning-based open-source chatbot framework to understand the Korean language. KoRASA is a closed-domain chatbot that is applicable across a wide range of industries in Korea. KoRASA's operation consists of four stages: tokenization, featurization, intent classification, and entity extraction. The algorithms for intent classification and entity extraction were optimized, and the accuracy and F1-score of KoRASA were measured on datasets drawn from tasks common to most industrial fields. The accuracy and F1-score were 98.2 and 98.4 for intent classification and 97.4 and 94.7 for entity extraction, respectively. These results are better than those achieved by existing models. Accordingly, KoRASA can be applied to various industries, including mobile services based on closed-domain chatbots using Korean, robotic process automation (RPA), edge computing, and Internet of Energy (IoE) services.
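The four stages named in the abstract can be illustrated with a minimal sketch. This is not KoRASA's implementation: every function name, the toy training utterances, and the entity lexicon below are hypothetical. The whitespace tokenizer, bag-of-words featurizer, overlap-based classifier, and lexicon lookup are simple stand-ins for the Korean tokenizer and deep learning components that the paper optimizes.

```python
from collections import Counter

# Hypothetical toy training data: (utterance, intent). Not from the paper.
TRAINING = [
    ("회의실 예약 해줘", "book_room"),
    ("내일 회의실 예약", "book_room"),
    ("휴가 신청 하고 싶어", "request_leave"),
    ("휴가 신청 방법", "request_leave"),
]

# Hypothetical entity lexicon: surface form -> entity type.
ENTITY_LEXICON = {"내일": "date", "회의실": "place"}


def tokenize(text):
    """Stage 1: whitespace split (stand-in for a Korean morphological tokenizer)."""
    return text.split()


def featurize(tokens):
    """Stage 2: sparse bag-of-words counts."""
    return Counter(tokens)


def build_intent_profiles(examples):
    """Sum the bag-of-words features of each intent's training utterances."""
    profiles = {}
    for text, intent in examples:
        profiles.setdefault(intent, Counter()).update(featurize(tokenize(text)))
    return profiles


def classify_intent(features, profiles):
    """Stage 3: pick the intent whose profile overlaps most with the input."""
    def overlap(profile):
        return sum(min(count, profile[token]) for token, count in features.items())
    return max(profiles, key=lambda intent: overlap(profiles[intent]))


def extract_entities(tokens):
    """Stage 4: lexicon lookup (stand-in for a learned sequence tagger)."""
    return [(tok, ENTITY_LEXICON[tok]) for tok in tokens if tok in ENTITY_LEXICON]


def parse(text, profiles):
    """Run the four stages in order and return the intent and entities."""
    tokens = tokenize(text)
    features = featurize(tokens)
    return {
        "intent": classify_intent(features, profiles),
        "entities": extract_entities(tokens),
    }


profiles = build_intent_profiles(TRAINING)
result = parse("내일 회의실 예약 해줘", profiles)
print(result)
```

The point of the sketch is the data flow: tokens feed the featurizer, features feed the intent classifier, and the same tokens feed the entity extractor, so each stage can be swapped out independently when the pipeline is tuned.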

Funding Information

Korea Electric Power Corporation

