Abstract
Recognition of Unknown Entities in Specific Financial Field Based on ERNIE-Doc-BiLSTM-CRF: The Internet is rich in information related to the financial field. The financial entity information text containing new internet vocabulary has a certain impact on the results of existing recognition algorithms. How to solve the problems of new vocabulary and polysemy is a problem to be solved in the current field. This paper proposes an ERNIE-Doc-BiLSTM-CRF named entity recognition model based on the pretrained language model. Compared with the traditional model, the ERNIE-Doc pretrained language model constructs a unique word vector from the word vector and combines the location coding, which solves polysemy problem well. The intensive skimming mechanism realizes the long text processing well and captures the context information effectively. The experimental results show that the accuracy of this model is 86.72, the recall rate is 83.39, and the F1 value is 85.02, which is 13.36 higher than other models; the recall rate is increased by 13.05, and the F1 value is increased by 13.21.
Funding Information
  • Key R&D Projects in Shanxi Province (rh2100005181, rh2100005178, 203290929-j)

This publication has 7 references indexed in Scilit: