G.S. Ivanova1, P.A. Martynyuk2
1–2 Bauman Moscow State Technical University (Moscow, Russia)
Problem setting. There is a growing need to build tools into software systems that process text data automatically or semi-automatically. This need is driven by the rapidly growing volume of information presented as text, both within individual information systems and across the Internet. Advances in machine learning and deep learning have, in turn, made neural network models increasingly popular. This article analyzes the neural network models used to solve classical problems of natural language text processing.
Objective. To clarify, for each classical neural network language model, the range of natural language processing tasks it can solve.
Results. For each model considered, the architectural features and operating principles are described, and the strengths and weaknesses are highlighted. The relationship between model architectures and the range of tasks they solve is presented.
Practical significance. The results of the analysis can be of practical value to developers of text data processing systems. The article provides basic information about the most popular neural network models, which can help specialists choose a specific neural network architecture.
Ivanova G.S., Martynyuk P.A. Analysis of neural network language models for solving problems of text data processing. Neurocomputers. 2023. V. 25. № 2. P. 5–20. DOI: https://doi.org/10.18127/j19998554-202302-01 (In Russian)
