350 rub
Journal Nonlinear World №4 for 2024 г.
Article in number:
The impact of text preprocessing methods on the efficiency of machine learning models for employee loyalty analysis
Type of article: scientific article
DOI: 10.18127/j20700970-202404-13
UDC: 0519.254+519.673+004.942
Authors:

G.N. Zholobova1, E.D. Sinitsyna2

1,2 Financial University under the Government of the Russian Federation (Moscow, Russia)
1 gnzholobova@fa.ru, 2 221766@edu.fa.ru

Abstract:

Analyzing employee loyalty and the sentiment of their reviews is a crucial task for corporate analysts and HR specialists. The accuracy of these analyses directly depends on the quality of text data preprocessing, which is currently insufficiently studied and requires deeper attention. This study aims to substantiate methods for improving the quality of sentiment analysis and employee loyalty assessment by identifying the most effective preprocessing techniques for machine learning models such as BERT, DistilBERT, and RoBERTa. The research provides practical recommendations for text data preprocessing, including cleaning, normalization, stop-word removal, stemming, and lemmatization. It was demonstrated that optimal preprocessing can achieve an accuracy of up to 91% for the RoBERTa model. This research identified key text preprocessing methods that significantly enhance the accuracy of machine learning models in analyzing employee loyalty. Optimizing such preprocessing processes as text cleaning, normalization, stop-word removal, and decoding professional jargon allows for significantly improved sentiment analysis of employee reviews. These findings can be integrated into analytical systems used by HR specialists and corporate analysts to achieve more accurate sentiment and engagement analysis, ultimately contributing to better corporate culture management and increased overall employee satisfaction.

Pages: 102-107
For citation

Zholobova G.N., Sinitsyna E.D. The impact of text preprocessing methods on the efficiency of machine learning models for employee loyalty analysis. Nonlinear World. 2024. V. 22. № 4. P. 102–107. DOI: https://doi.org/10.18127/ j20700970-202404-13 (In Russian)

References
  1. Rezapova D.B. Vovlechennost' sotrudnikov v rabotu. Universitetskoe upravlenie: praktika i analiz. 2016. № 1 (101). URL: https://cyberleninka.ru/article/n/vovlechennost-sotrudnikov-v-rabotu (In Russian).
  2. Miriam Alzate, Marta Arce-Urriza, Javier Cebollada. Mining the text of online consumer reviews to analyze brand image and brand positioning. 2022. URL: https://www.sciencedirect.com/science/article/pii/S0969698922000820.
  3. Bunina E.V. Innovacionnye metody upravleniya personalom. 2019. URL: https://cyberleninka.ru/article/n/innovatsionnye-metody-upravleniya-personalom-1 (In Russian).
  4. Majed A. Alshamari. Evaluating User Satisfaction Using Deep-Learning-Based Sentiment Analysis for Social Media Data in Saudi Arabia’s Telecommunication Sector. 2023. URL: https://www.mdpi.com/2073-431X/12/9/170.
  5. Bakhshi A., Kumar K. Organizational Commitment as predictor of Organizational Citizenship Behavior. URL: https://core.ac.uk/download/pdf/234624049.pdf.
  6. Onuchin A. Vovlechennost' personala: ot izmereniya k upravleniyu. URL: https://www.ecopsy.ru/insights/vovlechennost-personala-ot-izmereniya-k-upravleniyu/ (In Russian).
  7. Qurvenianto H.A., Tobing D.S.K., Handriyono The Effect Of Employee Loyalty And Work Creativity On Work Performance Of Employee Through Career Development PT. Astra International, Tbk Isuzu Malang.
  8. Narariya Dita Handani, Angellie Williady, Hak-Seon Kim. An Analysis of Customer Textual Reviews and Satisfaction at Luxury Hotels in Singapore’s Marina Bay Area (SG-Clean-Certified Hotels). 2022. URL: https://www.mdpi.com/2071-1050/14/15/9382.
  9. Oehler K. Модель вовлеченности сотрудников Kincentric. Aon Hewitt. URL: https://axes.ru/articles/model-vovlechennosti-sotrudnikov-aon-hewitt/.
  10. Allen N.J., Meyer J.P. The measurement and antecedents of affective, continuance and normative commitment to the organization. Journal of occupational psychology/ 1990. V. 63(1). P. 1–18.
  11. Tom Young, Devamanyu Hazarika, Soujanya Poria, Erik Cambria. Recent Trends in Deep Learning Based Natural Language Processing. 2020.
Date of receipt: 02.10.2024
Approved after review: 14.10.2024
Accepted for publication: 29.10.2024