A.N. Golovenkov1
1 Federal State Autonomous Educational Institution of Higher Education «Saint Petersburg State University of Aerospace Instrumentation» (St. Petersburg, Russia)
1 aleksandr.golovenkov@mail.ru
Problem Statement. Markov chains are of interest across a wide range of applications and domains, such as telecommunications systems, remanufacturing and inventory systems, speech recognition, and more. In recent years, various models have been proposed using Markov chains for predicting multiple categorical data sequences, enabling the discovery of patterns in them. Objective: Conduct an analysis of the application of k-order Markov chains to categorical data analysis. Results: This paper presents a description of a mixed hidden Markov model that offers a solution to the problem of clustering uncertainty. In this model, instead of assigning individuals to c lusters, all data is used to estimate a mixture of hidden Markov models, where each individual belongs to each cluster with a certain probability. Special attention is given to a model adjustment me thod based on the Baum-Welch algorithm, which, together with a selection scheme, allows the use of partial labels in the data to improve the training of hidden Markov models. Practical Relevance. A model tuning method based on the Baum -Welch algorithm is presented. Together with a model selection scheme, this met hod enables the use of partial labels in data to improve the training of hidden Markov models. Further research would benefit from testing this approach to data and sequence analysis using Markov chains.
Golovenkov A.N. Application of k -order Markov chains to categorical data analysis. Dynamics of complex systems. 2026. V. 20. № 2. P. 55−63. DOI: 10.18127/j19997493-202602-06 (in Russian).
- Zaxarov V.M., S halagin S.V. Analiz psevdosluchajny`xposledovatel`nostejzadannoj dliny` po kriteriyu «e`ntropiyacepej Markova». Vestnik Dagestanskogo gosudarstvennogo universiteta. Ser. 1: Estestvenny`enauki. 2023. T. 38. № 2. S. 61–68.
- Arbekov I.M., Molotkov S.N. Kva ntovy`egeneratory` sluchajny`xchisel, e`kstrakciya dokazuemo sluchajny`xbitovy`xposledovatel`nostej iz traektorij cepi Markova. Uspexi fizicheskix nauk. 2024. T. 194. № 9. S. 974–993.
- William K. Schwartz, Sonja Petrović Longitudinal network models and permutation-uniform Markov chains. Scandinavian Journal of Statistics. 2022. V. 5 Iss. 3. R. 76–84.
- Bakaj G.A. O skorostisxodimosti vlokal`noj teoreme vosstanovleniya dlyamarkovskogo sluchajnogo bluzhdaniya. Matematicheskie zametki. 2024. T. 115. № 4. S. 521–532.
- Hamza Ruzayqat, Alexandros Beskos, Dan Crisan Sequential Markov chain Monte Carlo for Lagrangian dataassimilation with applications to unknowndata locations. Quarterly Journal of the Royal Meteorological Society. 2024. V. 1 5 Iss. 761. R. 120–129.
- Shalagin S.V. Reshenie zadachi kommivoyazhyora metodom statisticheskix ispy`tanij pri ispol`zovaniislozhny`xcepej Markova. Ucheny`ezapiski Kazanskogo universiteta. Ser.: Fiziko-matematicheskie nauki. 2024. T. 166. № 4. S. 639–650.
- Yunpei Chen, Dan Zhang, Qi Zhu Markov chainmodelling of ordered Rayleigh fadingchannels in non-orthogonal multiple access wireless networks. IET Signal Processing. 2023. V. 1 Iss. 3. R. 56–63.
- Paguba G.Yu., Pavlenko E.Yu. Generaciya vxodny`xdanny`x na osnovecepe j Markova dlyafazzinga programmnogo obespecheniya. Metody` itexnicheskie sredstva obespecheniya bezopasnosti informacii. 2022. № 31. S. 151–153.
- Xarin Yu.S., Shibalko S.A. Statisticheskij analizmnogomerny`xdvoichny`xvremenny`xryadov na osnovenejrose tevojmodeli. Doklady` Nacional`noj akademiinauk Belarusi. 2024. T. 68. № 4. S. 271–281. 1
- Han Cheng Lie, Daniel Rudolf, Björn Sprungk Dimension-independent Markov chain Monte Carlo on the sphere. Scandinavian Journal of Statistics. 2023. V. 5 Iss. 4. R. 209–212. 1
- Jin Wang, Matthew V. Tirrell, Juan J. de Pablo Parameter estimation for X-ray scattering analysis with Hamiltonian Markov Chain Monte Carlo. Journal of Synchrotron Radiation. 2022. V. 2 Iss. 3. R. 10–17. 1
- Susilo Hariyanto Y.D. Sumanto, Siti Khabibah Average-Based Fuzzy Time Series Markov Chain Based on Frequency Density Partitioning. Journal of Applied Mathematics. 2023. V. 2 0 2 Iss. 1. P. 33–39.

