350 rub
Journal Information-measuring and Control Systems №1 for 2025 г.
Article in number:
A study of the «normalization paradox» in factual relational databases on the PostgreSQL platform by
Type of article: scientific article
DOI: https://doi.org/10.18127/j20700814-202501-05
UDC: 004.652.42
Authors:

M.M. Matyushin1, M.Y. Konyshev2, O.V. Rudakova3, O.V. Tarakanov4

1–3 TsNIIMash JSC (Moscow region, Korolev, Russia)
4 FSUE NTC Orion (Moscow, Russia)
1 mccm@mcc.rsa.ru, 2 misha-kon@mail.ru, 3 rudakovaov@mcc.rsa.ru, 4 ole_g66@list.ru

Abstract:

The relational database relation normalization algorithm assumes sequential projection of the source table to meet the specific requirements of each normal form. The procedure for fetching data from a set of tables is based on their cascade connection, which, depending on the implementation (manufacturer and version of the database management system – DBMS), creates the need to perform recursive calculations. This means increased delays in the database when performing multi-tabular queries. The interpretation of the essence of the "normalization paradox" is that the need to ensure data integrity by normalizing relationships leads to a decrease in database performance. Testing the hypothesis that the query execution time with joining tables in a factographic relational database is correlated with the required depth of algorithm recursion due to the number of tables involved in query execution. The "normalization paradox" in a factographic relational database managed by the PostgreSQL database management system is investigated. The strength of the relationship between the degree of normalization of database tables and the time of issuing a query with subqueries depending on the depth of recursion has been established. The dependence of the issue time on the depth of recursion and on the volume of the factual database is studied separately. Based on the results of the correlation analysis, it was concluded that in databases based on the PostgreSQL v.12 database management system, the relationship between the query execution time and the depth of required recursion to generate output is weak, and the relationship between the output generation time and the database volume, determined by the number of registered instances of accounting objects, is lower than average. The conclusion is formulated about the inconsistency of the "normalization paradox" in formalized relational databases managed by PostgreSQL v.12, therefore, the unreasonableness of database denormalization as a way to increase its performance. The scientifically substantiated grounds for the necessity of bringing the relations of a factual relational database to the highest normal forms are formulated, regardless of the scope and method of its application. The results obtained for the PostgreSQL v.12 platform were not transposed to databases running other popular databases.

Pages: 41-51
For citation

Matyushin M.M., Konyshev M.Y., Rudakov O.V., Tarakanov O.V. A study of the «normalization paradox» in factual relational databases on the PostgreSQL platform by. Information-measuring and Control Systems. 2025. V. 23. № 1. P. 41−51. DOI: https://doi.org/ 10.18127/j20700814-202501-05 (in Russian)

References
  1. Karpuk A.A., Krasnoproshin V.V., Cikly` v strukturax funkcional`ny`x zavisimostej. International Journal of Open Information Technologies. M.: Laboratoriya otkry`ty`x informacionny`x texnologij fakul`teta VMK MGU im. M.V. Lomonosova. 2017. T. 5. № 7.
    S. 38–44.
  2. Marty`nov V.V., Proshin E.N., Podxod k analizu xarakteristik proizvoditel`nosti baz danny`x. Vestnik UGATU. Upravlenie VT i I. Ufa: UGATU. 2009. T. 13. № 2 (35). S. 198–205.
  3. Mendkovich N.A., Kuzneczov S.D., Ocenka e`ffektivnosti minimizacii ogranichenij zaprosov k SUBD / Trudy` Instituta sistemnogo programmirovaniya RAN. M.: Institut sistemnogo programmirovaniya RAN. 2013. T. 25. S. 113–130.
  4. Breshenkov A.V., Belous V.V., Preobrazovanie zapolnenny`x relyacionny`x tablicz ko vtoroj normal`noj forme. Nauka i obrazovanie. Inzhenernoe obrazovanie. E`lektronny`j zhurnal. 2007. № 2. E`lektronny`j resurs elibrary.ru/download/elibrary_9530726 _43884255.pdf
  5. Panchenko B.E., Issledovaniya domenno-klyuchevoj sxemy` relyacionnoj bazy` danny`x. Kibernetika i sistemny`j analiz. Kiev: Institut kibernetiki im. V.M. Glushkova NAN Ukrainy`. 2012. № 6. S. 157–172.
  6. Poolet M. 2002. Database Harmony. E`lektronny`j resurs SQL Server PRO sqlmag.com/business-intelligence/database-harmony (In English, unpubl.).
  7. Grigor`ev Yu.A., Gasov V.M., Ocenka srednego vremeni vy`polneniya soedineniya tablicz metodami NLJ i HJ v parallel`noj sisteme baz danny`x. Nauka i obrazovanie (e`lektronnoe nauchno-texnicheskoe izdanie). 2012. № 4 aprel`. E`lektronny`j resurs http://technomag.edu.ru/doc/380603.html
  8. Tarakanov O.V., Pikalova N.B., Tarakanova V.O., Issledovanie «paradoksa normalizacii» v faktograficheskix relyacionny`x bazax danny`x na platforme ORACLE. Nauchno-tex. vestnik informacionny`x texnologij, mexaniki i optiki. 2018. T. 18. № 4. S. 639–645. doi: 10.17586/2226-1494-2018-18-4-639-645.
  9. Bloxin V.G. i dr. Sovremenny`j e`ksperiment: podgotovka, provedenie, analiz rezul`tatov. M.: Radio i svyaz`. 1997. 232 s.
  10. Pin-Shan Chen P. 1976. The Entity-Relationship Model – Toward a Unified View of Data. ACM Transactions on Database Systems. V. 1. № 1. March. 9–36.
  11. Gmurman V.E. Teoriya veroyatnostej i matematicheskaya statistika. M.: Vy`sshaya shkola. 1972. 367 s.
  12. Pavlo A., Paulson E., Rasin A., Abadi D.J., DeWitt D.J., Madden S., Stonebraker M. A Comparison of Approaches to Large-Scale Data Analysis. SIGMOD '09: Proceedings of the 35th SIGMOD International Conference on Management of Data. 2009. New York, NY, USA. Р. 165–178.
  13. Grigor`ev Yu.A., Plutenko A.D., Analiz processa vy`polneniya zaprosa na soedinenie tablicz v strochnoj parallel`noj SUBD. Informatika i sistemy` upravleniya. Blagoveshhensk: Amurskij gosudarstvenny`j universitet. 2013. № 4 (38). S. 3–15.
  14. Grigor`ev Yu.A., Ustimov A.I. Sravnenie vremeni vy`polneniya zaprosa k xranilishhu danny`x v srede MapReduce/Hadoop i SUBD MySQL. Informatika i sistemy` upravleniya. Blagoveshhensk: Amurskij gosudarstvenny`j universitet. 2016. № 3 (49). S. 3–12.
  15. Leis V. et all. How good are query optimizers, really? Proceedings of the VLDB Endowment. 2015. V. 9. № 3. Р. 204–215.
  16. Dudakov N.S., Morozov P.A., Putyato S.A., Yakubovskij S.V., Model` sistemy` upravleniya xraneniem danny`x o vozdushnoj obstanovke pri reshenii vy`sokodinamichny`x zadach avtomatizacii upravleniya silami aviacii i PVO. Informacionno-izmeritel`ny`e i upravlyayushhie sistemy`. 2022. № 3. S. 4–8.
  17. Slepczova K.A., Komkov A.E., Kuzovlev V.I., Predstavlenie i obrabotka informacionny`x resursov dlya upravleniya deyatel`nost`yu predpriyatiya. Sovremenny`e naukoemkie texnologii. Penza: Izdatel`skij dom «Akademiya Estestvoznaniya». 2016. № 4-1. S. 57–61.
  18. Rogov E.V. PostgreSQL iznutri. M.: DMK Press. 2022. 660 s.
  19. Saenko I.B., Kuvaev V.O. Modeli i metodika ocenki i obespecheniya operativnoj dostupnosti resursov v edinom informacionnom prostranstve. Matematicheskie metody` v texnike i texnologiyax – MMIT. Saratov: Saratovskij gosudarstvenny`j texnicheskij universitet im. Yu.A. Gagarina. 2016. № 6 (88). S. 139–141.
  20. Kislyakov M.Yu., Logachev N.S., Petushkov A.M. Sistemno-texnicheskie aspekty` razvitiya NAKU KA NSE`N i izmerenij do 2025 goda. Raketno-kosmicheskoe priborostroenie i informacionny`e sistemy`. 2016. T. 3. № 1. S. 62–71.
Date of receipt: 19.09.2024
Approved after review: 01.10.2024
Accepted for publication: 14.01.2025