350 rub
Journal Science Intensive Technologies №5 for 2025 г.
Article in number:
Methods for error detection in streaming data of modern relational databases using Structured Query Language technology in large organizations
Type of article: scientific article
DOI: 10.18127/j19998465-202505-02
UDC: 004.628
Authors:

I.O. Shkokov1

1 AVIV Group Gmbh (Paris, France)

Abstract:

Modern relational databases (RDBs) serve as the backbone for storing and processing vast amounts of information used for business decision-making in large corporations. However, data quality in these systems can be compromised for various reasons, including input errors, failures in ETL (Extract, Transform, Load) processes, or data loss during transformation. In environments where there is a shortage of skilled data engineers, the responsibility for monitoring and maintaining data quality often falls on data analysts, highlighting the need for fast and effective methods to track and improve data quality.

Objective – the objective of this paper is to develop and present practical methods for monitoring data quality in relational databases using SQL. The methods described in the article are aimed at ensuring the reliability of decision-making based on data and can be applied in both private and public organizations.

The paper proposes three methods for monitoring data quality: 1. Creating BI dashboards to visualize data volume, which helps in promptly identifying data gaps or issues in the data loading process.2. Analyzing the completeness of columns in SQL tables, which assists in monitoring the presence of NULL or empty values that serve as indicators of data transformation issues. 3. Setting up alerts for poor data quality, which automatically notify data professionals about emerging data issues. Applying these methods in real-world commercial projects has demonstrated their effectiveness in quickly detecting errors and restoring data quality.

The proposed methods can significantly enhance data management processes in organizations by enabling prompt detection and resolution of data quality issues. This will improve the reliability and accuracy of business decisions and optimize the work of data professionals, particularly in environments with a shortage of skilled data engineers.

Pages: 17-25
For citation

Shkokov I.O. Methods for error detection in streaming data of modern relational databases using Structured Query Language technology in large organizations. Science Intensive Technologies. 2025. V. 26. № 5. P. 17−25. DOI: https://doi.org/ 10.18127/j19998465-202505-02 (in Russian)

References
  1. Khelifa N., Belbachir H., Belalem G. Time series big data: a survey on data stream frameworks, analysis and algorithms. Journal of Big Data. 2023. V. 10. Art. no. 75. https://doi.org/10.1186/s40537-023-00760-1.
  2. Bovee M., Srivastava R.P., Mak B. A conceptual framework and belief function approach to assessing overall information quality. In: Proceedings of the International Conference on Information Quality. Cambridge, MA. USA. 2001. P. 66–84. https://doi.org/10.1002/int.10074.
  3. Jarke M., Jeusfeld M.A., Quix C., Vassiliadis P. Architecture and quality in data warehouses: an extended repository approach. Information Systems. 1999. V. 24. Iss. 3. P. 229–253. https://doi.org/10.1016/S0306-4379(99)00017-4.
  4. Nelson H.J., Poels G., Genero M., Piattini M. A conceptual modeling quality framework. Software Quality Journal. 2012. V. 20. Iss. 1.
    P. 201–228. https://doi.org/10.1007/s11219-011-9136-9.
  5. Sidi F., Hassany P., Affendey L.S., Jabar M.A. Data quality: a survey of data quality dimensions. In: Proceedings of the 2012 International Conference on Information Retrieval & Knowledge Management (CAMP). Kuala Lumpur, Malaysia. 2012. P. 37–41. https://doi.org/10.1109/InfRKM.2012.6204995.
  6. Daniel F., Casati F., Palpanas T., Chayka O. Managing data quality in business intelligence applications. In: Proceedings of the International Workshop on Quality in Databases and Management of Uncertain Data. Auckland, New Zealand. 2008. P. 5–12. https://doi.org/10.1145/1452292.1452294.
  7. Baranov V.G., Misevich A.A., Sevryukov A.A., Suslov B.A., Sevryukov M.A., Alipova N.A. Primenenie metodov intellektual'nogo analiza dannyh v informacionno-analiticheskih sistemah monitoringa. Informacionno-izmeritel'nye i upravlyayushchie sistemy. 2011. T. 9. № 3. S. 38–42 (in Russian).
  8. Vasil'ev V.N., Knyaz'kov K.V., Churov T.N., Nasonov D.A., Mar'in S.V., Koval'chuk S.V., Buhanovskij A.V. CLAVIRE: oblachnaya platforma dlya obrabotki dannyh bol'shih ob"emov. Informacionno-izmeritel'nye i upravlyayushchie sistemy. 2012. T. 10. № 11. S. 7–16 (in Russian).
Date of receipt: 11.08.2025
Approved after review: 25.08.2025
Accepted for publication: 02.09.2025