CLASSIFICATION OF DATA TYPES USED IN BIG DATA ANALYTICS
UDC 681.518
Korolyov Artyom Andreevich – Senior Lecturer, the Department of Automation of Production Processes and Electrical Engineering. Belarusian State Technological University (13a Sverdlova str., 220006, Minsk, Republic of Belarus). E-mail: korolev@belstu.by
Karpovich Dzmitry Semenovich – PhD (Engineering), Associate Professor, Head of the Department of Automation of Production Processes and Electrical Engineering. Belarusian State Technological University (13a Sverdlova str., 220006, Minsk, Republic of Belarus). E-mail: d.karpovich@belstu.by
Fokin Timophej Pavlovich – teacher trainee, the Department of Automation of Production Processes and Electrical Engineering. Belarusian State Technological University (13a Sverdlova str., 220006, Minsk, Republic of Belarus). E-mail: fokin@belstu.by
DOI: https://doi.org/ 10.52065/2520-6141-2025-290-6.
Key words: big data, structured data, semi-structured data, unstructured data.
For citation: Korolyov A. A., Karpovich D. S., Fokin T. P. Classification of data types used in big data analytics. Proсeedings of BSTU, issue 3, Physics and Mathematics. Informatics, 2025, no. 1 (290), pp. 31–35 (In Russian). DOI: 10.52065/2520-6141-2025-290-6.
Abstract
The article is devoted to the main categories of data used in modern Big Data systems. The article classifies data into structured, semi-structured and unstructured with a description of the features of each type, an analysis of their key characteristics, and a comparison of their advantages and disadvantages. The article also describes big data, considers general concepts and features of big data, areas of their application, and sourcesof their appearance. An analysis of the main types of data used in Big Data systems is carried out by comparing them with each other according to such characteristics as flexibility, scalability, complexity of analysis, complexity of integration, type of data representation, storage efficiency, complexity of queries for data selection. The technologies that ensure the efficient use of various types of data within Big Data are considered, as well as the role of these technologies in improving the processes of processing and decision-making at enterprises. The importance of analyzing each of the existing data types for optimizing business operations is emphasized.
References
- Chen M., Mao S., Liu Y. Big data: A survey. Mobile Networks and Applications, 2014, vol. 19 (2), pp. 171–209. DOI: 10.1007/s11036-013-0489-0.
- Katal A., Wazid M., Goudar R. H. Big Data: Issues, challenges, tools and good practices. 6th International Conference on Contemporary Computing (IC3), 2013, pp. 404–409. DOI: 10.1109/IC3.2013.6612229.
- Chen C. P., Zhang C. Y. Data-intensive applications, challenges, techniques and technologies: A survey on Big Data. Information Sciences, 2014, vol. 275, pp. 314–347. DOI: 10.1016/j.ins.2014.01.015.
- Gandomi A., Haider M. Beyond the hype: Big data concepts, methods, and analytics. International Journal of Information Management, 2015, vol. 35 (2), pp. 137–144. DOI: 10.1016/j.ijinfomgt.2014.10.007.
- Papakonstantinou Y. Semistructured Models, Queries and Algebras in the Big Data Era. SIGMOD '16: Proceedings of the 2016 International Conference on Management of Data, 2016, pp. 2229–2233. DOI: 10.1145/2882903.2912573.
15.01.2025