Supercomputing Technologies as Drive for Development of Enterprise Information Systems and Digital Economy

Oleg V. Loginovsky, Alexander L. Shestakov, Alexander A. Shinkarev

Abstract


The article presents an analysis of approaches to the development of enterprise information systems that are in use today. One of the major trends that predetermines the agenda of information technology is the focus on parallel computing of large volumes of data using supercomputing technologies. The article considers the resulting ubiquitous move to distributed patterns of building enterprise information systems and avoiding monolithic architectures. The emphasis is placed on the importance of such fundamental characteristics of enterprise information systems as reliability, scalability, and maintainability. The article justifies the importance of machine learning in the context of effective big data analysis and competitive gain for business, vital for both maintaining a leading position in the market and surviving in conditions of global instability and digitalization of economy. Transition from storing the current state of a enterprise information system to storing a full log and history of all changes in the event stream is proposed as an instrument of achieving linearization of the data stream for subsequent parallel computing. There is a new view that is being shaped of specialists at the intersection of engineering and analytical disciplines, who would be able to effectively develop scalable systems and algorithms for data processing and integration of its results into company business processes.

Full Text:

PDF

References


Anderson, J.: Data Engineers vs. Data Scientist. https://www.oreilly.com/ideas/data-engineers-vs-data-scientists (2018), accessed: 2019-10-13

Belcham, D., Baley, K.: Brownfield application development in. NET. Manning Publications Co. (2010)

Brutlag, J.: Speed Matters for Google Web Search. http://googleresearch.blogspot.com/2009/06/speed-matters.html (2009), accessed: 2019-10-13

Buxton, B.: Big Data: the Next Google. https://www.nature.com/news/2008/080903/full/455008a.html (2008), accessed: 2019-10-13

Celesti, A., Fazio, M., Villari, M.: A study on join operations in MongoDB preserving collections data models for future internet applications. Future Internet 11(4), 83 (2019), DOI: 10.3390/fi11040083

Chamberlin, D.D.: Early history of SQL. IEEE Annals of the History of Computing 34(4), 78–82 (2012), DOI: 10.1109/MAHC.2012.61

Cielen, D., Meysman, A.D.B., Ali, M. (eds.): Introducing Data Science. Manning (2016)

Clarke, T.: Twenty Two Plus Instagram Stats That Marketers Can’t Ignore This Year. https://blog.hootsuite.com/instagram-statistics (2019), accessed: 2019-10-13

Codd, E.F.: Derivability, redundancy and consistency of relations stored in large data banks. IBM Research Report, San Jose, California RJ599 (1969)

Codd, E.F., Codd, S.B., Salley, C.T.: Providing OLAP (online analytical processing) to user-analysts: an it mandate (1992)

Conn, S.S.: OLTP and OLAP data integration: a review of feasible implementation methods and architectures for real time data analysis. In: Proceedings of the IEEE SoutheastCon 2005, 8-10 April 2005, Ft. Lauderdale, FL, USA. pp. 515–520 (2005), DOI: 10.1109/SECON.2005.1423297

Dedi, N., Stanier, C.: An evaluation of the challenges of multilingualism in data warehouse development. In: Proceedings of the 18th International Conference on Enterprise Information Systems, 25-28 April 2016, Rome, Italy. SCITEPRESS - Science and and Technology Publications (2016), DOI: 10.5220/0005858401960206

Drinkwater, D.: Does a Data Breach Really Affect Your Firm’s Reputation. https://www.csoonline.com/article/3019283/does-a-data-breach-really-affect-your-firm-s-reputation.html (2016), accessed: 2019-10-13

Everts, T.: The Real Cost of Slow Time vs Downtime. http://www.webperformancetoday.com/2014/11/12/real-cost-slow-time-vs-downtime-slides, accessed: 2019-10-13

Farber, F., May, N., Lehner, W., et al.: The SAP HANA database – an architecture overview. IEEE Data Eng. Bull. 35(1), 28–33 (2012), http://sites.computer.org/debull/A12mar/hana.pdf

Fowler, M., Sadalage, P.: The Future is Polyglot Persistence. https://martinfowler.com/articles/nosql-intro-original.pdf (2012), accessed: 2019-10-13

Fowler, M.: Patterns of enterprise application architecture. Addison-Wesley Longman Publishing Co., Inc. (2002)

Gaikwad, R.G., Goje, A.: SQL and NoSQL: Which is better. International Journal of Emerging Technologies and Innovative Research 2(8), 3277–3284 (2015), http://www.jetir.org/papers/JETIR1508005.pdf

Gepner, P., Kowalik, M.F.: Multi-core processors: New way to achieve high system performance. In: International Symposium on Parallel Computing in Electrical Engineering, 13-17 Sept. 2006, Bialystok, Poland. pp. 9–13 (2006), DOI: 10.1109/PARELEC.2006.54

Gray, J.: The transaction concept: Virtues and limitations. In: Proc. of the 7th Int. Conf. on Very Large Databases, 13-17 Sept. 1981, Cannes, France. pp. 144–154 (1981)

Grishina, A.: Big Data and Data Science Labor Market Survey. https://habr.com/company/newprolab/blog/320336 (2017), accessed: 2019-10-13

Hilbert, M., Lopez, P.: The world’s technological capacity to store, communicate, and compute information. Science 332(6025), 60–65 (2011), DOI: 10.1126/science.1200970

Jun, S.P., Yoo, H.S., Choi, S.: Ten years of research change using Google Trends: From the perspective of big data utilizations and applications. Technological Forecasting and Social Change 130, 69–87 (2018), DOI: 10.1016/j.techfore.2017.11.009

Kalske, M., Makitalo, N., Mikkonen, T.: Challenges when moving from monolith to microservice architecture. In: Current Trends in Web Engineering - ICWE 2017 International Workshops, Liquid Multi-Device Software and EnWoT, practi-O-web, NLPIT, SoWeMine, 5-8 June 2017, Rome, Italy, Revised Selected Papers. pp. 32–47 (2017), DOI: 10.1007/978-3-319-74433-9_3

Khine, P.P., Wang, Z.S.: Data lake: a new ideology in big data era. In: ITM Web of Conferences. vol. 17, p. 03025. EDP Sciences (2018), DOI: 10.1051/itmconf/20181703025

Kleppmann, M.: Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems. O’Reilly (2016), http://shop.oreilly.com/product/0636920032175.do

Kostenetskiy, P., Safonov, A.: SUSU supercomputer resources. In: Proc. of the 10th Annual Int. Scientific Conf. on Parallel Computing Technologies, PCT 2016, 29-31 March 2016, Arkhangelsk, Russia. CEUR Workshop Proceedings. vol. 1576, pp. 561–573 (2016)

Kostenetskiy, P., Semenikhina, P.: SUSU supercomputer resources for industry and fundamental science. In: 2018 Global Smart Industry Conference, GloSIC, 13-15 Nov. 2018, Chelyabinsk, Russia. pp. 1–7. IEEE (2018), DOI: 10.1109/GloSIC.2018.8570068

Larson, P., Clinciu, C., Fraser, C., et al.: Enhancements to SQL server column stores. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2013, 22-27 June 2013, New York, NY, USA. pp. 1159–1168 (2013), DOI: 10.1145/2463676.2463708

Li, C., Gu, J.: An integration approach of hybrid databases based on SQL in cloud computing environment. Software: Practice and Experience 49(3), 401–422 (2019), DOI: 10.1002/spe.2666

Linden, G.: Make data useful. https://www.scribd.com/doc/4970486/Make-Data-Useful-by-Greg-Linden-Amazon-com (2006), accessed: 2019-10-13

Liu, C.Y., Shie, M.R., Lee, Y.F., et al.: Vertical/horizontal resource scaling mechanism for federated clouds. In: 2014 International Conference on Information Science Applications, 6-9 May 2014, Seoul, South Korea

Namiot, D., Sneps-Sneppe, M.: On micro-services architecture. International Journal of Open Information Technologies 2(9), 24–27 (2014)

Naouali, S., Salem, S.B.: Towards reducing the multidimensionality of OLAP cubes using the evolutionary algorithms and factor analysis methods. CoRR abs/1602.04613 (2016), http://arxiv.org/abs/1602.04613

Novikov, D.A.: Big Data and Big Management. https://mipt.ipu.ru/sites/default/files/page_file/BigDataBigControl.pdf, accessed: 2019-10-13

Richter, J.: Architecting Distributed Cloud Applications. https://www.youtube.com/watch?v=xJMbkZvuVO0 (2017), accessed: 2019-10-13

Robinson, H.: The Elephant Was a Trojan Horse: On the Death of MapReduce at Google. https://www.datacenterknowledge.com/archives/2014/06/25/google-dumps-mapreduce-favor-new-hyper-scale-analytics-system (2014), accessed: 2019-10-13

Sadalage, P.: NoSQL distilled: a brief guide to the emerging world of polyglot persistence. Addison-Wesley, Upper Saddle River, NJ (2013)

Safaei, B., Monazzah, A.M., Bafroei, M., et al.: Reliability side-effects in Internet of Things application layer protocols. In: 2017 2nd International Conference on System Reliability and Safety, 20-22 Dec. 2017, Milan, Italy. pp. 207–212. IEEE (2017), DOI: 10.1109/ICSRS.2017.8272822

Singh, S.: Data warehouse and its methods. Journal of Global Research in Computer Science 2(5), 113–115 (2011)

Su, C.J., Huang, S.F.: Real-time big data analytics for hard disk drive predictive maintenance. Computers & Electrical Engineering 71, 93–101 (2018), DOI: 10.1016/j.compeleceng.2018.07.025

Unger, S.H.: Hazards, critical races, and metastability. IEEE Transactions on Computers 44(6), 754–768 (1995), DOI: 10.1109/12.391185

Viggiato, M., Terra, R., Rocha, H., et al.: Microservices in practice: A survey study (2018)

Vogels, W.: Eventually consistent. Communications of the ACM 52(1), 40 (2009), DOI: 10.1145/1435417.1435432

Simec, A., Maglii: Comparison of JSON and XML data formats. In: Central European Conference on Information and Intelligent Systems (2014)

Vyawahare, H., Karde, P., Thakare, V.M.: A hybrid database approach using graph and relational database. In: 2018 International Conference on Research in Intelligent and Computing in Engineering, 22-24 Aug. 2018, San Salvador, El Salvador. pp. 1–4 (2018), DOI: 10.1109/RICE.2018.8509057

Zhislina, V.: Why the frequency does not increase. https://software.intel.com/en-us/blogs/2014/02/19/why-has-cpu-frequency-ceased-to-grow (2014), accessed: 2019-10-13




Publishing Center of South Ural State University (454080, Lenin prospekt, 76, Chelyabinsk, Russia)