Enterprise Data Mesh Architectures for Scalable and Distributed Analytics

Authors

  • Dilliraja Sundar Independent Researcher, USA. Author

DOI:

https://doi.org/10.63282/3117-5481/AIJCST-V6I3P103

Keywords:

Enterprise Data Mesh, Distributed Analytics, Domain-Oriented Data Products, Self-Serve Data Platform, Event-Driven Ingestion

Abstract

Businesses are facing the volcanic increase in data volume, variety and dispersal in hybrid and multi-cloud setup, which is revealing the constraints of the centralized data warehouse, as well as the monolithic data lakes. Data mesh has become a socio-technical framework that devolves data ownership to domain teams with shared standards on interoperability, governance and security. The presented paper offers a reference enterprise data mesh architecture to meet scalable and distributed analytics. The architecture is designed to follow four pillars, namely domain-oriented data products, a self-service data and machine learning platform, standardized interoperability layer, and a federated computational governance model. Are outlining an implementation framework which is event-driven ingestion, API-first data product exposure, unified metadata management, distributed query capabilities and cross-domain orchestration. Based on the recent empirical findings and synthesized benchmarks, Present the discussion on the gains in the time-to-insight, data quality, reusability, and governance effectiveness and on the measures of scalability in the latency and the number of concurrent users. Other challenges such as organizational resistance, tooling fragmentation, and standards gaps are also analyzed in the paper and then the paper outlines future directions in AI-augmented governance and formal, data product specifications. The proposed model is an effective roadmap that enables enterprises to operationalize data mesh and achieve real-time, reliable analytics at scale by aligning organizational design and a strong layered architecture

References

[1] Jangam, S. K. (2023). Data Architecture Models for Enterprise Applications and Their Implications for Data Integration and Analytics. International Journal of Emerging Trends in Computer Science and Information Technology, 4(3), 91-100.

[2] Vlasiuk, Y., & Onyshchenko, V. (2023, March). Data mesh as distributed data platform for large enterprise companies. In International Conference on Computer Science, Engineering and Education Applications (pp. 183-192). Cham: Springer Nature Switzerland.

[3] Tamburri, D. A., Kazman, R., & Fahimi, H. (2022). On the relationship between organizational structure patterns and architecture in agile teams. IEEE Transactions on Software Engineering, 49(1), 325-347.

[4] Machado, I. A., Costa, C., & Santos, M. Y. (2022). Data mesh: concepts and principles of a paradigm shift in data architectures. Procedia Computer Science, 196, 263-271.

[5] Harby, A. A., & Zulkernine, F. (2022, December). From data warehouse to lakehouse: A comparative review. In 2022 IEEE international conference on big data (big data) (pp. 389-395). IEEE.

[6] Dehghani, Z. (2020, December 3). Data mesh principles and logical architecture. MartinFowler.com. Retrieved from https://martinfowler.com/articles/data-mesh-principles.html

[7] Ait Errami, S., Hajji, H., Ait El Kadi, K., & Badir, H. (2023). Spatial big data architecture: from data warehouses and data lakes to the Lakehouse. Journal of Parallel and Distributed Computing, 176, 70-79.

[8] Brataas, G., & Fægri, T. E. (2017, April). Agile scalability requirements. In Proceedings of the 8th ACM/SPEC on International Conference on Performance Engineering (pp. 413-416).

[9] Varma, Y. (2020). Secure Data Backup Strategies for Machine Learning: Compliance and Risk Mitigation Regulatory Requirements (GDPR, HIPAA, etc.). International Journal of Emerging Trends in Computer Science and Information Technology, 1(1), 29-38.

[10] Hjerppe, K., Ruohonen, J., & Leppänen, V. (2019, September). The general data protection regulation: requirements, architectures, and constraints. In 2019 IEEE 27th International Requirements Engineering Conference (RE) (pp. 265-275). IEEE.

[11] Dehghani, Z. (2020, December 3). Data mesh principles and logical architecture. Martin Fowler. Retrieved from https://martinfowler.com/articles/data-mesh-principles.html

[12] Araújo Machado, I., Costa, C., & Santos, M. Y. (2022, May). Advancing data architectures with data mesh implementations. In International Conference on Advanced Information Systems Engineering (pp. 10-18). Cham: Springer International Publishing.

[13] Wider, A., Verma, S., & Akhtar, A. (2023, July). Decentralized data governance as part of a data mesh platform: concepts and approaches. In 2023 IEEE International Conference on Web Services (ICWS) (pp. 746-754). IEEE.

[14] Dehghani, Z. (2020, December 3). Data mesh principles and logical architecture. Martin Fowler. Retrieved from https://martinfowler.com/articles/data-mesh-principles.html

[15] Mikhtoniuk, S., & Yalcin, O. N. (2021). Open Data Fabric: A decentralized data exchange and transformation protocol with complete reproducibility and provenance. arXiv. https://arxiv.org/abs/2111.06364

[16] Abraham, R., Schneider, J., & vom Brocke, J. (2019). Data governance: A conceptual framework, structured review, and research agenda. International Journal of Information Management, 49, 424–438. https://doi.org/10.1016/j.ijinfomgt.2019.07.008

[17] DeLone, W. H., & McLean, E. R. (2003). The DeLone and McLean model of information systems success: A ten year update. Journal of Management Information Systems, 19(4), 9–30.

[18] Jayaraman, P. P., Gomes, J. B., Nguyen, H. L., Abdallah, Z. S., Krishnaswamy, S., & Zaslavsky, A. (2015). Scalable energy-efficient distributed data analytics for crowdsensing applications in mobile environments. IEEE Transactions on Computational Social Systems, 2(3), 109-123.

[19] Dehghani, Z. (2020, December 3). Data mesh principles and logical architecture. Martin Fowler. Retrieved from https://martinfowler.com/articles/data-mesh-principles.html

[20] Elias, J. R., Chard, R., Levental, M., Liu, Z., Foster, I., & Chaudhuri, S. (2022). Real-Time Streaming and Event-driven Control of Scientific Experiments. arXiv preprint arXiv:2205.01476.

[21] Nangi, P. R., & Settipi, S. (2023). A Cloud-Native Serverless Architecture for Event-Driven, Low-Latency, and AI-Enabled Distributed Systems. International Journal of Emerging Research in Engineering and Technology, 4(4), 128–136. https://doi.org/10.63282/3050-922X.IJERET-V4I4P113

[22] Bhat, J. (2022). The Role of Intelligent Data Engineering in Enterprise Digital Transformation. International Journal of AI, BigData, Computational and Management Studies, 3(4), 106–114. https://doi.org/10.63282/3050-9416.IJAIBDCMS-V3I4P111

[23] Jayaram, Y. (2023). Data Governance and Content Lifecycle Automation in the Cloud for Secure, Compliance-Oriented Data Operations. International Journal of AI, BigData, Computational and Management Studies, 4(3), 124–133. https://doi.org/10.63282/3050-9416.IJAIBDCMS-V4I3P113

[24] Nangi, P. R. (2022). Multi-Cloud Resource Stability Forecasting Using Temporal Fusion Transformers. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 3(3), 123–135. https://doi.org/10.63282/3050-9262.IJAIDSML-V3I3P113

[25] Bhat, J., & Jayaram, Y. (2023). Predictive Analytics for Student Retention and Success Using AI/ML. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 4(4), 121–131. https://doi.org/10.63282/3050-9262.IJAIDSML-V4I4P114

[26] Jayaram, Y., & Sundar, D. (2022). Enhanced Predictive Decision Models for Academia and Operations through Advanced Analytical Methodologies. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 3(4), 113–122. https://doi.org/10.63282/3050-9262.IJAIDSML-V3I4P113

[27] Reddy Nangi, P., & Reddy Nala Obannagari, C. K. (2023). Scalable End-to-End Encryption Management Using Quantum-Resistant Cryptographic Protocols for Cloud-Native Microservices Ecosystems. International Journal of Emerging Trends in Computer Science and Information Technology, 4(1), 142–153. https://doi.org/10.63282/3050-9246.IJETCSIT-V4I1P116

[28] Bhat, J., Sundar, D., & Jayaram, Y. (2022). Modernizing Legacy ERP Systems with AI and Machine Learning in the Public Sector. International Journal of Emerging Research in Engineering and Technology, 3(4), 104–114. https://doi.org/10.63282/3050-922X.IJERET-V3I4P112

[29] Nangi, P. R., Obannagari, C. K. R. N., & Settipi, S. (2022). Self-Auditing Deep Learning Pipelines for Automated Compliance Validation with Explainability, Traceability, and Regulatory Assurance. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 3(1), 133–142. https://doi.org/10.63282/3050-9262.IJAIDSML-V3I1P114

[30] Jayaram, Y. (2023). Cloud-First Content Modernization: Migrating Legacy ECM to Secure, Scalable Cloud Platforms. International Journal of Emerging Research in Engineering and Technology, 4(3), 130–139. https://doi.org/10.63282/3050-922X.IJERET-V4I3P114

[31] Bhat, J. (2023). Automating Higher Education Administrative Processes with AI-Powered Workflows. International Journal of Emerging Trends in Computer Science and Information Technology, 4(4), 147–157. https://doi.org/10.63282/3050-9246.IJETCSIT-V4I4P116

[32] Nangi, P. R., Reddy Nala Obannagari, C. K., & Settipi, S. (2022). Predictive SQL Query Tuning Using Sequence Modeling of Query Plans for Performance Optimization. International Journal of AI, BigData, Computational and Management Studies, 3(2), 104–113. https://doi.org/10.63282/3050-9416.IJAIBDCMS-V3I2P111

[33] Jayaram, Y., & Bhat, J. (2022). Intelligent Forms Automation for Higher Ed: Streamlining Student Onboarding and Administrative Workflows. International Journal of Emerging Trends in Computer Science and Information Technology, 3(4), 100–111. https://doi.org/10.63282/3050-9246.IJETCSIT-V3I4P110

[34] Nangi, P. R., Obannagari, C. K. R. N., & Settipi, S. (2022). Enhanced Serverless Micro-Reactivity Model for High-Velocity Event Streams within Scalable Cloud-Native Architectures. International Journal of Emerging Research in Engineering and Technology, 3(3), 127–135. https://doi.org/10.63282/3050-922X.IJERET-V3I3P113

[35] Bhat, J. (2023). Strengthening ERP Security with AI-Driven Threat Detection and Zero-Trust Principles. International Journal of Emerging Trends in Computer Science and Information Technology, 4(3), 154–163. https://doi.org/10.63282/3050-9246.IJETCSIT-V4I3P116

[36] Jayaram, Y., Sundar, D., & Bhat, J. (2022). AI-Driven Content Intelligence in Higher Education: Transforming Institutional Knowledge Management. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 3(2), 132–142. https://doi.org/10.63282/3050-9262.IJAIDSML-V3I2P115

[37] Nangi, P. R., Reddy Nala Obannagari, C. K., & Settipi, S. (2023). A Multi-Layered Zero-Trust Security Framework for Cloud-Native and Distributed Enterprise Systems Using AI-Driven Identity and Access Intelligence. International Journal of Emerging Trends in Computer Science and Information Technology, 4(3), 144–153. https://doi.org/10.63282/3050-9246.IJETCSIT-V4I3P115

[38] Jayaram, Y., & Sundar, D. (2023). AI-Powered Student Success Ecosystems: Integrating ECM, DXP, and Predictive Analytics. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 4(1), 109–119. https://doi.org/10.63282/3050-9262.IJAIDSML-V4I1P113

[39] Bhat, J., & Sundar, D. (2022). Building a Secure API-Driven Enterprise: A Blueprint for Modern Integrations in Higher Education. International Journal of Emerging Research in Engineering and Technology, 3(2), 123–134. https://doi.org/10.63282/3050-922X.IJERET-V3I2P113

Downloads

Published

2024-05-10

Issue

Section

Articles

How to Cite

[1]
D. Sundar, “Enterprise Data Mesh Architectures for Scalable and Distributed Analytics”, AIJCST, vol. 6, no. 3, pp. 24–35, May 2024, doi: 10.63282/3117-5481/AIJCST-V6I3P103.

Similar Articles

111-119 of 119

You may also start an advanced similarity search for this article.