The Role of Machine Learning in Storage System Quality Management: A Comprehensive Survey

Authors

  • Prasanth Varma Addepalli Lead Data Architect/ Engineer, Federal Motor Carrier Safety Administration, Atlanta, Georgia. Author
  • Sridhar Reddy Bandaru Discover Financial Services, Application Architect for AI/ ML Platforms. Author
  • Dhuli Shyam Business Application, IT, Nagase Holdings America Corp, Manager, Application & Software Development, NYC, NY. Author
  • Prabu Manoharan Information Technology, University or Client: Bourns Inc, HRIS Manager, California, USA. Author
  • Muzaffer Hussain Syed Director of IT Projects & Programs, Powersys Inc. Author
  • Uday Kumar Ragireddy Sr Technical Program Manager, Vdrive IT Solutions, Inc, Richardson, Texas. Author

DOI:

https://doi.org/10.63282/3117-5481/AIJCST-V6I3P105

Keywords:

Machine Learning, Storage System, Quality Management, Data Integrity, Predictive Maintenance, Cloud Computing, Federated Learning

Abstract

The fast growth of cloud computing, big data analytics, and distributed applications has made the modern storage systems remarkably more complex, and quality management has become a serious issue. The Quality Management of Storage Systems (SQM) is expected to guarantee efficient, reliable, secure, and scalable data storage services in accordance with the Quality of Service (QoS) and Service Level Agreement (SLA) conditions. Although useful, traditional quality management methods tend to be ineffective in managing the dynamic, heterogeneous, and large size storage infrastructures of the modern world. The paper provides an overview of the literature on utilizing machine learning (ML) to improve the quality management of storage system. It talks about the main ML paradigms such as supervised, unsupervised, and reinforcement learning and examines how all the three can be applied to monitoring storage, performance optimization, fault detection, and automated decision-making. In addition to that, the paper discusses ML-enabled quality improvement activities as testing, monitoring, debugging, and data evaluation, and security and data integrity features to deal with intrusion detection, data corruption, ransomware attacks, and adaptive access control. The survey identifies the benefits, issues, and research directions of the ML-based SQM and the potential of the system to facilitate intelligent, proactive and autonomous quality control in the next-generation storage systems.

References

[1] A. Chakarov, A. Nori, S. Rajamani, S. Sen, and D. Vijaykeerthy, “Debugging machine learning tasks,” arXiv Prepr. arXiv1603.07292, 2016.

[2] A. Kushwaha, P. Pathak, and S. Gupta, “Review of optimize load balancing algorithms in cloud,” Int. J. Distrib. Cloud Comput., vol. 4, no. 2, pp. 1–9, 2016.

[3] Z. Cao, V. Tarasov, S. Tiwari, and E. Zadok, “Towards Better Understanding of Black-box Auto-Tuning A Comparative Analysis for Storage Systems,” in 2018 USENIX Annual Technical Conference (USENIX ATC 18), Boston, MA: USENIX Association, Jul. 2018, pp. 893–907.

[4] W. Wang, J. Yang, L. Huang, D. Proverbs, and J. Wei, “Intelligent Storage Location Allocation with Multiple Objectives for Flood Control Materials,” Water, vol. 11, no. 8, 2019, doi: 10.3390/w11081537.

[5] J. Hurwitz and D. Kirsch, “Machine Learning For Dummies®, IBM Limited Edition,” IBM Ltd. Ed., vol. 75, pp. 9780429196645--6, 2018.

[6] N. Polyzotis, M. Zinkevich, S. Roy, E. Breck, and S. Whang, “Data validation for machine learning,” Proc. Mach. Learn. Syst., vol. 1, pp. 334–347, 2019.

[7] L. Ma et al., “Deepmutation: Mutation testing of deep learning systems,” in 2018 IEEE 29th international symposium on software reliability engineering (ISSRE), 2018, pp. 100–111.

[8] S. Garg, “Predictive Analytics and Auto Remediation using Artificial Inteligence and Machine learning in Cloud Computing Operations,” Int. J. Innov. Res. Eng. Multidiscip. Phys. Sci., vol. 7, no. 2, 2019, doi: 10.5281/zenodo.15362327.

[9] J. Zhang, Y. Wang, P. Molino, L. Li, and D. S. Ebert, “Manifold: A Model-Agnostic Framework for Interpretation and Diagnosis of Machine Learning Models,” IEEE Trans. Vis. Comput. Graph., vol. 25, no. 1, pp. 364–373, Jan. 2019, doi: 10.1109/TVCG.2018.2864499.

[10] G. Barash, E. Farchi, I. Jayaraman, O. Raz, R. Tzoref-Brill, and M. Zalmanovici, “Bridging the gap between ML solutions and their business requirements using feature interactions,” in Proceedings of the 2019 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Aug. 2019, pp. 1048–1058. doi: 10.1145/3338906.3340442.

[11] J. Dolby, A. Shinnar, A. Allain, and J. Reinen, “Ariadne: analysis for machine learning programs,” in Proceedings of the 2nd ACM SIGPLAN International Workshop on Machine Learning and Programming Languages, New York, NY, USA: ACM, Jun. 2018, pp. 1–10. doi: 10.1145/3211346.3211349.

[12] A. Gosain and G. Sharma, “Static Analysis: A Survey of Techniques and Tools,” in Intelligent Computing and Applications, New Delhi: Springer India, 2015, pp. 581–591.

[13] K. Pei, Y. Cao, J. Yang, and S. Jana, “DeepXplore: automated whitebox testing of deep learning systems,” Commun. ACM, vol. 62, no. 11, pp. 137–145, Oct. 2019, doi: 10.1145/3361566.

[14] J. Sekhon and C. Fleming, “Towards improved testing for deep learning,” in 2019 IEEE/ACM 41st International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER), 2019, pp. 85–88.

[15] L. Ma et al., “Combinatorial Testing for Deep Learning Systems,” 2018.

[16] S. Ackerman, O. Raz, M. Zalmanovici, and A. Zlotnick, “Automatically detecting data drift in machine learning based classifiers,” in The AAAI-19 Workshop on Engineering Dependable and Secure Machine Learning Systems Software Engineering for Machine Learning (EDSMLS 2019), 2019.

[17] V. M. L. G. Nerella, “Automated cross-platform database migration and high availability implementation,” Turkish J. Comput. Math. Educ., vol. 9, no. 2, pp. 823–835, 2018.

[18] S. Nidhra, “Black Box and White Box Testing Techniques - A Literature Review,” Int. J. Embed. Syst. Appl., vol. 2, no. 2, pp. 29–50, Jun. 2012, doi: 10.5121/ijesa.2012.2204.

[19] S. Garg, “AI/ML Driven Proactive Performance Monitoring, Resource Allocation and Effective Cost Management in SaaS Operations,” Int. J. Core Eng. Manag., vol. 6, no. 6, pp. 263–273, 2019.

[20] M. Zamzam, T. Elshabrawy, and M. Ashour, “Resource Management using Machine Learning in Mobile Edge Computing: A Survey,” in Proceedings - 2019 IEEE 9th International Conference on Intelligent Computing and Information Systems, ICICIS 2019, 2019. doi: 10.1109/ICICIS46948.2019.9014733.

[21] C. Keerthisinghe, A. C. Chapman, and G. Verbic, “Energy Management of PV-Storage Systems: Policy Approximations Using Machine Learning,” IEEE Trans. Ind. Informatics, vol. 15, no. 1, pp. 257–265, Jan. 2019, doi: 10.1109/TII.2018.2839059.

[22] D. M. J. Nestor and K. A. Ogudo, “Practical Implementation of Machine Learning and Predictive Analytics in Cellular Network Transactions in Real Time,” in 2018 International Conference on Advances in Big Data, Computing and Data Communication Systems (icABCD), IEEE, Aug. 2018, pp. 1–10. doi: 10.1109/ICABCD.2018.8465476.

[23] S. O. Aliyu, F. Chen, and Y. He, “QoS-Aware Resource Management in SDN-Based InterClouds: A Software Cybernetics Perspective,” in 2017 IEEE International Conference on Software Quality, Reliability and Security Companion (QRS-C), IEEE, Jul. 2017, pp. 429–434. doi: 10.1109/QRS-C.2017.77.

[24] I. G. Ben Yahia, J. Bendriss, A. Samba, and P. Dooze, “CogNitive 5G networks: Comprehensive operator use cases with machine learning for management operations,” in 2017 20th Conference on Innovations in Clouds, Internet and Networks (ICIN), IEEE, Mar. 2017, pp. 252–259. doi: 10.1109/ICIN.2017.7899421.

[25] Mamidala, J. V., Enokkaren, S. J., Attipalli, A., Bitkuri, V., Kendyala, R., & Kurma, J. (2023). Machine Learning Models Powered by Big Data for Health Insurance Expense Forecasting. International Research Journal of Economics and Management Studies IRJEMS, 2(1).

[26] Nadella, V. M. (2023). Zero Trust Architecture for Telecom Operations. International Journal of Emerging Research in Engineering and Technology, 4(3), 115-129.

[27] Bitkuri, V., Kendyala, R., Kurma, J., Enokkaren, S. J., & Mamidala, J. V. (2023). Forecasting Stock Price Movements With Deep Learning Models for time Series Data Analysis. Journal of Artificial Intelligence & Cloud Computing. SRC/JAICC-531. DOI: doi. org/10.47363/JAICC/2023 (2), 489, 2-9.

[28] Nadella, V. M. (2023). Anomaly Detection and Fault Prediction using ML in Telecom Operations. International Journal of Emerging Trends in Computer Science and Information Technology, 4(3), 134-143.

[29] Kosaraju, P., & Nadella, V. M. (2022). Security and Privacy in IoT Ecosystems. Universal Library of Engineering Technology, (Issue).

[30] Singh, A. A. S. S., Mania, V., Kothamaram, R. R., Rajendran, D., Namburi, V. D. N., & Tamilmani, V. (2023). Exploration of Java-Based Big Data Frameworks: Architecture, Challenges, and Opportunities. Journal of Artificial Intelligence & Cloud Computing, 2(4), 1-8.

[31] Routhu, K. K. (2023). AI-driven succession planning in Oracle HCM Cloud: Building resilient leadership pipelines through predictive analytics. International Journal of Science, Engineering and Technology, 11(5).

[32] Tamilmani, V., Namburi, V. D., Singh Singh, A. A., Maniar, V., Kothamaram, R. R., & Rajendran, D. (2023). Real-Time Identification of Phishing Websites Using Advanced Machine Learning Methods. Available at SSRN 5837142.

[33] Routhu, K. K. (2023). AI-driven succession planning in Oracle HCM Cloud: Building resilient leadership pipelines through predictive analytics. International Journal of Science, Engineering and Technology, 11(5). https://doi.org/10.5281/zenodo.17292018

[34] From Fragmentation to Focus: The Benefits of Centralizing Procurement. (2023). International Journal of Research and Applied Innovations, 6(6), 9820-9833. https://doi.org/10.15662/

[35] Routhu, K. K. (2023). Embedding fairness into the digital enterprise, data driven DEI strategies with Oracle HCM Analytics. International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 9(8), 266-274.

[36] Routhu, K. K. (2023). AI-driven skills forecasting in Oracle HCM Cloud: From static competencies to predictive workforce design. International Journal of Science, Engineering and Technology, 11(1).

[37] Padur, S. K. R. (2023). AI-Augmented Enterprise ERP Modernization: Zero-Downtime Strategies for Oracle E-Business Suite R12. 2 and Beyond. Available at SSRN 5605510.

[38] Routhu, K. K. (2022). From Case Management to Conversational HR: Redefining Help Desks with Oracle’s AI and NLP Framework. International Journal of Science, Engineering and Technology, 10(6).

[39] Vattikonda, N., Gupta, A. K., Polu, A. R., Narra, B., Buddula, D. V. K. R., & Patchipulusu, H. H. S. (2022). Blockchain Technology in Supply Chain and Logistics: A Comprehensive Review of Applications, Challenges, and Innovations. International Journal of Emerging Trends in Computer Science and Information Technology, 3(3), 72-80.

[40] Attipalli, A., BITKURI, V., Mamidala, J. V., Kendyala, R., & KURMA, J. (2022). Empowering Cloud Security with Artificial Intelligence: Detecting Threats Using Advanced Machine learning Technologies. Available at SSRN 5741263.

[41] Padur, S. K. R. (2022). Intelligent resource management: AI methods for predictive workload forecasting in cloud data centers. J. Artif. Intell. Mach. Learn. & Data Sci, 1(1), 2936-2941.

[42] Nadella, V. M. (2022). Digital Twins for Predictive Network Management and System Simulation. International Journal of AI, BigData, Computational and Management Studies, 3(3), 100-111.

[43] Routhu, K. K. (2022). From RFID to Geofencing: IoT-Enabled Smart Time Tracking in Oracle HCM Cloud. International Journal of Science, Engineering and Technology, 10(4).

[44] Nadella, V. (2019). Extracting road traffic data through video analysis using automatic camera calibration and deep neural networks.

[45] Polam, R. M., Kamarthapu, B., Kakani, A. B., Nandiraju, S. K. K., Chundru, S. K., & Vangala, S. R. (2022). Data Security in Cloud Computing: Encryption, Zero Trust, and Homomorphic Encryption. International Journal of Emerging Trends in Computer Science and Information Technology, 3(4), 31-41.

[46] Padur, S. K. R. (2022). AI augmented platform engineering, transforming developer experience through intelligent automation and self optimizing internal platforms. International Journal of Science, Engineering and Technology, 10(5), 10-5281.

[47] Kosaraju, P. , & Nadella, V. M. (2021). Quality of Experience (QoE) and Network Performance Modelling for Multimedia Traffic. Journal of Artificial Intelligence and Big Data, 1(1), 1-13. https://doi.org/10.31586/jaibd.2021.1358.

Downloads

Published

2024-05-13

Issue

Section

Articles

How to Cite

[1]
P. V. Addepalli, S. R. Bandaru, D. Shyam, P. Manoharan, M. H. Syed, and U. K. Ragireddy, “The Role of Machine Learning in Storage System Quality Management: A Comprehensive Survey”, AIJCST, vol. 6, no. 3, pp. 48–57, May 2024, doi: 10.63282/3117-5481/AIJCST-V6I3P105.

Most read articles by the same author(s)

Similar Articles

41-50 of 163

You may also start an advanced similarity search for this article.