AI Driven Operational Dashboards for Realtime Monitoringand ‎Crisis Decision Support in IT Systems

Authors

  • Nikhil Singla The Harrisburg University of Science and Technology, Harrisburg, PA, USA

DOI:

https://doi.org/10.14419/qnk7fn90

Published

19-05-2026

Keywords:

AI Driven Dashboards; Real Time Monitoring; Predictive Analytics; Machine Learning; AIOps

Abstract

In today’s digital first landscape, enterprise IT systems form the backbone of mission critical ‎operations, demanding resilience, reliability, and rapid crisis response. The complexity of ‎distributed, cloudnative, and IoTdriven environments has rendered traditional monitoring tools ‎inadequate for ensuring uninterrupted services. AI driven operational dashboards represent a ‎paradigm shift by combining realtime data visualization with advanced machine learning, natural ‎language processing, and predictive analytics. Unlike conventional dashboards that passively ‎display metrics, these intelligent platforms actively detect anomalies, forecast failures, and ‎recommend or even automate remedial actions. This reduces mean time to detect (MTTD) and ‎mean time to resolve (MTTR), providing enterprises with faster, more informed decisionmaking ‎during crises such as outages, cyberattacks, or traffic surges.Case studies of prominent industries ‎show quantifiable enhancements in anomaly detection accuracy, alert noise reduction, and ‎operational efficiency. There are obstacles, however, including data complexity of integration, ‎explanation holes, model shift, and organizational resistance. The review integrates existing ‎technological bricks, experimentation findings, and industrial practices alongside identifying ‎research gaps and limitations. In delineating a theoretical framework and future research directions, ‎it substantiates the imperative of AI driven dashboards as adaptive, reliable, and scalable solutions ‎for enterprise IT resilience in a more dynamic world of operation‎.

References

Biggio, B., & Roli, F. (2018). Wild patterns: Ten years after the rise of adversarial machine learning. Pattern Recognition, 84, 317–331. https://doi.org/10.1016/j.patcog.2018.07.023.

Sharma, P., & Krishnan, R. (2021). Realtime anomaly detection and mitigation using AI in IT infrastructures. ACM Computing Surveys, 54(2), 1–35.

Beyer, B., Jones, C., Petoff, J., & Murphy, N. R. (2016). Site Reliability Engineering: How Google Runs Production Systems. O’Reilly Media.

Moogsoft. (2023). Case Studies: Uber, HCL, and American Airlines streamline IT operations with Moogsoft AIOps. Moogsoft.com.

Ribeiro, M. T., Singh, S., & Guestrin, C. (2016). "Why Should I Trust You?": Explaining the predictions of any classifier. Proceedings of the 22nd ACM SIGKDD, 1135–1144. https://doi.org/10.1145/2939672.2939778.

Bélisle-Pipon, J.-C. (2025). Commentary: Implications of causality in artificial intelligence. Why causal AI is easier said than done. Frontiers in Artifi-cial Intelligence, 7, 1488359. https://doi.org/10.3389/frai.2024.1488359.

Armbrust, M., Das, T., Zhu, S., & Xin, R. (2021). Lakehouse: A new generation of open platforms that unify data warehousing and advanced analyt-ics. Communications of the ACM, 64(9), 56–65.

Kreps, J., Narkhede, N., & Rao, J. (2011). Kafka: A Distributed Messaging System for Log Processing. Proceedings of the NetDB, 11, 1–7.

Giebler, C., Gruschka, N., & Jensen, M. (2019). RealTime Stream Processing in CloudNative Data Pipelines. Future Generation Computer Systems, 95, 337–349.

Sculley, D., Holt, G., Golovin, D., Davydov, E., Phillips, T., Ebner, D., ... & Dennison, D. (2015). Hidden Technical Debt in Machine Learning Sys-tems. Advances in Neural Information Processing Systems, 28.

Moogsoft. (2023). Case Studies: RealTime AI in Incident Detection. Moogsoft.com.

Raja, K. V., Siddharth, R., Yuvaraj, S., & Ramesh Kumar, K. A. (2024). An Artificial Intelligence based automated case-based reasoning (CBR) sys-tem for severity investigation and root-cause analysis of road accidents: Comparative analysis with the predictions of ChatGPT. Journal of Engineer-ing Research, 12(4), 895–903. https://doi.org/10.1016/j.jer.2023.09.019.

Beyer, B., Jones, C., Petoff, J., & Murphy, N. R. (2016). Site Reliability Engineering: How Google Runs Production Systems. O’Reilly Media.

Davis, C. R., Murphy, K. J., Curtis, R. G., & Maher, C. A. (2020). A process evaluation examining the performance, adherence, and acceptability of a physical activity and diet artificial intelligence virtual health assistant. International Journal of Environmental Research and Public Health, 17(23), 9137. https://doi.org/10.3390/ijerph17239137.

Ribeiro, M. T., Singh, S., & Guestrin, C. (2016). "Why Should I Trust You?": Explaining the Predictions of Any Classifier. Proceedings of the 22nd ACM SIGKDD, 1135–1144. https://doi.org/10.1145/2939672.2939778.

Sami, M. A., Rehman, A., Ahmad, Z., & Bano, N. (2025). Explainable AIOps: A deep survey on trustworthy and transparent AI in cloudscale DevOps automation. Spectrum of Engineering Sciences, 3(7), 488–507.

Beyer, B., Jones, C., Petoff, J., & Murphy, N. R. (2016). Site reliability engineering: How Google runs production systems. O’Reilly Media.

Giebler, C., Gruschka, N., & Jensen, M. (2019). Realtime stream processing in cloudnative data pipelines. Future Generation Computer Systems, 95, 337–349.

Min, S., & Kim, B. (2024). Adopting artificial intelligence technology for network operations in digital transformation. Admsci, 14(4), 70. https://doi.org/10.3390/admsci14040070.

Carloni, G., Berti, A., & Colantonio, S. (2025). The role of causality in explainable artificial intelligence. Wiley Interdisciplinary Reviews: Data Min-ing and Knowledge Discovery. Advance online publication. https://doi.org/10.1002/widm.70015.

Le, H.S., Tran, Q.T., & Thuan, N. H. (2025). A proposal of leveraging causal AI for enhancing machine learning applications in information systems. In N. H. Thuan, D. P. Duy, H.S. Le, & T. Q. Phan (Eds.), Information Systems Research in Vietnam, Volume 3 (pp. 137148). Springer. https://doi.org/10.1007/978-981-97-9835-3_9.

Folabi, J. A. (2025). Harnessing predictive analytics and machine learning for minority business resilience, crisis management, and competitive ad-vantage. International Journal of Research Publication and Reviews, 6(4), 1810–1827. https://doi.org/10.55248/gengpi.6.0425.1370.

Rajkumar, P., & Prabavathy, P. (2023). Telemedicine monitoring system based on fog/edge computing: A survey. Proceedings of the IEEE (or ap-propriate conference/IEEE journal), Article 10772317. IEEE.

Tejwani, R., Moreno, F., Jeong, S., Park, H. W., & Breazeal, C. (2020). Migratable AI: Effect of identity and information migration on users’ percep-tion of conversational AI agents. In 2020 29th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN) (pp. 877–884). IEEE. https://doi.org/10.1109/RO-MAN47096.2020.9223436.

How to Cite

Singla, N. . (2026). AI Driven Operational Dashboards for Realtime Monitoringand ‎Crisis Decision Support in IT Systems. International Journal of Basic and Applied Sciences, 14(8), 685-692. https://doi.org/10.14419/qnk7fn90

Downloads