Predictive OMS Switchover towards Proactive Disaster Recovery in 5G Networks

  • Charles F. Santos IFRN / UFRN
  • Augusto V. Neto UFRN
  • Ramon R. Fontes UFRN
  • Roger Immich UFRN
  • Vicente Sousa Jr. UFRN
  • Helber W. da Silva IFRN / UFRN

Abstract


The increasing complexity and critical nature of Operations and Maintenance Systems (OMS) in 5G mobile networks necessitate robust Disaster Recovery (DR) solutions to ensure continuous service and minimal downtime. Disaster Recovery Systems (DRS) are essential for maintaining network resilience by facilitating seamless failover and recovery processes. The primary function of switchover enables a DRS to ensure 5G service continuity during unforeseen disasters. This research addresses the limitations of traditional rulebased decision-making techniques that often rely on binary switchover logic, which can be inadequate for the intricate demands of 5G networks. We propose the predictive OMS Switchover (pOM2S), a machine learning (ML)-driven approach that utilizes data on computing and networking metrics to estimate the switchover period for each redundant candidate. In doing so, our solution is able to select the backup OMS that can perform the fastest in a disaster event. Evaluated in a 5G emulation testbed simulating real-world conditions, results demonstrated Random Forest’s superior accuracy (MAE: 1.68s, R²: 0.94) over Linear Regression, Artificial Neural Networks, and Support Vector Machine techniques. Experimental results validate pOM2S’s effectiveness in balancing predictive precision and operational practicality, suggesting that the predictive decision-making method relies on a highly accurate model.

Keywords: 5G/6G networks and applications, Fault tolerance and resilience Future Internet, Management, operation, design, and analysis of networks, Network simulation and emulation Performance, scalability, and reliability, Anomaly and Attack Detection and Prevention

References

Abdelaziz, A. S., Harb, H., Zaghloul, A., and Salem, A. (2023). An enhanced MCDM model for cloud service provider selection. International Journal of Advanced Computer Science and Applications, 14(2).

Al-Essa, H. A. and Abdulbaki, A. A. (2016). Disaster recovery datacenter’s architecture on network replication solution. In 2016 European Modelling Symposium (EMS), pages 175–180.

Chang, W.-C. and Lin, F. J. (2021). Coordinated management of 5G core slices by MANO and OSS/BSS. Journal of Computer and Communications, 9(6):52–72.

Group, G.-P. S. N. W. (2020). Cloud native and 5G verticals services. Technical report, 5G PPP.

Haleplidis, E., Pentikousis, K., Denazis, S., Salim, J. H., Meyer, D., and Koufopavlou, O. (2015). Software-Defined Networking (SDN): Layers and Architecture Terminology. RFC 7426.

Lawler, C. M., Harper, M. A., and Thornton, M. A. (2007). Components and analysis of disaster tolerant computing. In 2007 IEEE International Performance, Computing, and Communications Conference, pages 380–386.

Leiter, A., Hegyi, A., Galambosi, N., Lami, E., and Fazekas, P. (2022). Automatic failover of 5G container-based user plane function by ONAP closed-loop orchestration. In NOMS 2022-2022 IEEE/IFIP Network Operations and Management Symposium, pages 1–2.

Liang, M., Gao, F., and Shi, M. (2022). A failure detection method of remote disaster recovery and backup system. In ITM Web of Conferences, volume 47, page 01002. EDP Sciences.

Mijumbi, R., Serrat, J., Gorricho, J., Bouten, N., De Turck, F., and Boutaba, R. (2016). Network function virtualization: State-of-the-art and research challenges. IEEE Communications Surveys & Tutorials.

Saha, M., Panda, S. K., and Panigrahi, S. (2021). A hybrid multi-criteria decision making algorithm for cloud service selection. International Journal of Information Technology, 13(4):1417–1422.

Xie, H., Du, S., Zhang, Q., Yang, Y., and Wang, H. (2023). Application of UPF disaster tolerant networking in power private network. In 2023 IEEE 6th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC), volume 6, pages 811–815.

Zaretalab, A., Hajipour, V., and Tavana, M. (2020). Redundancy allocation problem with multistate component systems and reliable supplier selection. Reliability Engineering & System Safety, 193:106629.

Zheng, Y., Fu, Y., Zhang, Y., Hao, S., and Cai, Y. (2020). Research and practice of intelligent operation and maintenance system for essential load guarantee based on 5G and IoT technology. In 2020 4th International Conference on Power and Energy Engineering (ICPEE), pages 53–57.

Zhu, H., Li, J., Hu, J., and Li, W. (2022). Failure-aware and automated disaster backup in the 5G core network. In 2022 International Communication Engineering and Cloud Computing Conference (CECCC), pages 48–53.

Zunino, C., Cena, G., Scanzio, S., and Valenzano, A. (2024). Adaptive seamless redundancy to achieve highly dependable MQTT communication. IEEE Transactions on Industrial Informatics, 20(1):984–994.
Published
2025-05-19
F. SANTOS, Charles; V. NETO, Augusto; FONTES, Ramon R.; IMMICH, Roger; SOUSA JR., Vicente; DA SILVA, Helber W.. Predictive OMS Switchover towards Proactive Disaster Recovery in 5G Networks. In: BRAZILIAN SYMPOSIUM ON COMPUTER NETWORKS AND DISTRIBUTED SYSTEMS (SBRC), 43. , 2025, Natal/RN. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2025 . p. 896-909. ISSN 2177-9384. DOI: https://doi.org/10.5753/sbrc.2025.6402.

Most read articles by the same author(s)

1 2 > >>