Utilizando a RFT para a Detecção de Falhas de Microsserviços do Controlador O-RAN

  • Alexandre Huff UTFPR
  • Matti Hiltunen AT&T Labs Research
  • Elias P. Duarte Jr. UFPR

Abstract


The O-RAN (Open Radio Access Network) Alliance is defining a new open source communication interface (E2) to customize and control RAN elements. The RIC (RAN Intelligent Controller) platform allows implementing RAN control functions through microservices called xApps. This work describes a fault tolerance strategy for microservices (xApps) that run on the RIC controller. The proposed strategy consists of state partitioning techniques with partial replication in groups of xApps and context-aware rerouting of messages. A library called RFT (RIC Fault Tolerance) was implemented and is publicly available for the development of fault-tolerant xApps. This paper describes experimental results that demonstrate the detection of faults of RIC microservices with the RFT.

References

3GPP (2021). The 3rd generation partnership project (3GPP). https://www.3gpp.org/. Acessado em junho de 2021.

Adya, A., Myers, D., Howell, J., Elson, J., Meek, C., Khemani, V., et al. (2016). Slicer: Auto-sharding for datacenter applications. In Proc. OSDI, pages 739–753.

Cattell, R. (2011). Scalable SQL and NoSQL data stores. ACM SIGMOD Rec., 39(4):12–27.

Gomez-Miguelez, I., Garcia-Saavedra, A., Sutton, P., Serrano, P., Cano, C., and Leith, D. (2016). srsLTE: an open-source platform for LTE evolution and experimentation. In Proc. 10th ACM Int. Workshop on Wireless Network Testbeds, Exp. Evaluation, and Characterization, pages 25–32.

Gudipati, A., Perry, D., Li, L. E., and Katti, S. (2013). SoftRAN: Software defined radio access network. In Proc. 2nd ACM SIGCOMM Workshop on Hot Topics in Software Defined Networking, page 25–30.

Habibi, M. A., Nasimi, M., Han, B., and Schotten, H. D. (2019). A comprehensive survey of RAN architectures toward 5G mobile communication system. IEEE Access, 7:70371–70421.

Huff, A. (2021). Composição de Serviços Virtualizados de Rede sobre Múltiplos Orquestradores NFV & Tolerância a Falhas para Microsserviços do Controlador O-RAN. PhD Thesis, UFPR.

Huff, A., Hiltunen, M., and Duarte Jr., E. P. (2021a). RFT: RIC fault tolerance. https://github.com/alexandre-huff/rft. Acessado em junho de 2021.

Huff, A., Hiltunen, M., and Duarte Jr., E. P. (2021b). RFT: Scalable and fault-tolerant microservices for the O-RAN control plane. In 2021 IFIP/IEEE Int. Symp. on Integrated Network Management.

Kaltenberger, F., de Souza, G., Knopp, R., and Wang, H. (2019). The OpenAirInterface 5G new radio implementation: Current status and roadmap. In 23rd Int. ITG Workshop on Smart Antennas (WSA).

Kubernetes (2021). Kubernetes. https://kubernetes.io/. Acessado em junho de 2021.

O-RAN Alliance (2020a). O-RAN architecture description. Technical Specification O-RAN-WG1-O-RAN Architecture Description - v01.00.00, O-RAN Alliance.

O-RAN Alliance (2020b). O-RAN near-real-time RAN intelligent controller architecture & E2 general aspects and principles 1.01. Technical Specification O-RAN.WG3.E2GAP-v01.01, O-RAN Alliance.

O-RAN Alliance (2020c). O-RAN operations and maintenance architecture. Technical Specification ORAN. WG1.OAM-Architecture-v03.00, O-RAN Alliance.

O-RAN Alliance (2021). O-RAN Alliance. https://www.o-ran.org/. Acessado em junho de 2021.

O-RAN SC (2021a). O-RAN SC projects: Near realtime RAN intelligent controller (RIC). https://docs.o-ran-sc.org/en/latest/projects.html. Acessado em junho de 2021.

O-RAN SC (2021b). O-RAN software community. https://o-ran-sc.org/. Acessado em junho de 2021.

OAI (2021). OpenAirInterface: 5G software alliance for democratising wireless innovation. https://www.openairinterface.org/. Acessado em junho de 2021.

Olwal, T. O., Djouani, K., and Kurien, A. M. (2016). A survey of resource management toward 5G radio access networks. IEEE Communications Surveys & Tutorials, 18(3):1656–1686.

Ongaro, D. and Ousterhout, J. (2014). In search of an understandable consensus algorithm. In Proc. USENIX ATC, pages 305–319.

Parvez, I., Rahmati, A., Guvenc, I., Sarwat, A. I., and Dai, H. (2018). A survey on low latency towards 5G: RAN, core network and caching solutions. IEEE Commun. Surveys & Tutorials, 20(4):3098–3130.

Schneider, F. B. (1990). Implementing fault-tolerant services using the state machine approach: a tutorial. ACM Computing Surveys, 22(4):299–319.

Shayea, I., Ergen, M., Azmi, M. H., et al. (2020). Key challenges, drivers and solutions for mobility management in 5G networks: A survey. IEEE Access, 8:172534–172552.

Sherry, J., Gao, P. X., Basu, S., Panda, A., Krishnamurthy, A., Maciocco, C., et al. (2015). Rollbackrecovery for middleboxes. In Proc. SIGCOMM, page 227–240.

van Renesse, R. and Schneider, F. B. (2004). Chain replication for supporting high throughput and availability. In Proc. OSDI.

Zhao, W., Melliar-Smith, P., and Moser, L. (2013). Low latency fault tolerance system. The Computer Journal, 56:716–740.
Published
2021-08-16
HUFF, Alexandre; HILTUNEN, Matti; DUARTE JR., Elias P.. Utilizando a RFT para a Detecção de Falhas de Microsserviços do Controlador O-RAN. In: FAULT TOLERANCE WORKSHOP (WTF), 22. , 2021, Uberlândia. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2021 . p. 99-112. ISSN 2595-2684. DOI: https://doi.org/10.5753/wtf.2021.17207.