Disaster-FD: A fault detector for disaster-prone environments
Abstract
This paper introduces Disaster-FD, a failure detector designed for disaster-prone environments focusing on real-time monitoring of IoT networks. Inspired by Impact-FD, this approach emphasizes active monitoring and assessment of network reliability. The paper explores key elements such as reliability threshold, confidence level, and impact factor. Tests on the IoT-LAB platform demonstrate the functionalities of Disaster-FD in various environments, highlighting its relevance in natural disaster scenarios.References
Adjih, C., Baccelli, E., Fleury, E., Harter, G., Mitton, N., Noel, T., Pissard-Gibollet, R., Saint-Marcel, F., Schreiner, G., Vandaele, J., et al. (2015). FIT IoT-LAB: A large scale open experimental IoT testbed. In 2015 IEEE 2nd World Forum on Internet of Things (WF-IoT), pages 459–464. IEEE.
Aguilera, M. K., Delporte-Gallet, C., Fauconnier, H., and Toueg, S. (2004). Communication-efficient leader election and consensus with limited link synchrony. In Proceedings of the twenty-third annual ACM symposium on Principles of distributed computing, pages 328–337.
Atzori, L., Iera, A., and Morabito, G. (2010). The internet of things: A survey. Computer networks, 54(15):2787–2805.
Chandra, T. D., Hadzilacos, V., and Toueg, S. (1996). The weakest failure detector for solving consensus. Journal of the ACM (JACM), 43(4):685–722.
Chandra, T. D. and Toueg, S. (1996). Unreliable failure detectors for reliable distributed systems. Journal of the ACM (JACM), 43(2):225–267.
Chen, W., Toueg, S., and Aguilera, M. K. (2002). On the quality of service of failure detectors. IEEE Transactions on computers, 51(5):561–580.
Cristian, F. and Fetzer, C. (1999). The timed asynchronous distributed system model. IEEE Transactions on Parallel and Distributed Systems, 10(6):642–657.
Fischer, M. J., Lynch, N. A., and Paterson, M. S. (1985). Impossibility of distributed consensus with one faulty process. Journal of the ACM (JACM), 32(2):374–382.
Janoneda, L. (2022). A cada desastre natural no brasil, em média, 3,4 mil pessoas são afetadas. Acessada: 2022-01-09.
Kovatsch, M., Lanter, M., and Shelby, Z. (2014). Californium: Scalable cloud services for the internet of things with coap. In 2014 International Conference on the Internet of Things (IOT), pages 1–6. IEEE.
Rossetto, A. G. d. M., Geyer, C. F., Arantes, L., and Sens, P. (2018). Impact fd: An unreliable failure detector based on process relevance and confidence in the system. The Computer Journal, 61(10):1557–1576.
Sens, P., Arantes, L., Rosseto, A. G. D. M., and Marin, O. (2024). Stab-fd: A cooperative and adaptive failure detector for wide area networks. Journal of Parallel and Distributed Computing, 186.
Verissimo, P. and Rodrigues, L. (2012). Distributed Systems for System Architects, volume 1. Springer Science & Business Media.
Yang, R., Zhu, S., Li, Y., and Gupta, I. (2019). Medley: A novel distributed failure detector for IoT networks. In Proceedings of the 20th International Middleware Conference, pages 319–331.
Aguilera, M. K., Delporte-Gallet, C., Fauconnier, H., and Toueg, S. (2004). Communication-efficient leader election and consensus with limited link synchrony. In Proceedings of the twenty-third annual ACM symposium on Principles of distributed computing, pages 328–337.
Atzori, L., Iera, A., and Morabito, G. (2010). The internet of things: A survey. Computer networks, 54(15):2787–2805.
Chandra, T. D., Hadzilacos, V., and Toueg, S. (1996). The weakest failure detector for solving consensus. Journal of the ACM (JACM), 43(4):685–722.
Chandra, T. D. and Toueg, S. (1996). Unreliable failure detectors for reliable distributed systems. Journal of the ACM (JACM), 43(2):225–267.
Chen, W., Toueg, S., and Aguilera, M. K. (2002). On the quality of service of failure detectors. IEEE Transactions on computers, 51(5):561–580.
Cristian, F. and Fetzer, C. (1999). The timed asynchronous distributed system model. IEEE Transactions on Parallel and Distributed Systems, 10(6):642–657.
Fischer, M. J., Lynch, N. A., and Paterson, M. S. (1985). Impossibility of distributed consensus with one faulty process. Journal of the ACM (JACM), 32(2):374–382.
Janoneda, L. (2022). A cada desastre natural no brasil, em média, 3,4 mil pessoas são afetadas. Acessada: 2022-01-09.
Kovatsch, M., Lanter, M., and Shelby, Z. (2014). Californium: Scalable cloud services for the internet of things with coap. In 2014 International Conference on the Internet of Things (IOT), pages 1–6. IEEE.
Rossetto, A. G. d. M., Geyer, C. F., Arantes, L., and Sens, P. (2018). Impact fd: An unreliable failure detector based on process relevance and confidence in the system. The Computer Journal, 61(10):1557–1576.
Sens, P., Arantes, L., Rosseto, A. G. D. M., and Marin, O. (2024). Stab-fd: A cooperative and adaptive failure detector for wide area networks. Journal of Parallel and Distributed Computing, 186.
Verissimo, P. and Rodrigues, L. (2012). Distributed Systems for System Architects, volume 1. Springer Science & Business Media.
Yang, R., Zhu, S., Li, Y., and Gupta, I. (2019). Medley: A novel distributed failure detector for IoT networks. In Proceedings of the 20th International Middleware Conference, pages 319–331.
Published
2024-05-24
How to Cite
SILVA, Abadio de Paulo; ROSSETTO, Anubis Graciela de Moraes; SENS, Pierre; ARANTES, Luciana; PASQUINI, Rafael; COELHO, Paulo.
Disaster-FD: A fault detector for disaster-prone environments. In: FAULT TOLERANCE WORKSHOP (WTF), 25. , 2024, Niterói/RJ.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2024
.
p. 43-56.
ISSN 2595-2684.
DOI: https://doi.org/10.5753/wtf.2024.2903.
