A Proposal for an Autonomic Failure Detector Using Control Engineering

  • Alirio Santos de Sá UFBA
  • Raimundo José de Araújo Macêdo UFBA


When building failure detectors, the choice of the monitoring period is key to achieving faster detection with lower resource consumption. However, this choice is not trivial in environments whose computational delays are unknown and may change dynamically over time. This paper therefore presents and evaluates, using detection Quality of Service (QoS) metrics, a control-theory-based strategy for automatically tuning the monitoring period. The strategy aims to match the detector's resource consumption to the amount of resources available in the environment while meeting the detection QoS requirements.
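To make the idea of feedback-based period tuning concrete, the following is a minimal illustrative sketch and not the controller described in the paper. It assumes a discrete PI controller in incremental (velocity) form that lengthens the monitoring period when a measured resource utilisation exceeds a target and shortens it otherwise. The class name `PeriodTuner`, the gains `kp` and `ki`, the utilisation setpoint, and the period bounds are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass


@dataclass
class PeriodTuner:
    """Hypothetical PI tuner for a failure detector's monitoring period."""

    setpoint: float            # target resource utilisation in [0, 1] (assumed metric)
    kp: float = 0.5            # proportional gain (illustrative value)
    ki: float = 0.1            # integral gain (illustrative value)
    min_period: float = 0.01   # lower bound on the period, in seconds
    max_period: float = 5.0    # upper bound on the period, in seconds
    period: float = 1.0        # current monitoring period, in seconds
    _prev_error: float = 0.0   # error seen at the previous control step

    def update(self, measured_utilisation: float) -> float:
        """Run one control step and return the new monitoring period."""
        # Positive error means the detector consumes more than the target,
        # so the period should grow (fewer monitoring messages).
        error = measured_utilisation - self.setpoint
        # Velocity-form PI: the change in output depends on the change in
        # error (proportional part) and on the current error (integral part).
        delta = self.kp * (error - self._prev_error) + self.ki * error
        self._prev_error = error
        # Clamp so the period stays within the allowed range.
        self.period = min(self.max_period, max(self.min_period, self.period + delta))
        return self.period


if __name__ == "__main__":
    tuner = PeriodTuner(setpoint=0.5)
    for utilisation in (0.9, 0.8, 0.6, 0.5, 0.3):
        print(f"utilisation={utilisation:.1f} -> period={tuner.update(utilisation):.3f} s")
```

In this sketch a longer period lowers resource consumption at the cost of slower detection, so the bounds `min_period` and `max_period` stand in for the detection QoS requirements that a real design would have to derive from the application.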


SÁ, Alirio Santos de; MACÊDO, Raimundo José de Araújo. Uma Proposta de Detector de Defeitos Autonômico Usando Engenharia de Controle. In: WORKSHOP DE TESTES E TOLERÂNCIA A FALHAS (WTF), 10. , 2009, João Pessoa/PB. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2009 . p. 34-47. ISSN 2595-2684. DOI: