An Anomaly Detection Approach for Scale-Out Storage Systems
Resumo
Scale-out storage systems (SoSS) have become increasingly important for meeting availability requirements of web services in cloud platforms. To enhance data availability, SoSS rely on a variety of built-in fault-tolerant mechanisms, including replication, redundant network topologies, advanced request scheduling, and other failover techniques. However, performance issues in cloud services still remain one of the main causes of discontentment among their tenants. In this paper, we propose an anomaly detection approach for SoSS that predicts cloud anomalies caused by memory and network faults. To evaluate our prediction model, we built a testbed simulating a virtual data center using VMware. Experimental results confirm that the injected faults are likely to undermine the data availability in SoSS. They suggest that although unsupervised learning has been the most common method for anomaly detection, a supervisedbased implementation of the same model reduces the false positive rate by roughly 10%. Our analysis also points out that probing SoSS-specific monitoring data at the VM-level contributes to improve the anomaly prediction efficiency.
Palavras-chave:
Monitoring, Predictive models, Unsupervised learning, Training, Throughput, Availability, Loss measurement
Publicado
22/10/2014
Como Citar
SILVESTRE, Guthemberg; SAUVANAUD, Carla; KAÂNICHE, Mohamed; KANOUN, Karama.
An Anomaly Detection Approach for Scale-Out Storage Systems. In: INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 26. , 2014, Paris/FR.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2014
.
p. 294-301.
