Using Machine Learning Techniques to Classify the Interference of HPC applications in Virtual Machines with Uncertain Data
ResumoThis work aims to predict the level of interference caused by concurrent access to shared resources, such as cache and main memory, that can drastically affect the performance of HPC applications executed in clouds, by using some well-known machine learning techniques. As the user does not know the exact number of resource accesses in practice, we propose a human-readable categorization of these accesses. The used dataset contains information about synthetic and real HPC applications, and, to reﬂect the uncertainty of the user categorization, we inserted some noisy data in it. Our results showed that our approach could correctly predict the level of interference in most cases, indicating that it can be a practical solution.
Alves, M. M. and de Assumpção Drummond, L. M. (2017). A multivariate and quantita- tive model for predicting cross-application interference in virtual environments. Jour- nal of Systems and Software, 128:150 – 163.
Breiman, L. (2001). Random forests. Machine learning, 45(1):5–32.
Cover, T. and Hart, P. (1967). Nearest neighbor pattern classication. IEEE transactions on information theory, 13(1):21–27.
Cristianini, N. and Shawe-Taylor, J. (2000). An Introduction to Support Vector Machines and Other Kernel-based Learning Methods. Cambridge University Press.
Goodfellow, I., Bengio, Y., and Courville, A. (2016). Deep Learning. MIT Press. http://www.deeplearningbook.org.
Japkowicz, N. and Shah, M. (2011). Evaluating learning algorithms: a classication perspective. Cambridge University Press.
Likert, R. (1932). A technique for the measurement of attitudes. Archives of Psychology, 22 140:55.
Ludwig, U. L., Xavier, M. G., Kirchoff, D. F., Cezar, I. B., and De Rose, C. A. F. (2019). Optimizing multi-tier application performance with interference and afnity- aware placement algorithms. Concurrency and Computation: Practice and Experi- ence, 31(18):e5098. e5098 cpe.5098.
Melo Alves, M., da Cruz Pestana, R., Alves Prado da Silva, R., and Drummond, L. M. (2017). Accelerating pre-stack kirchhoff time migration by manual vectorization. Con- currency and Computation: Practice and Experience, 29(22):e3935.
Meyer, V., Kirchoff, D. F., da Silva, M. L., and César A. F., D. R. (2020). An interference- aware application classier based on machine learning to improve scheduling in In 2020 28th Euromicro International Conference on Parallel, Distributed clouds. and Network-Based Processing (PDP), pages 80–87.
Otto, C. and Kempka, T. (2017). Prediction of steam jacket dynamics and water balances in underground coal gasication. Energies, 10(6):739.
P. Domingos and Pazzani, M. (1997). On the Optimality of the Simple Bayesian Classier under Zero-One Loss. Machine Learning. Machine Learning, 29:103–130.
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12:2825–2830.
Ren, S., He, L., Li, J., Chen, Z., Jiang, P., and Li, C. T. (2019). Contention-aware pre- diction for performance impact of task co-running in multicore computers. Wireless Networks, 7.
Zacarias, F. V., Petrucci, V., Nishtala, R., Carpenter, P., and Mossé, D. (2019). Intelli- gent colocation of workloads for enhanced server efciency. In 2019 31st Internatio- nal Symposium on Computer Architecture and High Performance Computing (SBAC- PAD), pages 120–127.