Analyzing and improving clustering based sampling for microprocessor simulation

  • Yue Luo University of Texas
  • A. Joshi University of Texas
  • A. Phansalkar University of Texas
  • L. John University of Texas
  • J. Ghosh University of Texas

Resumo


We propose a set of statistical metrics for making a comprehensive, fair, and insightful evaluation of features, clustering algorithms, and distance measures in representative sampling techniques for microprocessor simulation. Our evaluation of clustering algorithms using these metrics shows that CLARANS clustering algorithm produces better quality clusters in the feature space and more homogeneous phases for CPI compared to the popular k-means algorithm. We also propose a new micro-architecture independent data locality based feature, reuse distance distribution (RDD), for finding phases in programs, and show that the RDD feature consistently results in more homogeneous phases than basic block vector (BBV) for many SPEC CPU2000 benchmark programs.
Palavras-chave: Sampling methods, Microprocessors, Analytical models, Clustering algorithms, Computational modeling, Computer simulation, Microarchitecture, Computer architecture, Phase measurement, Data mining
Publicado
24/10/2005
LUO, Yue; JOSHI, A.; PHANSALKAR, A.; JOHN, L.; GHOSH, J.. Analyzing and improving clustering based sampling for microprocessor simulation. In: INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 17. , 2005, Rio de Janeiro/RJ. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2005 . p. 193-200.