Hiding Communication Delays in Clustered Microarchitectures
Resumo
Clustered micro architectures represent a viable solution for addressing wire delays in communication-bound architectures by partitioning monolithic data path structures into smaller components. While supporting high frequencies, clustered processors usually degrade the instruction throughput due to the inter-cluster communication delays and non-balanced workload distribution. In this paper, we propose and evaluate novel instruction steering policies to reduce or eliminate cross-cluster communication delays while respecting workload balance. Our first technique hides the inter-cluster communication latencies by examining operand readiness information. The proposed policy steers instructions with two register sources to the cluster predicted to generate the last-produced operand. While the later-produced operand is being generated, the transport of the early-produced operand can occur in parallel, hiding the communication delay. Our second technique steers an entire group of instructions co-renamed in a cycle to the same cluster if the number of intra-group register dependencies exceed a threshold. This is done in a round-robin fashion in order to reduce impact on workload balancing.
Palavras-chave:
Microarchitecture, Frequency, Degradation, Wires, Clocks, Throughput, Added delay, Registers, Buffer storage, Computer architecture, clustered microarchittures
Publicado
29/10/2008
Como Citar
LADUCA, Robert J.; SHARKEY, Joseph; PONOMAREV, Dmitry V..
Hiding Communication Delays in Clustered Microarchitectures. In: INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 20. , 2008, Campo Grande/MS.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2008
.
p. 107-114.
