DTA-C: A Decoupled multi-Threaded Architecture for CMP Systems
Resumo
One way to exploit Thread Level Parallelism (TLP) is to use architectures that implement novel multithreaded execution models, like Scheduled Data- Flow (SDF). This latter model promises an elegant decoupled and non-blocking execution of threads. Here we extend that model in order to be used in future scalable CMP systems where wire delay imposes to partition the design. In this paper we describe our approach and experiment with different distributed schedulers, different number of clusters and processors per cluster to show good scalability of our architecture. We describe our approach and present initial results on system scalability and performance. We suggest design choices to improve the scalability of the basic design.
Palavras-chave:
Yarn, Scalability, Pipelines, Computer architecture, Processor scheduling, Wire, Logic, High performance computing, Data engineering, Parallel processing
Publicado
24/10/2007
Como Citar
GIORGI, Roberto; POPOVIC, Zdravko; PUZOVIC, Nikola.
DTA-C: A Decoupled multi-Threaded Architecture for CMP Systems. In: INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 19. , 2007, Gramado/RS.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2007
.
p. 263-270.
