A Quantitative Comparison of PRAM based Emulated Shared Memory Architectures to Current Multicore CPUs and GPUs
2014 (English)
In: 27th International Conference on Architecture of Computing Systems (ARCS), 2014, ARCS Workshops: Proc. PASA-2014 11th Workshop on Parallel Systems and Algorithms, Lübeck, Germany. Lübeck, Germany: VDE Verlag GmbH, 2014, 27-33 p.
Conference paper (Refereed)
The performance of current multicore CPUs and GPUs is limited in computations that make frequent use of communication or synchronization between subtasks executed in parallel. This is because directory-based cache systems scale poorly and/or the cost of synchronization is high. Emulated Shared Memory (ESM) architectures, which rely on multithreading and efficient synchronization mechanisms, have been developed to solve these problems, which affect both the performance and the programmability of current machines. In this paper, we present a preliminary comparison of the performance of three hardware-implemented ESM architectures with state-of-the-art multicore CPUs and GPUs. The benchmarks are selected to cover different patterns of parallel computation and therefore reveal the performance potential of ESM architectures with respect to current multicores.
Place, publisher, year, edition, pages
Lübeck, Germany: VDE Verlag GmbH, 2014. 27-33 p.
PARS-Mitteilungen, ISSN 0177-0454 ; 31
Parallel computing, performance analysis, GPU, chip multiprocessor, shared memory
Identifiers
URN: urn:nbn:se:liu:diva-114341
ISBN: 978-3-8007-3579-2
OAI: oai:DiVA.org:liu-114341
DiVA: diva2:789421
27th International Conference on Architecture of Computing Systems (ARCS) 2014, PASA-2014 11th Workshop on Parallel Systems and Algorithms, Lübeck, Germany, Feb. 2014
Funder
Swedish e‐Science Research Center, OpCoReS