liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A Quantitative Comparison of PRAM based Emulated Shared Memory Architectures to Current Multicore CPUs and GPUs
Linköping University, Department of Computer and Information Science, Software and Systems. Linköping University, The Institute of Technology. (PELAB)
(PELAB)
Linköping University, Department of Computer and Information Science, Software and Systems. Linköping University, The Institute of Technology. (PELAB)ORCID iD: 0000-0001-5241-0026
VTT Technical Research Centre of Finland. (Platform Architectures Team)
2014 (English)In: 27th International Conference on Architecture of Computing Systems (ARCS), 2014, ARCS Workshops: Proc. PASA-2014 11th Workshop on Parallel Systems and Algorithms, Lübeck, Germany, Lübeck, Germany: VDE Verlag GmbH, 2014, 27-33 p.Conference paper, Published paper (Refereed)
Abstract [en]

The performance of current multicore CPUs and GPUs is limited in computations making frequent use of communication/synchronization between the subtasks executed in parallel. This is because the directory-based cache systems scale weakly and/or the cost of synchronization is high. The Emulated Shared Memory (ESM) architectures relying on multithreading and efficient synchronization mechanisms have been developed to solve these problems affecting both performance and programmability of current machines. In this paper, we compare preliminarily the performance of three hardware implemented ESM architectures with state-of-the-art multicore CPUs and GPUs. The benchmarks are selected to cover different patterns of parallel computation and therefore reveal the performance potential of ESM architectures with respect to current multicores.

Place, publisher, year, edition, pages
Lübeck, Germany: VDE Verlag GmbH, 2014. 27-33 p.
Series
PARS-Mitteilungen, ISSN 0177-0454 ; 31
Keyword [en]
Parallel computing, performance analysis, GPU, chip multiprocessor, shared memory
National Category
Computer Science
Identifiers
URN: urn:nbn:se:liu:diva-114341ISBN: 978-3-8007-3579-2 (print)OAI: oai:DiVA.org:liu-114341DiVA: diva2:789421
Conference
27th International Conference on Architecture of Computing Systems (ARCS) 2014, PASA-2014 11th Workshop on Parallel Systems and Algorithms, Lübeck, Germany, Feb. 2014
Projects
REPLICASeRC OpCoReS
Funder
Swedish e‐Science Research Center, OpCoReS
Available from: 2015-02-18 Created: 2015-02-18 Last updated: 2015-02-24

Open Access in DiVA

No full text

Other links

IEEE Xplore

Authority records BETA

Hansson, ErikKessler, Christoph

Search in DiVA

By author/editor
Hansson, ErikKessler, Christoph
By organisation
Software and SystemsThe Institute of Technology
Computer Science

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 108 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf