liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
OpenCL for programming shared memory multicore CPUs
Linköping University, The Institute of Technology.
Linköping University, Department of Computer and Information Science, Software and Systems. Linköping University, The Institute of Technology. (PELAB)
Linköping University, Department of Computer and Information Science, Software and Systems. Linköping University, The Institute of Technology. (PELAB)ORCID iD: 0000-0001-5241-0026
2012 (English)In: Proceedings of the 5th Workshop on MULTIPROG2012 / [ed] E. Ayguade, B. Gaster, L. Howes, P. Stenström, O. Unsal, HiPEAC Network of Excellence , 2012Conference paper, Published paper (Refereed)
Abstract [en]

Shared memory multicore processor technology is pervasive in mainstream computing. This new architecture challenges programmers to write code that scales over these many cores to exploit the full computational power of these machines. OpenMP and Intel Threading Building Blocks (TBB) are two of the popular frameworks used to program these architectures. Recently, OpenCL has been defined as a standard by Khronos group which focuses on programming a possibly heterogeneous set of processors with many cores such as CPU cores, GPUs, DSP processors. In this work, we evaluate the effectiveness of OpenCL for programming multicore CPUs in a comparative case study with OpenMP and Intel TBB for five benchmark applications: matrix multiply, LU decomposition,2D image convolution, Pi value approximation and image histogram generation. The evaluation includes the effect of compiler optimizations for different frameworks, OpenCL performance on different vendors’ platformsand the performance gap between CPU-specific and GPU-specific OpenCL algorithms for execution on a modern GPU. Furthermore, a brief usability evaluation of the three frameworks is also presented.

Place, publisher, year, edition, pages
HiPEAC Network of Excellence , 2012.
Keyword [en]
parallel programming, parallel computing, benchmarking, GPU computing, multicore processor, OpenCL, Threading Building Blocks (TBB), OpenMP
National Category
Computer Science
Identifiers
URN: urn:nbn:se:liu:diva-93951OAI: oai:DiVA.org:liu-93951DiVA: diva2:628242
Conference
Fifth Workshop on Programmability Issues for Heterogeneous Multicores (MULTIPROG-2012) at HiPEAC-2012, 23 January, Paris, France
Projects
EU FP7 PEPPHER (2010-2012), #248481, www.peppher.eu
Available from: 2013-06-13 Created: 2013-06-13 Last updated: 2017-05-02Bibliographically approved

Open Access in DiVA

fulltext(472 kB)58 downloads
File information
File name FULLTEXT01.pdfFile size 472 kBChecksum SHA-512
ad3db792d5f2649f40379af26a64df57c504f2846ff4fcaf456fb25b9d48d968ff500aafd74a8c1eb919f2a2631b4fe7e5469a020a6e37a1c420ce2725f04364
Type fulltextMimetype application/pdf

Authority records BETA

Ali, AkhtarDastgeer, UsmanKessler, Christoph

Search in DiVA

By author/editor
Ali, AkhtarDastgeer, UsmanKessler, Christoph
By organisation
The Institute of TechnologySoftware and Systems
Computer Science

Search outside of DiVA

GoogleGoogle Scholar
Total: 58 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 555 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf