OpenCL for programming shared memory multicore CPUs
Linköping University, The Institute of Technology.
Linköping University, Department of Computer and Information Science, Software and Systems. Linköping University, The Institute of Technology. (PELAB)
Linköping University, Department of Computer and Information Science, Software and Systems. Linköping University, The Institute of Technology. (PELAB) ORCID iD: 0000-0001-5241-0026
2012 (English). In: Proceedings of the 5th Workshop on MULTIPROG2012 / [ed] E. Ayguade, B. Gaster, L. Howes, P. Stenström, O. Unsal, HiPEAC Network of Excellence, 2012. Conference paper, Published paper (Refereed)
Abstract [en]

Shared memory multicore processor technology is pervasive in mainstream computing. This architecture challenges programmers to write code that scales across many cores in order to exploit the full computational power of these machines. OpenMP and Intel Threading Building Blocks (TBB) are two of the popular frameworks used to program these architectures. Recently, OpenCL has been defined as a standard by the Khronos Group; it focuses on programming a possibly heterogeneous set of processors with many cores, such as CPU cores, GPUs, and DSPs. In this work, we evaluate the effectiveness of OpenCL for programming multicore CPUs in a comparative case study with OpenMP and Intel TBB for five benchmark applications: matrix multiply, LU decomposition, 2D image convolution, Pi value approximation, and image histogram generation. The evaluation includes the effect of compiler optimizations for the different frameworks, OpenCL performance on different vendors’ platforms, and the performance gap between CPU-specific and GPU-specific OpenCL algorithms for execution on a modern GPU. Furthermore, a brief usability evaluation of the three frameworks is also presented.
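
To make one of the listed benchmarks concrete, the following is a minimal sketch of a Pi value approximation written with OpenMP. It is an illustration only: the record does not show the authors' benchmark code, so the midpoint-rule integration method, the function name `approximate_pi`, and the parameter `num_steps` are all assumptions.

```c
/* Hypothetical sketch, not the paper's benchmark code.
 * Approximates pi by midpoint-rule integration of 4 / (1 + x^2) over [0, 1]. */
double approximate_pi(long num_steps)
{
    const double step = 1.0 / (double)num_steps;
    double sum = 0.0;
    long i;

    /* Each thread accumulates a private partial sum; the reduction clause
     * combines the partial sums when the loop finishes. */
    #pragma omp parallel for reduction(+:sum)
    for (i = 0; i < num_steps; i++) {
        const double x = ((double)i + 0.5) * step;  /* midpoint of interval i */
        sum += 4.0 / (1.0 + x * x);
    }

    return sum * step;
}
```

Compiling with OpenMP enabled (for example `-fopenmp` with GCC or Clang) distributes the loop iterations over the available cores; without that flag the pragma is ignored and the same loop runs serially, which makes the sequential and parallel versions easy to compare.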

Place, publisher, year, edition, pages
HiPEAC Network of Excellence, 2012.
Keywords [en]
parallel programming, parallel computing, benchmarking, GPU computing, multicore processor, OpenCL, Threading Building Blocks (TBB), OpenMP
Identifiers
URN: urn:nbn:se:liu:diva-93951
OAI: oai:DiVA.org:liu-93951
DiVA, id: diva2:628242
Conference
Fifth Workshop on Programmability Issues for Heterogeneous Multicores (MULTIPROG-2012) at HiPEAC-2012, 23 January, Paris, France
Projects
EU FP7 PEPPHER (2010-2012), #248481, www.peppher.eu
Available from: 2013-06-13 Created: 2013-06-13 Last updated: 2018-01-11 Bibliographically approved


Authors

Ali, Akhtar; Dastgeer, Usman; Kessler, Christoph
