liu.seSök publikationer i DiVA
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Benchmarking OpenCL, OpenACC, OpenMP, and CUDA: Programming Productivity, Performance, and Energy Consumption
Linnaeus University, Växjö.
Linköpings universitet, Institutionen för datavetenskap, Programvara och system. Linköpings universitet, Tekniska fakulteten. (PELAB)ORCID-id: 0000-0001-8976-0484
Linnaeus University, Växjö.
Cracow University of Technology, Poland.
Visa övriga samt affilieringar
2017 (Engelska)Ingår i: Proceedings of the 2017 Workshop on Adaptive Resource Management and Scheduling for Cloud Computing, Association for Computing Machinery (ACM), 2017, s. 1-6Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Many modern parallel computing systems are heterogeneous at their node level. Such nodes may comprise general purpose CPUs and accelerators (such as, GPU, or Intel Xeon Phi) that provide high performance with suitable energy-consumption characteristics. However, exploiting the available performance of heterogeneous architectures may be challenging. There are various parallel programming frameworks (such as, OpenMP, OpenCL, OpenACC, CUDA) and selecting the one that is suitable for a target context is not straightforward. In this paper, we study empirically the characteristics of OpenMP, OpenACC, OpenCL, and CUDA with respect to programming productivity, performance, and energy. To evaluate the programming productivity we use our homegrown tool CodeStat, which enables us to determine the percentage of code lines required to parallelize the code using a specific framework. We use our tools MeterPU and x-MeterPU to evaluate the energy consumption and the performance. Experiments are conducted using the industry-standard SPEC benchmark suite and the Rodinia benchmark suite for accelerated computing on heterogeneous systems that combine Intel Xeon E5 Processors with a GPU accelerator or an Intel Xeon Phi co-processor.

Ort, förlag, år, upplaga, sidor
Association for Computing Machinery (ACM), 2017. s. 1-6
Nyckelord [en]
heterogeneous computing, parallel computing, parallel programming models, comparative study, OpenCL, OpenACC, OpenMP, CUDA, Programming productivity, Performance, Energy consumption, GPU, Xeon-Phi, MeterPU
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
URN: urn:nbn:se:liu:diva-168604DOI: 10.1145/3110355.3110356ISBN: 9781450351164 (tryckt)OAI: oai:DiVA.org:liu-168604DiVA, id: diva2:1461471
Konferens
2017 Workshop on Adaptive Resource Management and Scheduling for Cloud Computing (ARMS-CC'17), Washington, DC, USA
Tillgänglig från: 2020-08-26 Skapad: 2020-08-26 Senast uppdaterad: 2020-08-27

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltext

Person

Kessler, Christoph

Sök vidare i DiVA

Av författaren/redaktören
Memeti, SuejbLi, LuKessler, Christoph
Av organisationen
Programvara och systemTekniska fakulteten
Datavetenskap (datalogi)

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetricpoäng

doi
isbn
urn-nbn
Totalt: 284 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf