liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Lazy Allocation and Transfer Fusion Optimization for GPU-based Heterogeneous Systems
Linköping University, Department of Computer and Information Science, Software and Systems. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0001-8976-0484
Linköping University, Department of Computer and Information Science, Software and Systems. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0001-5241-0026
2018 (English)In: 2018 26TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2018), IEEE , 2018, p. 311-315Conference paper, Published paper (Refereed)
Abstract [en]

We present two memory optimization techniques which improve the efficiency of data transfer over PCIe bus for GPU-based heterogeneous systems, namely lazy allocation and transfer fusion optimization. Both are based on merging data transfers so that less overhead is incurred, thereby increasing transfer throughput and making accelerator usage profitable also for smaller operand sizes. We provide the design and prototype implementation of the two techniques in CUDA. Microbench-marking results show that especially for smaller and medium-sized operands significant speedups can be achieved. We also prove that our transfer fusion optimization algorithm is optimal.

Place, publisher, year, edition, pages
IEEE , 2018. p. 311-315
Series
Euromicro Conference on Parallel Distributed and Network-Based Processing, ISSN 1066-6192
Keywords [en]
adaptive message fusion; GPU; CUDA; lazy memory allocation; memory transfer optimization
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:liu:diva-151524DOI: 10.1109/PDP2018.2018.00054ISI: 000443807600045ISBN: 978-1-5386-4975-6 (print)OAI: oai:DiVA.org:liu-151524DiVA, id: diva2:1250597
Conference
26th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)
Note

Funding Agencies|SeRC; NSC / SNIC [SNIC 2016/5-6]

Available from: 2018-09-24 Created: 2018-09-24 Last updated: 2018-10-19

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Authority records BETA

Li, Lu

Search in DiVA

By author/editor
Li, LuKessler, Christoph
By organisation
Software and SystemsFaculty of Science & Engineering
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 19 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf