Lazy Allocation and Transfer Fusion Optimization for GPU-based Heterogeneous Systems
Linköpings universitet, Institutionen för datavetenskap, Programvara och system. Linköpings universitet, Tekniska fakulteten. ORCID iD: 0000-0001-8976-0484
Linköpings universitet, Institutionen för datavetenskap, Programvara och system. Linköpings universitet, Tekniska fakulteten. ORCID iD: 0000-0001-5241-0026
2018 (English). In: 2018 26th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP 2018), IEEE, 2018, pp. 311-315. Conference paper, published paper (refereed)
Abstract [en]

We present two memory optimization techniques that improve the efficiency of data transfer over the PCIe bus in GPU-based heterogeneous systems: lazy allocation and transfer fusion optimization. Both are based on merging data transfers so that less overhead is incurred, thereby increasing transfer throughput and making accelerator usage profitable for smaller operand sizes as well. We provide the design and a prototype implementation of the two techniques in CUDA. Microbenchmarking results show that significant speedups can be achieved, especially for small and medium-sized operands. We also prove that our transfer fusion optimization algorithm is optimal.
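The intuition behind merging transfers can be illustrated with a simple cost model. The sketch below is not taken from the paper: it assumes each transfer costs a fixed overhead plus size divided by bandwidth, and the constants `LATENCY_S` and `BANDWIDTH_BPS`, as well as all function names, are hypothetical placeholders rather than measured values. It also ignores any cost of packing operands into one contiguous buffer.

```python
from typing import List

# Assumed, illustrative constants; not measurements from the paper.
LATENCY_S = 10e-6       # fixed overhead per transfer, in seconds
BANDWIDTH_BPS = 12e9    # sustained bus bandwidth, in bytes per second


def transfer_time(num_bytes: int) -> float:
    """Modeled time of one transfer: fixed overhead plus size / bandwidth."""
    return LATENCY_S + num_bytes / BANDWIDTH_BPS


def separate_time(sizes: List[int]) -> float:
    """Modeled time when every operand is sent in its own transfer."""
    return sum(transfer_time(s) for s in sizes)


def fused_time(sizes: List[int]) -> float:
    """Modeled time when all operands are merged into a single transfer."""
    return transfer_time(sum(sizes))


def speedup(sizes: List[int]) -> float:
    """Ratio of separate to fused transfer time under this model."""
    return separate_time(sizes) / fused_time(sizes)


if __name__ == "__main__":
    small = [4 * 1024] * 8            # eight 4 KiB operands
    large = [64 * 1024 * 1024] * 8    # eight 64 MiB operands
    print(f"small operands: {speedup(small):.2f}x")
    print(f"large operands: {speedup(large):.4f}x")
```

Under this model, fusing n transfers saves (n - 1) fixed overheads, so the relative gain is largest when the operands are small and the fixed overhead dominates, which is consistent with the trend the abstract reports.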

Place, publisher, year, edition, pages
IEEE, 2018. pp. 311-315
Series
Euromicro Conference on Parallel Distributed and Network-Based Processing, ISSN 1066-6192
Keywords [en]
adaptive message fusion; GPU; CUDA; lazy memory allocation; memory transfer optimization
Identifiers
URN: urn:nbn:se:liu:diva-151524
DOI: 10.1109/PDP2018.2018.00054
ISI: 000443807600045
ISBN: 978-1-5386-4975-6 (print)
OAI: oai:DiVA.org:liu-151524
DiVA, id: diva2:1250597
Conference
26th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)
Note

Funding agencies: SeRC; NSC / SNIC [SNIC 2016/5-6]

Available from: 2018-09-24. Created: 2018-09-24. Last updated: 2018-10-19.

Open Access in DiVA

Full text not available in DiVA
Authors

Li, Lu
Kessler, Christoph