liu.seSearch for publications in DiVA
Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Optimized Pipelined Parallel Merge Sort on the Cell BE
Dept. of Mathematics and Computer Science FernUniversität in Hagen, Germany.
Linköpings universitet, Tekniska högskolan. Linköpings universitet, Institutionen för datavetenskap, PELAB - Laboratoriet för programmeringsomgivningar.ORCID-id: 0000-0001-5241-0026
2008 (engelsk)Inngår i: 2nd Int. Workshop on Highly Parallel Processing on a Chip HPPC-2008,2008, Berlin: Springer , 2008Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Chip multiprocessors designed for streaming applications such as Cell BE offer impressive peak performance but suffer from limited bandwidth to off-chip main memory. As the number of cores is expected to rise further, this bottleneck will become more critical in the coming years. Hence, memory-efficient algorithms are required. As a case study, we investigate parallel sorting on Cell BE as a problem of great importance and as a challenge where the ratio between computation and memory transfer is very low. Our previous work led to a parallel mergesort that reduces memory bandwidth requirements by pipelining between SPEs, but the allocation of SPEs was rather ad-hoc. In our present work, we investigate mappings of merger nodes to SPEs. The mappings are designed to provide optimal trade-offs between load balancing, buffer memory consumption, and communication load on the on-chip bus. We solve this multi-objective optimization problem by deriving an integer linear programming formulation and compute Pareto-optimal solutions for the mapping of merge trees with up to 127 merger nodes. For mapping larger trees, we give a fast divide-and-conquer based approximation algorithm. We evaluate the sorting algorithm resulting from our mappings by a discrete event simulation.

sted, utgiver, år, opplag, sider
Berlin: Springer , 2008.
Emneord [en]
Parallel computing, multicore processor, computer architecture, algorithm engineering, parallel sorting, integer linear programming
HSV kategori
Identifikatorer
URN: urn:nbn:se:liu:diva-43703Lokal ID: 74568OAI: oai:DiVA.org:liu-43703DiVA, id: diva2:264563
Tilgjengelig fra: 2009-10-10 Laget: 2009-10-10 Sist oppdatert: 2018-01-12

Open Access i DiVA

Fulltekst mangler i DiVA

Andre lenker

http://www.ida.liu.se/~chrke/publ.html

Person

Kessler, Christoph

Søk i DiVA

Av forfatter/redaktør
Kessler, Christoph
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric

urn-nbn
Totalt: 125 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf