liu.seSök publikationer i DiVA
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
BlockLib: A Skeleton Library for Cell Broadband Engine
Linköpings universitet.
Linköpings universitet, Tekniska högskolan. Linköpings universitet, Institutionen för datavetenskap, PELAB - Laboratoriet för programmeringsomgivningar.
Linköpings universitet, Tekniska högskolan. Linköpings universitet, Institutionen för datavetenskap, PELAB - Laboratoriet för programmeringsomgivningar.ORCID-id: 0000-0001-5241-0026
2008 (Engelska)Ingår i: Proceedings - International Conference on Software Engineering, New York, USA: ACM , 2008, s. 7-14Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Cell Broadband Engine is a heterogeneous multicore processor for high-performance computing and gaming. Its architecture allows for an impressive peak performance but, at the same time, makes it very hard to write efficient code. The need to simultaneously exploit SIMD instructions, coordinate parallel execution of the slave processors, overlap DMA memory traffic with computation, keep data properly aligned in memory, and explicitly manage the very small on-chip memory buffers of the slave processors, leads to very complex code. In this work, we adopt the skeleton programming approach to abstract from much of the complexity of Cell programming while maintaining high performance. The abstraction is achieved through a library of parallel generic building blocks, called BlockLib. Macro-based generative programming is used to reduce the overhead of genericity in skeleton functions and control code size expansion. We demonstrate the library usage with a parallel ODE solver application. Our experimental results show that BlockLib code achieves performance close to hand-written code and even outperforms the native IBM BLAS library in cases where several slave processors are used.

Ort, förlag, år, upplaga, sidor
New York, USA: ACM , 2008. s. 7-14
Nyckelord [en]
generic parallel programming, parallel computing, generative programming, software library, multicore processor, software components
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
URN: urn:nbn:se:liu:diva-43692DOI: 10.1145/1370082.1370088Lokalt ID: 74556ISBN: 978-1-60558-031-9 (tryckt)OAI: oai:DiVA.org:liu-43692DiVA, id: diva2:264552
Konferens
30th International Conference on Software Engineering, ICSE 2008 Co-located Workshops - 1st International Workshop on Multicore Software Engineering, IWMSE
Tillgänglig från: 2009-10-10 Skapad: 2009-10-10 Senast uppdaterad: 2018-01-12

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltexthttp://portal.acm.org/citation.cfm?doid=1370082.1370088

Person

Eriksson, MattiasKessler, Christoph

Sök vidare i DiVA

Av författaren/redaktören
Eriksson, MattiasKessler, Christoph
Av organisationen
Tekniska högskolanPELAB - Laboratoriet för programmeringsomgivningar
Datavetenskap (datalogi)

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetricpoäng

doi
isbn
urn-nbn
Totalt: 361 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf