Hardware and Software Support for NUMA Computing on Configurable Emulated Shared Memory Architectures
2013 (English). In: 2013 IEEE 27th International Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), IEEE conference proceedings, 2013, pp. 640-647. Conference paper (Refereed)
Emulated shared memory (ESM) architectures are good candidates for future general-purpose parallel computers because they provide programmers with an easy-to-use, explicitly parallel, synchronous model of computation and avoid most of the performance bottlenecks present in current multicore architectures. To achieve full performance, however, applications must have enough thread-level parallelism (TLP). To address this problem, in our earlier work we introduced a class of configurable emulated shared memory (CESM) machines that provide a special non-uniform memory access (NUMA) mode for situations where TLP is limited, or for direct compatibility with legacy code based on sequential computing or the NUMA mechanism. Unfortunately, the earlier proposed CESM architecture does not integrate its different modes well: for example, the memories for the different modes are left isolated, and the programming interface is therefore non-integrated. In this paper we propose a number of hardware and software techniques to support NUMA computing in CESM architectures in a seamless way. The hardware techniques include three different NUMA-shared memory access mechanisms, and the software techniques provide a mechanism to integrate NUMA computation into the standard parallel random access machine (PRAM) operation of the CESM. The hardware techniques are evaluated on our REPLICA CESM architecture and compared to an ideal CESM machine making use of the proposed software techniques.
Place, publisher, year, edition, pages
IEEE conference proceedings, 2013. 640-647 p.
Keywords: Parallel computing, Multicore architecture, Parallel programming model, PRAM, GPU, Benchmarking, Performance analysis
Natural Sciences, Computer Science
Identifiers
URN: urn:nbn:se:liu:diva-102597; DOI: 10.1109/IPDPSW.2013.146; ISBN: 978-0-7695-4979-8; OAI: oai:DiVA.org:liu-102597; DiVA: diva2:679603
15th Workshop on Advances on Parallel and Distributed Computational Models (APDCM 2013), in conjunction with the 2013 IEEE 27th International Parallel and Distributed Processing Symposium, 20-24 May 2013, Boston, Massachusetts, USA