GPGPU-Accelerated Parallel and Fast Simulation of Thousand-core Platforms

Christian Pinto , Shivani Raghav, Andrea Marongiu, Martino Ruggiero , David Atienza , Luca Benini
IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing 2011 (CCGRID2011) Newport Beach, California, 23-26 May 2011.
The multicore revolution and the ever-increasing complexity of computing systems is dramatically changing sys- tem design, analysis and programming of computing platforms. Future architectures will feature hundreds to thousands of simple processors and on-chip memories connected through a network-on-chip. Architectural simulators will remain primary tools for design space exploration, software development and performance evaluation of these massively parallel architec- tures. However, architectural simulation performance is a serious concern, as virtual platforms and simulation technology are not able to tackle the complexity of thousands of core future scenarios. The main contribution of this paper is the development of a new simulation approach and technology for many core processors which exploit the enormous par- allel processing capability of low-cost and widely available General Purpose Graphic Processing Units (GPGPU). The simulation of many-core architectures exhibits indeed a high level of parallelism and is inherently parallelizable, but GPGPU acceleration of architectural simulation requires an in-depth revision of the data structures and functional partitioning traditionally used in parallel simulation. We demonstrate our GPGPU simulator on a target architecture composed by several cores (i.e. ARM ISA based), with instruction and data caches, connected through a Network-on-Chip (NoC). Our experiments confirm the feasibility of our approach.
DOI: 10.1109/CCGrid.2011.64
http://infoscience.epfl.ch/record/164471/files/ccgrid11.pdf