Co-exploration of NLA kernels and specification of Compute Elements in distributed memory CGRAs

Authors:: Mahadurkar, M., Merchant, F., Maity, A., Vatwani, K., Munje, I., Gopalan, N., Nandy, S. K., Narayan, R.
Book Title:: SAMOS 2014
Pages:: pp. 225–232
Date:: 2014
DOI:: 10.1109/SAMOS.2014.6893215
Language:: English
Abstract:: Coarse Grained Reconfigurable Architectures (CGRAs) are emerging as embedded application processing units in computing platforms for Exascale computing. Such CGRAs are distributed-memory, multi-core compute elements on a chip that communicate over a Network-on-Chip (NoC). Numerical Linear Algebra (NLA) kernels are key to several high-performance computing applications. In this paper we propose a systematic methodology to obtain the specification of Compute Elements (CEs) for such CGRAs. We analyze block Matrix Multiplication and block LU Decomposition algorithms in the context of a CGRA, and obtain theoretical bounds on the communication requirements and memory sizes for a CE. Support for high-performance custom computations common to NLA kernels is provided through custom function units (CFUs) in the CEs. We present results to justify the merits of such CFUs.