Institute for Communication Technologies and Embedded Systems

Efficient Inter-Kernel Communication for OpenCL Database Operators on FPGAs

Authors:
Drewes, T. ,  Joseph, J. M. ,  Gurumurthy, B. ,  Broneske, D. ,  Saake, G. ,  Pionteck, T.
Book Title:
International Conference on Field-Programmable Technology (FPT)
Pages:
p.p. 266-269
Date:
2018
DOI:
10.1109/FPT.2018.00050
Language:
English
Abstract:
Many modern database engines use OpenCL to target heterogeneous hardware. Queries are evaluated by execution of chains of low-level operators. The common paradigm for OpenCL workloads facilitates communication between kernels using buffers in off-chip memory. This poses a severe performance limitation due to weak memory systems of FPGAs in contrast to the memory hierarchy available in CPUs and GPUs. To overcome this bottleneck, we propose the use of structural optimizations of kernel code. On-chip pipelining and code fusion are analyzed as alternatives to buffer-based inter-kernel communication. We assess the impact on resource utilization and system throughput and thereby demonstrate that properly structured code achieves a speedup of more than 4x over the default paradigm. This shows that it is essential for chains of kernels to consider not only optimization techniques for individual kernels, but also optimization of inter-kernel communication.
Download:
BibTeX