22 May 2015 Adaptive OpenCL libraries for platform portability
Paul A. Fox, Allyssa L. Batten, Marcus Hayes, Eric J. Kelmelis
Abstract
The OpenCL API provides an abstract mechanism for massively parallel programming on a very wide range of hardware, including traditional CPUs, GPUs, accelerator devices, FPGAs, and more. However, these hardware architectures and platforms function quite differently, so writing OpenCL applications that are usefully portable is challenging. Certain considerations are required in developing an effectively portable OpenCL library that enables parallel application development without fully separate code paths for each target platform. The device detection and characterization provided by the OpenCL API yield valuable information for making optimization decisions at runtime. In particular, the effects of memory affinity change depending on the memory organization of the device architecture. Work partitioning and assignment depend on the device execution model, in particular the types of parallel execution supported and the synchronization primitives available. These considerations, in turn, affect the selection and invocation of kernel code. For certain devices, platform-specific libraries are available, while others can benefit from kernel code generated from the specified device parameters. By parameterizing an algorithm according to how these considerations affect performance, a combination of device parameters can be used to produce an execution strategy that improves performance for that device or collection of devices.
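The idea of turning queried device parameters into an execution strategy can be illustrated with a minimal sketch. It is not the authors' library: the `DeviceInfo` fields stand in for values a host program could read with `clGetDeviceInfo` (the corresponding OpenCL query names appear in the comments), while `Strategy`, `choose_strategy`, the kernel variant names, and the numeric thresholds are all hypothetical choices made for illustration. No OpenCL runtime is called, so the device descriptions are filled in by hand.

```python
from dataclasses import dataclass
from enum import Enum


class DeviceType(Enum):
    CPU = "cpu"
    GPU = "gpu"
    ACCELERATOR = "accelerator"


@dataclass(frozen=True)
class DeviceInfo:
    """Hand-filled stand-in for values obtainable via clGetDeviceInfo."""
    device_type: DeviceType      # CL_DEVICE_TYPE
    compute_units: int           # CL_DEVICE_MAX_COMPUTE_UNITS
    max_work_group_size: int     # CL_DEVICE_MAX_WORK_GROUP_SIZE
    dedicated_local_mem: bool    # CL_DEVICE_LOCAL_MEM_TYPE == CL_LOCAL
    host_unified_memory: bool    # CL_DEVICE_HOST_UNIFIED_MEMORY


@dataclass(frozen=True)
class Strategy:
    """Hypothetical execution strategy derived from device parameters."""
    kernel_variant: str
    work_group_size: int
    num_work_groups: int
    copy_to_device: bool


def choose_strategy(dev: DeviceInfo, problem_size: int) -> Strategy:
    """Pick a strategy from device parameters (illustrative heuristics only)."""
    if problem_size <= 0:
        raise ValueError("problem_size must be positive")

    # Memory affinity: with host-unified memory an explicit copy to the
    # device is assumed to be unnecessary.
    copy_to_device = not dev.host_unified_memory

    # Kernel selection: a tiled kernel is assumed to pay off only when the
    # device has dedicated local memory. The caps 256 and 64 are arbitrary.
    if dev.dedicated_local_mem:
        kernel_variant = "tiled_local"
        work_group_size = min(dev.max_work_group_size, 256)
    else:
        kernel_variant = "flat_global"
        work_group_size = min(dev.max_work_group_size, 64)

    # Work partitioning: enough work-groups to cover the problem (ceiling division).
    num_work_groups = (problem_size + work_group_size - 1) // work_group_size

    return Strategy(kernel_variant, work_group_size, num_work_groups, copy_to_device)


if __name__ == "__main__":
    # Two made-up device descriptions; real values would come from the runtime.
    gpu = DeviceInfo(DeviceType.GPU, 16, 1024, True, False)
    cpu = DeviceInfo(DeviceType.CPU, 8, 8192, False, True)
    for dev in (gpu, cpu):
        print(dev.device_type.value, choose_strategy(dev, 1000))
```

Keeping the decision in one pure function of the device parameters means the same host code path can serve every device, with only the returned strategy varying. A real library would presumably tune the thresholds per algorithm from measured performance rather than fix them as constants.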
© (2015) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Paul A. Fox, Allyssa L. Batten, Marcus Hayes, and Eric J. Kelmelis "Adaptive OpenCL libraries for platform portability", Proc. SPIE 9478, Modeling and Simulation for Defense Systems and Applications X, 947806 (22 May 2015); https://doi.org/10.1117/12.2177410
KEYWORDS
Computer programming
Performance modeling
Detection and tracking algorithms
Field programmable gate arrays
Algorithm development
Computing systems
Databases