Date of Award
Master of Science (MS)
The fundamental operation of matrix multiplication is ubiquitous across a myriad of disciplines. Yet, the identification of new optimizations for matrix multiplication remains relevant for emerging hardware architectures and heterogeneous systems. Frameworks such as OpenCL enable computation orchestration on existing systems, and its availability using the Intel High Level Synthesis compiler allows users to architect new designs for reconfigurable hardware using C/C++. Using the HARPv2 as a vehicle for exploration, we investigate the utility of several of the most notable matrix multiplication optimizations to better understand the performance portability of OpenCL and the implications for such optimizations on this and future heterogeneous architectures. Our results give targeted insights into the applicability of best practices that were for existing architectures when used on emerging heterogeneous systems.
Professor Roger Chamberlain, Professor Christopher Gill, Professor Ning Zhang
Computer and Systems Architecture Commons, Hardware Systems Commons, Numerical Analysis and Scientific Computing Commons, Software Engineering Commons, Systems Architecture Commons, Theory and Algorithms Commons, VLSI and Circuits, Embedded and Hardware Systems Commons
Permanent URL: https://doi.org/10.7936/dbf4-6j92