Testing offloading with a simple zgemm loop.
Source code and results for the paper "Three Practical Task Schedulers for Easy Maximum Parallelism" by David Rogers, Software: Practice and Experience, 2021.
Workbench Analysis Sequence Processors (WASP) provides a standard way to access analysis sequence input/data through robust, verbose lexical analysis and parse-tree generation.
VTK-m mirror that runs in OLCF CI
Vector addition codes written for multiple parallel programming models.
This project isolates a CUDA separable-compilation issue that occurs with legacy CMake when CUDA is treated as a third-party library (TPL). The issue does not occur with modern CMake, where CUDA is enabled as a language.
CMake-enabled testing for C++ and Fortran