C

cudaLaunchHostFunc with OpenMP Detach

Simple code to test OpenMP Detach functionality using cudaMemCpy2DAsync for asynchronous copies and cudaLaunchHostFunc to perform the callback which fulfills the OpenMP detached event.