Loading
[OpenMP] Move the recording code to account for KernelLaunchEnvironment
We need to record late to account for the kernel launch environment as well as the potential changes in block and thread count.
We need to record late to account for the kernel launch environment as well as the potential changes in block and thread count.