Enable ROCm/HIP backend in code and build system
instead of MM_ENABLE_GPU, we need MM_ENABLE_CUDA and MM_ENABLE_ROCM (or _HIP). If we're keeping AccFFT, that should be used only under MM_ENABLE_CUDA. For CMake, this would affect BuildKokkos.cmake and BuildHeffte.cmake.