Explore new solvers that require fewer FFTs
This could be either a different time-stepping scheme that allows larger time steps or an improved elasticity solver.
This is motivated by the observation that the non-FFT parts of the code perform better on GPUs than the FFTs do. If we can replace FFTs with additional general floating-point work, that's a win.
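As a rough way to compare candidates, one could count FFTs per unit of simulated time rather than per step. The sketch below is only an illustration: the function name `ffts_per_unit_time` and all the numbers are hypothetical, not measurements from this code.

```python
def ffts_per_unit_time(ffts_per_step: int, dt: float) -> float:
    """Return the number of FFTs needed to advance one unit of simulated time."""
    if dt <= 0:
        raise ValueError("dt must be positive")
    return ffts_per_step / dt


# Hypothetical numbers, chosen only to illustrate the trade-off.
# Baseline scheme: few FFTs per step, but a small stable time step.
baseline = ffts_per_unit_time(ffts_per_step=4, dt=0.125)

# Candidate scheme: more FFTs (and more non-FFT work) per step,
# but a four times larger time step.
candidate = ffts_per_unit_time(ffts_per_step=6, dt=0.5)

# The candidate is worthwhile on this metric if it needs fewer FFTs overall.
candidate_wins = candidate < baseline
```

This metric ignores the cost of the extra non-FFT work, which the observation above suggests is comparatively cheap on GPUs; a real comparison would need timings.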