+5
−0
+3
−1
+2
−0
+116
−99
+233
−212
Loading
Overlap tiling implementation with constant bounds The previous overlap tiling implementation used variable bounds that extended into neighboring ranks' domains. This approach was incorrect and violated the constraint that flux_y requires ilo >= GHOST_CELL_PADDING. Changes: - Use constant bounds [G, rows-G) for all substeps in both baseline and overlap tiling modes - Fix flux_y default ihi from (rows-G+1) to (rows-G) for consistency - Simplify halo exchange logic: exchange every step for baseline, every G steps for overlap tiling - Clean up validation warnings for GHOST_CELL_PADDING=1 With this fix: - GHOST_CELL_PADDING=1: Bit-for-bit identical results (m=1, no communication reduction) - GHOST_CELL_PADDING>=2: Communication reduction at the cost of small numerical differences at MPI boundaries due to stale ghost cells Single-process runs are always bit-for-bit identical regardless of GHOST_CELL_PADDING value.