Implement overlap tiling to reduce MPI communication by performing m = GHOST_CELL_PADDING substeps per halo exchange while maintaining bitwise-identical results.

Implementation:
- Add overlap_tiling config parameter (default: 0)
- Dynamic bounds: substep s computes rows [G+(s-1), rows-G-(s-1)] to avoid using stale halo data from boundaries
- flux_y: internal ihi_flux_y = ihi + 1 for stencil requirements
- Rename wet_dry_qy_halo → wet_dry_qy (called every iteration when size > 1 for correct wet/dry handling at MPI interfaces)

Tested: All configurations produce bitwise-identical results vs baseline

Documentation:
- doc/overlap_tiling.rst: comprehensive feature documentation
- Updated configuration reference and variable index
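
Illustration of the dynamic bounds (a minimal sketch, not the project's code): the Python snippet below only evaluates the row-range formula quoted above for each substep. The function name `substep_bounds` is hypothetical, `G` is assumed to stand for GHOST_CELL_PADDING, and the message does not say whether the bounds are inclusive, so the pair is returned exactly as written.

```python
def substep_bounds(s, rows, G):
    """Row range for substep s (1-based), using the formula from the message.

    Hypothetical helper for illustration only. G is assumed to be
    GHOST_CELL_PADDING; inclusivity of the bounds is not specified.
    """
    if s < 1:
        raise ValueError("substep index s is 1-based")
    lo = G + (s - 1)         # lower bound moves one row inward per substep
    hi = rows - G - (s - 1)  # upper bound moves one row inward per substep
    return lo, hi


# Example values are assumptions: a 16-row local tile with G = 3,
# so m = G = 3 substeps are taken between halo exchanges.
rows, G = 16, 3
bounds = [substep_bounds(s, rows, G) for s in range(1, G + 1)]
for s, (lo, hi) in enumerate(bounds, start=1):
    print(f"substep {s}: rows [{lo}, {hi}]")
```

With these example values the range narrows by one row on each side per substep, which is how later substeps stay clear of rows whose halo data has gone stale.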