Solving global shallow water equations on heterogeneous supercomputers
Fig 5
Workflow of the hybrid partition.
The n + 1 accelerators process the inner part, while the CPU performs the interpolation and computes the halo and the extra outer part. C2A and A2C denote the data exchanges from the CPU to the accelerators and from the accelerators to the CPU, respectively. Part of the CPU time shown in the figure can be overlapped with the accelerator time.
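
The overlap described in the caption can be illustrated with a minimal sketch. It is not the paper's implementation: worker threads stand in for the accelerators, a pointwise `update` stands in for the real kernel, and the interpolation and halo computation are omitted. All names (`update`, `accelerator_task`, `hybrid_step`, `outer_width`, `n_accel`) are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def update(cells):
    """Hypothetical stand-in for one time-step kernel (pointwise update)."""
    return [2.0 * c + 1.0 for c in cells]


def accelerator_task(chunk):
    """Simulate one accelerator: C2A copy, kernel, A2C copy."""
    device_buf = list(chunk)         # C2A: CPU -> accelerator copy (simulated)
    device_out = update(device_buf)  # kernel on the accelerator (simulated)
    return list(device_out)          # A2C: accelerator -> CPU copy (simulated)


def hybrid_step(cells, outer_width, n_accel):
    """Update `cells`, giving the inner part to the accelerators and the
    outer part to the CPU, with the two running concurrently."""
    end = len(cells) - outer_width
    left, inner, right = cells[:outer_width], cells[outer_width:end], cells[end:]

    # Split the inner part into at most n_accel contiguous chunks.
    size = max(1, -(-len(inner) // n_accel))  # ceiling division
    chunks = [inner[i:i + size] for i in range(0, len(inner), size)]

    with ThreadPoolExecutor(max_workers=n_accel) as pool:
        futures = [pool.submit(accelerator_task, c) for c in chunks]
        # The CPU processes the outer part while the accelerators are busy.
        left_out, right_out = update(left), update(right)
        # Collect accelerator results in their original order.
        inner_out = [v for f in futures for v in f.result()]

    return left_out + inner_out + right_out


if __name__ == "__main__":
    cells = [float(i) for i in range(10)]
    print(hybrid_step(cells, outer_width=2, n_accel=3))
```

In this sketch the CPU work is submitted after the accelerator tasks, so it runs while they are in flight; only the CPU time that exceeds the accelerator time would add to the step time.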