The objective here is to develop a benchmark for multi node multi-gpu runs for PyFR [1].
The benchmarks are based on the supplementary material of [2]. Minor changes to the files had to be introduced to keep up with the PyFR version. The target architecture is Power8 with 4 GPUs each. There are some specific LSF Data Mover commands due to the adoption of DMD at Daresbury.
WIP
WIP
References
-
B.C. Vermeire, F.D. Witherden, P.E. Vincent, On the utility of GPU accelerated high-order methods for unsteady flow simulations: A comparison with industry-standard tools, Journal of Computational Physics, Volume 334, 1 April 2017, Pages 497-521, ISSN 0021-9991, http://dx.doi.org/10.1016/j.jcp.2016.12.049.