Batched 3D-Distributed FFT Kernels Towards Practical DNS Codes

Bibliographic details
Title: Batched 3D-Distributed FFT Kernels Towards Practical DNS Codes
Authors: Toshiyuki Imamura, Masaaki Aoki, Mitsuo Yokokawa
Publication information: IOS Press, 2020.
Publication year: 2020
Description: This work introduces a new idea, the batched 3D-FFT, together with a survey of data decomposition methods and a review of state-of-the-art high-performance parallel FFT libraries. It further argues that the particular use of multiple FFTs is associated with batched execution. The batched 3D-FFT kernel, run on the K computer, shows a 45.9% speedup when N and P are 2048^3 and 128, respectively. The batched FFT allows the developer to take advantage of a flexible internal data layout and scheduling to improve overall performance.
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_________::a29379f3a2b76f70e61c1981360c1c3a
https://doi.org/10.3233/apc200038
Rights: OPEN
Accession number: edsair.doi...........a29379f3a2b76f70e61c1981360c1c3a
Database: OpenAIRE