Treffer: LMARSpy: A GPU‐Ready Nonhydrostatic Dynamical Core With a Sharpness‐Preserving Monotonicity Limiter and a Conservative Vertical Implicit Solver.

Title:
LMARSpy: A GPU‐Ready Nonhydrostatic Dynamical Core With a Sharpness‐Preserving Monotonicity Limiter and a Conservative Vertical Implicit Solver.
Authors:
Zhang, Weikang1,2 (AUTHOR), Chen, Xi1 (AUTHOR) chenxi@lasg.iap.ac.cn
Source:
Journal of Advances in Modeling Earth Systems. Oct2025, Vol. 17 Issue 10, p1-28. 28p.
Database:
GreenFILE

Weitere Informationen

Global numerical modeling is advancing into the era of kilometer‐scale, non‐hydrostatic simulations, while heterogeneous computing emerges as a pivotal trend in high‐performance computing (HPC). As a strong candidate for next‐generation global kilometer‐scale general circulation models, the A‐grid dynamical core based on the Low Mach number Approximate Riemann Solver must address key challenges: suppresses unphysical oscillations while preserving sharp gradients, mitigating Courant‐Friedrichs‐Lewy (CFL) constraints caused by vertically propagating sound waves, and making full use of heterogeneous computing power. This work presents LMARSpy, a GPU‐optimized non‐hydrostatic dry dynamical core with a sharpness‐preserving monotonicity limiter and a conservative vertical implicit solver. Designed for GPU efficiency, LMARSpy leverages a Python‐based HPC framework to ensure robust compatibility across heterogeneous computing environments. Benchmark tests validate the model's innovations: the limiter effectively suppresses unphysical oscillations while preserving sharpness, incurring only a 10.4% increase in GPU computational cost, as it is applied exclusively in the final sub‐step of the RK3 time‐stepping scheme; the Horizontally Explicit and Vertically Implicit scheme (employing a mass and potential temperature conservative vertical implicit solver) effectively mitigates vertical CFL limitations, delivering at least an order‐of‐magnitude efficiency improvement over explicit schemes when horizontal grid spacing substantially exceeds vertical spacing; and the Python‐based HPC framework enables seamless operation on both CPU and GPU architectures. When the computational scale is sufficient to fully utilize GPU memory, a single Nvidia RTX 3060 Ti GPU delivers 7.6 times the computational speed of a fully‐loaded 16‐thread Intel Core i5‐13490F CPU. Plain Language Summary: As climate change intensifies, extreme weather events are becoming increasingly frequent and severe, necessitating high‐resolution numerical simulations to accurately capture sharp atmospheric changes and complex physical processes. A key challenge lies in resolving these sharp changes while ensuring numerical solution stability and preventing computational errors. Additionally, differences between vertical and horizontal grid resolutions impose strict numerical stability constraints, thereby reducing computational efficiency. Traditional CPU‐based weather models also struggle to adapt to the development of modern heterogeneous computing systems. To address these issues, we developed LMARSpy, a GPU‐optimized weather model featuring numerical limiters that preserve sharp gradients, vertical schemes that relax stability constraints, and a Python‐based high‐performance computing framework. LMARSpy can seamlessly integrate AI surrogate models, marking an important step toward efficient and accurate weather forecasting. Key Points: The sharpness‐preserving monotonicity limiter suppresses unphysical oscillations while maintaining solution accuracyThe conservative vertical implicit solver removes vertical Courant‐Friedrichs‐Lewy restrictions, enabling competitive computational accelerationThe Python‐based high‐performance computing framework supporting GPU executions demonstrates desirable scalability [ABSTRACT FROM AUTHOR]

Copyright of Journal of Advances in Modeling Earth Systems is the property of Wiley-Blackwell and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)