Treffer: Multi-GPU acceleration of large-scale density-based topology optimization.

Title:
Multi-GPU acceleration of large-scale density-based topology optimization.
Authors:
Herrero-Pérez, David1,2 (AUTHOR) david.herrero@upct.es, Martínez Castejón, Pedro J.2 (AUTHOR) pedro.castejon@upct.es
Source:
Advances in Engineering Software (1992). Jul2021, Vol. 157, pN.PAG-N.PAG. 1p.
Database:
Academic Search Index

Weitere Informationen

• Multi-GPU accelerated topology optimization using aggregation-based AMG. • Low setup cost and device memory requirements using efficient interpolation. • Multiple GPUs accelerate computing and increase the device memory available. • Significant speedups in comparison with efficient multi-core computing. • The use of mixed-precision increases performance. This work presents a parallel implementation of density-based topology optimization using distributed GPU computing systems. The use of multiple GPU devices allows us accelerating the computing process and increasing the device memory available for GPU computing. This increment of device memory enables us to address large models that commonly do not fit into one GPU device. The most modern scientific computers incorporate these devices to design energy-efficient, low-cost, and high-computing power systems. However, we should adopt the proper techniques to take advantage of the computational resources of such high-performance many-core computing systems. It is well-known that the bottleneck of density-based topology optimization is the solving of the linear elasticity problem using Finite Element Analysis (FEA) during the topology optimization iterations. We solve the linear system of equations obtained from FEA using a distributed conjugate gradient solver preconditioned by a smooth aggregation-based algebraic multigrid (AMG) using GPU computing with multiple devices. The use of aggregation-based AMG reduces memory requirements and improves the efficiency of the interpolation operation. This fact is rewarding for GPU computing. We evaluate the performance and scalability of the distributed GPU system using structured and unstructured meshes. We also test the performance using different 3D finite elements and relaxing operators. Besides, we evaluate the use of numerical approaches to increase the topology optimization performance. Finally, we present a comparison between the many-core computing instance and one efficient multi-core implementation to highlight the advantages of using GPU computing in large-scale density-based topology optimization problems. [ABSTRACT FROM AUTHOR]