Some Experiences on Parallel Finite Element Computations Using IBM/SP2


  1. Some Experiences on Parallel Finite Element Computations Using IBM/SP2
  Yuan-Sen Yang and Shang-Hsien Hsieh
  National Taiwan University, Taipei, Taiwan, R.O.C.

  2. Contents
  • Parallel Substructure Method
  • Three Issues:
    • Mesh Partitioning
    • Nodal Renumbering within Substructures
    • Solution of Interface DOFs
  • Conclusions

  3. Parallel Substructure Method
  • Partition a structure into several substructures.
  • Assign each substructure to a processor.
  • Perform matrix assembly and static condensation within each substructure.

  4. Parallel Substructure Method (cont.)
  • Solve for the displacements of the interface DOFs.
  • Solve for the displacements of the internal DOFs in each substructure.
  • Perform force recovery in each substructure.
  (A sketch of the condensation and recovery steps appears below.)
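To make the condensation and recovery steps concrete, here is a minimal dense-matrix sketch of static condensation (the Schur complement) for one substructure. The function names, the dense NumPy representation, and the index-array interface are illustrative assumptions for this sketch, not the implementation used in this work.

```python
import numpy as np

def condense(K, f, internal, interface):
    """Static condensation of one substructure (Schur complement).

    K and f are the substructure stiffness matrix and load vector;
    `internal` and `interface` are index arrays for the two DOF sets.
    Returns the condensed interface stiffness and load vector.
    (Hypothetical helper for illustration; not the paper's code.)
    """
    Kii = K[np.ix_(internal, internal)]
    Kib = K[np.ix_(internal, interface)]
    Kbi = K[np.ix_(interface, internal)]
    Kbb = K[np.ix_(interface, interface)]
    X = np.linalg.solve(Kii, Kib)            # Kii^{-1} * Kib
    y = np.linalg.solve(Kii, f[internal])    # Kii^{-1} * f_i
    return Kbb - Kbi @ X, f[interface] - Kbi @ y

def recover_internal(K, f, internal, interface, u_b):
    """Back-substitute for the internal displacements once u_b is known."""
    Kii = K[np.ix_(internal, internal)]
    Kib = K[np.ix_(internal, interface)]
    return np.linalg.solve(Kii, f[internal] - Kib @ u_b)
```

The condensation of each substructure is independent of the others, which is what lets the method assign one substructure per processor and defer all coupling to the interface system.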

  5. Mesh Partitioning
  • Requirements:
    • Automatic partitioning.
    • Handling of regular and irregular meshes.
    • Balanced distribution of the number of elements.
    • Minimization of the number of interface nodes.
  (A toy partitioner illustrating these goals is sketched below.)
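For illustration only, the following is a toy greedy graph-growing partitioner over an element adjacency graph. It balances element counts per part, which, as the next slide notes, does not by itself balance the condensation work. The function name and data layout are assumptions for this sketch; this is not the GR, RST, or METIS code actually used in the work.

```python
from collections import deque

def greedy_partition(adjacency, n_parts):
    """Toy greedy graph-growing partitioner (illustrative sketch).

    `adjacency` maps each element id to a list of its neighbours. Each
    part is grown breadth-first from a seed element until it holds
    roughly n / n_parts elements, balancing element counts across parts.
    """
    n = len(adjacency)
    target = -(-n // n_parts)                 # ceiling division
    part = {}
    unassigned = set(adjacency)
    for p in range(n_parts):
        size = 0
        queue = deque()
        while unassigned and size < target:
            if not queue:                     # seed (also restarts on disconnected pieces)
                queue.append(next(iter(unassigned)))
            e = queue.popleft()
            if e not in unassigned:
                continue
            unassigned.discard(e)
            part[e] = p
            size += 1
            queue.extend(nb for nb in adjacency[e] if nb in unassigned)
    for e in unassigned:                      # defensive: sweep any remainder
        part[e] = n_parts - 1
    return part
```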

  6. Experiences (Mesh Partitioning)
  • The GR, RST, and METIS partitioning algorithms are used in this work.
  • A balanced distribution of the number of elements is achieved.
  • The condensational loads, however, are unbalanced.
  (Figure: an RST partitioning result.)
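Both observations can be quantified directly from a partition: element counts per part, and the number of nodes shared by elements in more than one part. The helper below is a hypothetical sketch with an assumed data layout, not code from the paper.

```python
import numpy as np

def partition_stats(part, elem_nodes, n_parts):
    """Element balance and interface-node count for a given partition.

    `part` maps element -> part id; `elem_nodes` maps element -> node ids.
    A node touched by elements in more than one part is an interface node.
    (Hypothetical helper for illustration.)
    """
    sizes = np.zeros(n_parts, dtype=int)
    touched = {}                              # node -> set of parts touching it
    for e, p in part.items():
        sizes[p] += 1
        for node in elem_nodes[e]:
            touched.setdefault(node, set()).add(p)
    n_interface = sum(1 for parts in touched.values() if len(parts) > 1)
    return sizes, n_interface
```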

  7. Substructural Nodal Renumbering
  • Purpose: to reduce the skyline of the substructure matrix.
  • Constraint: interface nodes must be numbered after internal nodes.
  • The Reverse Cuthill-McKee algorithm (RCM; Liu & Sherman, 1975) is modified and used (a sketch of the idea follows).
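One way to respect the constraint is to apply RCM only to the subgraph of internal nodes and append the interface nodes afterwards. The sketch below does this with SciPy's reverse_cuthill_mckee; it illustrates the idea only and is not the authors' modified RCM.

```python
import numpy as np
from scipy.sparse.csgraph import reverse_cuthill_mckee

def renumber_substructure(adj, internal, interface):
    """Renumber one substructure's nodes to shrink the matrix skyline.

    `adj` is a symmetric sparse CSR adjacency matrix over all nodes of
    the substructure. RCM reorders the internal nodes among themselves;
    the interface nodes are appended so they are numbered last, as
    static condensation requires. Returns old node ids in new order.
    (Illustrative sketch, not the paper's modified RCM.)
    """
    internal = np.asarray(internal)
    sub = adj[internal][:, internal].tocsr()  # adjacency among internal nodes only
    perm = reverse_cuthill_mckee(sub, symmetric_mode=True)
    return np.concatenate([internal[perm], np.asarray(interface)])
```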

  8. Experiences (Substructure Nodal Renumbering)
  • Helps to reduce the condensational loads.
  • Rarely balances the condensational loads among processors.
  (Figures: 30STORY model, RST partitioning, 4 processors; matrix skylines without substructure nodal renumbering vs. with the modified-RCM substructure nodal renumbering.)

  9. Solution of Interface DOFs
  • Achieving high parallel efficiency for a linear equation solver is not an easy task.
  • As NP (the number of processors) increases, NI (the number of interface DOFs) increases, and the parallel efficiency decreases.

  10. Experiences (Solution of Interface DOFs)
  • In this work, a sequential direct method (Cholesky decomposition) is used (a sketch follows below).
  • NI is affected by both NP and the performance of the partitioning algorithm.
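As an illustration of this step, the sketch below assembles the condensed substructure contributions into the global interface system and solves it with a dense Cholesky factorization via SciPy. The data layout and function name are assumptions; the paper's solver is sequential and direct, but its actual implementation is not shown here.

```python
import numpy as np
from scipy.linalg import cho_factor, cho_solve

def solve_interface(condensed):
    """Assemble and solve the global interface system sequentially.

    `condensed` is a list of (Kc, fc, dofs) triples, one per
    substructure, where `dofs` maps local condensed equations to global
    interface DOFs. A dense Cholesky factorization stands in for the
    sequential direct solver; the assembled interface stiffness is
    assumed symmetric positive definite. (Illustrative sketch.)
    """
    n = 1 + max(d for _, _, dofs in condensed for d in dofs)
    K = np.zeros((n, n))
    f = np.zeros(n)
    for Kc, fc, dofs in condensed:
        K[np.ix_(dofs, dofs)] += Kc
        f[dofs] += fc
    return cho_solve(cho_factor(K), f)        # interface displacements u_b
```

Because this solve is sequential, it becomes the bottleneck as NP grows and the interface system gets larger, which motivates the parallel interface solvers called for in the conclusions.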

  11. Conclusions
  • Mesh partitioning:
    • The computational load of each processor is not necessarily proportional to its number of elements.
    • Minimizing the number of interface nodes reduces the size of the interface equations and usually improves the parallel efficiency.
  • Substructural nodal renumbering:
    • Always reduces the condensational loads.
    • But rarely balances the condensational loads among processors.
  • Parallel solution of interface DOFs:
    • High-efficiency parallel solvers for the interface equations are needed to improve the efficiency of the parallel substructure method.

  12. Acknowledgement
  • This research is supported by the National Science Council of the R.O.C. under Project Nos. NSC 86-2211-E-002-029 and NSC 87-2211-E-002-034.
  • The parallel computations are performed on the IBM/SP2 computers of the National Center for High-performance Computing, Hsin-Chu, Taiwan, R.O.C.

  13. IBM/SP2 in NCHC
  • Model: IBM POWER2 SuperChip (P2SC)
  • Peak floating-point performance: 480 MFLOPS
  • Memory: 128 Mbytes per node
