1 / 18

Weighted Matrix Reordering and Parallel Banded Preconditioners for Nonsymmetric Linear Systems

Murat Manguo ğ lu * , Mehmet Koyut ü rk ** , Ananth Grama * and Ahmed Sameh * * Purdue University ** Case Western Reserve University. Weighted Matrix Reordering and Parallel Banded Preconditioners for Nonsymmetric Linear Systems. Support: DARPA, NSF, Intel, NCSA.

moriah
Download Presentation

Weighted Matrix Reordering and Parallel Banded Preconditioners for Nonsymmetric Linear Systems

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Murat Manguoğlu*, Mehmet Koyutürk**, Ananth Grama* and Ahmed Sameh* *Purdue University **Case Western Reserve University Weighted Matrix Reordering and Parallel Banded Preconditioners for Nonsymmetric Linear Systems Support: DARPA, NSF, Intel, NCSA

  2. A computational loop Integration Newton Iteration Linear system solvers k k t

  3. Motivation • New architectures increasingly rely on parallelism • Concurrency and localization play an important role • Algorithms for such platforms must account for concurrency and memory references

  4. Implications: General Sparse Solvers • Maximal use of dense kernels • Development of methods that optimize concurrency • A banded matrix is a natural candidate as a preconditioner

  5. Preprocessing to Obtain the Preconditioner (BiCGStab/GMRES is used as the iterative solver) • ILUPACK : Multilevel ILU [Bollhöfer] • http://www.math.tu-berlin.de/ilupack/ • ILUT : Incomplete LU Factorization from Sparsekit [Saad] • http://www-users.cs.umn.edu/~saad/software/SPARSKIT/sparskit.html • ILUT-I : Improved ILUT [Benzi, et. al. ] • reorder using HSL-MC64 to maximize the product of the diagonals and scale the matrix • apply symmetric RCM reordering • get the incomplete factorization via ILUT

  6. WSO : Our proposed method • reorder using HSL-MC64 to make the diagonal zero free • reorder |A| + |AT | using HSL-MC73 to place larger elements closest to the main diagonal • extract a banded preconditioner, such that %99.9 percent of the weight is inside the band • factorize the banded preconditioner

  7. Test Problems

  8. Comparison to ILUPACK AMF/PQ preconditoners on an uniprocessor [of Sgi-Altix] Outer Iterative Solver: unrestarted GMRES ILUPACK Parameters : droptol : 1e-1 , bound for inv(L), inv(U) : 10 , elbow space : 100

  9. Comparison to ILUT and Improved-ILUT Preconditioners on an uniprocessor [of Clovertown] Outer Iterative Solver : BiCGStab

  10. WSO: Factorization+Solve time Scalability Speed improvement over uniprocessor timing on Sgi-Altix

  11. Reordering and Solve Times of 3 Different Systems on an Uniprocessor

  12. Reservoir Simulation(SPE10 benchmarks) • Problem #1 : N= 2,244,000 • Problem #2 : N= 2,462,265 • “banded systems” →Simple/no reordering to extract a central band as a preconditioner • Results on an SGI-Altix

  13. Reservoir Simulation #1 33 Algebraic Multigrid time: 31.4 seconds (AMD dual core) 21 11 5 2.5 1.4

  14. Reservoir Simulation #1

  15. Reservoir Simulation #1

  16. Reservoir Simulation #2 58 29 14 7 3 2 1

  17. Reservoir Simulation #2

  18. Summary and Future Work • Weighted reordering is an effective method for obtaining a banded preconditioner • Overall the method we propose is both reliable and scalable • Spectral reordering is relatively inexpensive for extracting banded preconditioners for solving several systems with “roughly the same” matrix of coefficients. • Parallel weighted reordering schemes needs to be developed

More Related