1 / 9

Claude Tadonki Mines ParisTech – CRI – Mathématiques et Systèmes

Claude Tadonki Mines ParisTech – CRI – Mathématiques et Systèmes Laboratoire de l’Accélérateur Linéaire/IN2P3/CNRS France claude.tadonki@u-psud.fr. 2nd Workshop on Architecture and Multi-Core Applications

cortez
Download Presentation

Claude Tadonki Mines ParisTech – CRI – Mathématiques et Systèmes

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Claude Tadonki Mines ParisTech – CRI – Mathématiques et Systèmes Laboratoire de l’Accélérateur Linéaire/IN2P3/CNRS France claude.tadonki@u-psud.fr 2nd Workshop on Architecture and Multi-Core Applications 23rd International Symposium on Computer Architecture and High Performance Computing (SBAC PAD 2011) October, 26 – 29 2010, Vitória, Espírito Santo, Brazil.

  2. Large Scale Kronecker Product on SupercomputersC. TADONKI The Kronecker product (définition and applications) 2nd Workshop on Architecture and Multi-Core Applications 23rd International Symposium on Computer Architecture and High Performance Computing (SBAC PAD 2011) October, 26 – 29 2010, Vitória, Espírito Santo, Brazil.

  3. Large Scale Kronecker Product on SupercomputersC. TADONKI The Kronecker product (properties and problem formulation) 2nd Workshop on Architecture and Multi-Core Applications 23rd International Symposium on Computer Architecture and High Performance Computing (SBAC PAD 2011) October, 26 – 29 2010, Vitória, Espírito Santo, Brazil.

  4. Large Scale Kronecker Product on SupercomputersC. TADONKI The Kronecker (complexity and recurrenceequation) • Forming the matrix first would • require a huge amount of memory • yield lot of redundant multiplication, which in total would be Using the so-called normal factorization, we could derive an optimal scheme which reduces the number of floatting point multiplication to 2nd Workshop on Architecture and Multi-Core Applications 23rd International Symposium on Computer Architecture and High Performance Computing (SBAC PAD 2011) October, 26 – 29 2010, Vitória, Espírito Santo, Brazil.

  5. Large Scale Kronecker Product on SupercomputersC. TADONKI The Kronecker product and its applications 2nd Workshop on Architecture and Multi-Core Applications 23rd International Symposium on Computer Architecture and High Performance Computing (SBAC PAD 2011) October, 26 – 29 2010, Vitória, Espírito Santo, Brazil.

  6. Large Scale Kronecker Product on SupercomputersC. TADONKI Performance issues and heuristic for finding a good topology • The total (parallel) execution time depends on • the sizes of the matrices • the gap between virtual topology and physical topology • the way the task is splitted among the processors (decomposition) 2nd Workshop on Architecture and Multi-Core Applications 23rd International Symposium on Computer Architecture and High Performance Computing (SBAC PAD 2011) October, 26 – 29 2010, Vitória, Espírito Santo, Brazil.

  7. Large Scale Kronecker Product on SupercomputersC. TADONKI Performances Weconsider N = 6 matrices of orders30, 36, 32, 18, 24, 16, thus L = 159 252 480 • We see that • our heuristic yields a significant improvment compare to trivial decompositions • we start loosing the scalabily when the number of cores increases (com) • We the turn to hybrid implementation 2nd Workshop on Architecture and Multi-Core Applications 23rd International Symposium on Computer Architecture and High Performance Computing (SBAC PAD 2011) October, 26 – 29 2010, Vitória, Espírito Santo, Brazil.

  8. Large Scale Kronecker Product on SupercomputersC. TADONKI Performance of the hybridimplementation • We see that • the hybrid implementation is better for larger number of cores • for smaller number of cores, the SM implemntation exacerbates on cache misses • Need to investigate on the compromise and a better memory layout. 2nd Workshop on Architecture and Multi-Core Applications 23rd International Symposium on Computer Architecture and High Performance Computing (SBAC PAD 2011) October, 26 – 29 2010, Vitória, Espírito Santo, Brazil.

  9. Large Scale Kronecker Product on SupercomputersC. TADONKI END & QUESTIONS 2nd Workshop on Architecture and Multi-Core Applications 23rd International Symposium on Computer Architecture and High Performance Computing (SBAC PAD 2011) October, 26 – 29 2010, Vitória, Espírito Santo, Brazil.

More Related