1 / 10

Climate Simulation on ApGrid/TeraGrid at SC2003

Climate Simulation on ApGrid/TeraGrid at SC2003. Severs NCSA Cluster (225 CPU). Ninf-G. Client (AIST). Severs AIST Cluster (50 CPU) Titech Cluster (200 CPU) KISTI Cluster (25 CPU). Example - Hybrid QM/MD Simulation -. QM Server. QM Server. QM/MD simulation over the Pacific at SC2004.

Download Presentation

Climate Simulation on ApGrid/TeraGrid at SC2003

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Climate Simulation on ApGrid/TeraGrid at SC2003 Severs NCSA Cluster (225 CPU) Ninf-G Client (AIST) Severs AIST Cluster (50 CPU) Titech Cluster (200 CPU) KISTI Cluster (25 CPU)

  2. Example- Hybrid QM/MD Simulation -

  3. QM Server QM Server QM/MD simulation over the Pacific at SC2004 P32 (512 CPU) TCS (512 CPU) @ PSC Total number of CPUs: 1792 P32 (512 CPU) Ninf-G MD Client F32 (256 CPU) corrosion of Sillicon under stress Close-up view

  4. Total number of CPUs: 1793 • Total Simulation Time: 10 hour 20 min • # steps: 10 (= 7fs) • Average time / step: 1 hour • Size of generated files / step: 4.5GB 1 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8 9 10

  5. (some of) Lessons Learned • Practically impossible to occupy a large-scale single system for few weeks. • How can we long-run the simulation? • Faults (e.g. HDD crush, network down) cannot be avoided. • We don’t prefer manual restart. The simulation should be capable of automatic recovery from faults. • How can the simulation recover from faults?

  6. Objectives • Develop flexible, robust, and efficient Grid-enabled simulation. • Flexible -- allow dynamic resource allocation/migration, • robust -- detect errors and recover from faults automatically for long runs, and • efficient -- manage thousands of CPUs. • Verify our strategy through large-scale experiments. • Implemented Grid-enabled SIMOX (Separation by Implanted Oxygen) simulation • Run the simulation on Japan-US Grid testbed for few weeks.

  7. QM simulation based on DFT MD Simulation Hybrid QM/CL Simulation (1) • Enabling large scale simulation with quantum accuracy • Combining classical MD Simulation with quantum simulation • CL simulation • Simulating the behavior of atoms in the entire region • Based on the classical MD using an empirical inter-atomic potential • QM simulation • Modifying energy calculated by MD simulation only in the interesting regions • Based on the density functional theory (DFT)

  8. Hybrid QM/CL Simulation (2) • simulation algorithm • Each QM computation is • independent with each other • compute intensive • usually implemented as a MPI program initial set-up Calculate MD forces of QM+MD regions Data of QM atoms Calculate QM force of the QM region Calculate QM force of the QM region Calculate QM force of the QM region Calculate MD forces of QM region MD part QM part QM forces Update atomic positions and velocities

  9. Implementation of Grid-enabled Simulation- multi-scale QM/MD simulation using GridRPC and MPI -

  10. Client QM QM MD MD QM QM MPI GridRPC MD MD QM QM MPI QM QM MPI Server Approach to “gridify” applications The new programming approach, combining GridRPC with MPI, takes advantages of both programming models complementarily to run large-scale applications on the Grid for a long time. Grid RPC enhances the flexibility and robustness by; • dynamic allocation of server programs, and • detection of network/cluster trouble. MPI enhances the efficiency by; • highly parallel computing on a cluster for both client and server programs.

More Related