4. Modern GPU Architectures: FireStream 9250
AMD RV770 Architecture
800 SIMD superscalar processors
Supports SSE-like vec4 operations
IEEE single/double precision
1 TFLOPS peak single precision
200 GFLOPS peak double precision
1 GB GDDR3 on-board memory
Under 120 W maximum, 80 W typical
8-12 GFLOPS per Watt
MSRP $1,000
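
The efficiency range quoted on this slide appears to follow from dividing the peak single-precision rate by the two power figures. A quick check, assuming 1 TFLOPS = 1000 GFLOPS and treating the "under 120 W" bound as the maximum draw (the names below are illustrative, not from the slides):

```python
# Figures taken from the slide; names are hypothetical.
PEAK_SP_GFLOPS = 1000.0   # 1 TFLOPS peak single precision
MAX_WATTS = 120.0         # upper bound on power draw
TYPICAL_WATTS = 80.0      # typical power draw


def gflops_per_watt(gflops: float, watts: float) -> float:
    """Return throughput per unit power."""
    return gflops / watts


low = gflops_per_watt(PEAK_SP_GFLOPS, MAX_WATTS)       # about 8.3
high = gflops_per_watt(PEAK_SP_GFLOPS, TYPICAL_WATTS)  # 12.5
print(f"{low:.1f} to {high:.1f} GFLOPS per watt")
```

Both ends are based on peak rather than sustained throughput, so real workloads would likely land below this range.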
5. Why the Interest in GPUs? N-Body Simulation
N particles subject to gravitational force: the canonical O(N^2) algorithm
As much as 123x speedup, with 164 GFLOPS sustained (incl. div and sqrt)
GPU floating-point engines are powerful; the trick is to keep the pipeline full
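
To make the O(N^2) cost concrete, here is a minimal CPU sketch of the direct-sum gravitational calculation in plain Python. It is not the GPU implementation behind the numbers above. The function names, the softening parameter `eps`, and the unit choice `G = 1` are assumptions made for illustration.

```python
import math


def accelerations(pos, mass, G=1.0, eps=0.0):
    """Direct-sum gravitational acceleration on every particle.

    pos  : list of (x, y, z) tuples
    mass : list of masses, same length as pos
    eps  : softening length (assumed here; avoids division by zero
           when two particles come very close)
    """
    n = len(pos)
    acc = []
    for i in range(n):
        xi, yi, zi = pos[i]
        ax = ay = az = 0.0
        # Inner loop over all other particles: this is the O(N^2) part.
        for j in range(n):
            if i == j:
                continue
            dx = pos[j][0] - xi
            dy = pos[j][1] - yi
            dz = pos[j][2] - zi
            r2 = dx * dx + dy * dy + dz * dz + eps * eps
            # One sqrt and one divide per interaction.
            inv_r3 = 1.0 / (r2 * math.sqrt(r2))
            s = G * mass[j] * inv_r3
            ax += s * dx
            ay += s * dy
            az += s * dz
        acc.append((ax, ay, az))
    return acc


def pair_interactions(n):
    """Number of force evaluations the double loop performs."""
    return n * (n - 1)


# Two unit masses one unit apart pull toward each other equally.
demo = accelerations([(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)], [1.0, 1.0])
print(demo)
```

Each outer-loop iteration is independent of the others, which is why this algorithm maps well onto many parallel processors: every particle's inner loop is a long stream of arithmetic with a small, regular memory footprint.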