160 likes | 314 Views
Chapter 1. Performance & Technology Trends. Read Sections 1.5, 1.6, and 1.8. Section 1.5: The Power Wall. Power Trends. Clock rates hit a “power wall”. In CMOS IC technology. × 1000. × 30. 5V → 1V. The power wall. Performance was always improved by increasing frequency (up to 2004)
E N D
Chapter 1 Performance & Technology Trends Read Sections 1.5, 1.6, and 1.8
Section 1.5: The Power Wall Chapter 1 — Computer Abstractions and Technology — 2
Power Trends • Clock rates hit a “power wall” • In CMOS IC technology ×1000 ×30 5V → 1V Chapter 1 — Computer Abstractions and Technology — 3
The power wall • Performance was always improved by increasing frequency (up to 2004) • However by 2006, companies could not reduce generated power and remove more heat • Hence performance improvement could not be achieved by increasing frequency because of the increased power generated >>>>> THE POWER WALL • How else can we improve performance? Chapter 1 — Computer Abstractions and Technology — 4
Read Section 1.6: The Sea Change The Switch to Multiprocessors Chapter 1 — Computer Abstractions and Technology — 5
Uniprocessor Performance Uniprocessor performance is constrained by power, instruction-level parallelism, memory latency Chapter 1 — Computer Abstractions and Technology — 6
A Sea Change is at Hand • The power challenge has forced a change in the design of microprocessors • Since 2002 the rate of improvement in the response time of programs on desktop computers has slowed from a factor of 1.5 per year to less than a factor of 1.2 per year • As of 2006 all desktop and server companies are shipping microprocessors with multiple processors – cores – per chip • Plan is to double the number of cores per chip per generation (about every two years) Plan not followed!!
8 Multicore microprocessors • Require explicitly parallel programming • In single core microprocessors, hardware implemented instruction level parallelism to execute multiple instructions IN PARALLEL • Instruction level parallelism is hidden from the programmer • Parallel programming is hard (harder) to do. Involves: • Programming for performance • Load balancing • Optimizing communication and synchronization With the introduction of multicore microprocessors, The Free Lunch Era Ended !!!
Pitfalls and Fallacies • Pitfalls: Easily made mistakes • Fallacies: • Errors • Myths • …
Pitfall: Amdahl’s Law • Pitfall: Improving an aspect of a computer and expecting a proportional improvement in overall performance • Example: multiply operations account for 80 seconds of a 100 seconds run time of a program • How much improvement in multiply performance to get theprogram to run 5 times faster (i.e. in {100/5} = 20s)? • Can’t be done!
Amdahl’s Law Best Speedupoverall you could ever hope to do: fraction enhanced ExTimeold ExTimenew
Amdahl’s Law example • New CPU 10X faster • I/O bound server, so 60% time waiting for I/O • Apparently, its human nature is to be attracted by 10X faster, vs. keeping in perspective that it is just 1.56X faster 13
Amdahl’s Law example:Make the common case fast • Fraction = 0.1, Speedup = 10 • Fraction = 0.9, Speedup = 10 14
Pitfall: MIPS as a Performance Metric • MIPS: Millions of Instructions Per Second • Doesn’t account for • Differences in ISAs between computers • Differences in complexity between instructions 15
Pitfall: MIPS as a Performance Metric (cont.) • How should MIPS be computed? • It is not the maximum theoretical MIPS quoted by the manufacturer. • CPI varies between programs on a given CPU