
Computer Architecture Lecture 4 — 17th May, 2006








  1. Computer Architecture Lecture 4 17th May, 2006 Abhinav Agarwal Veeramani V.

  2. Recap • Simple Pipeline – hazards and solutions • Data hazards • Static compiler techniques – load delay slot, etc. • Hardware solutions – Data forwarding, out-of-order execution, register renaming • Control hazards • Static compiler techniques • Hardware speculation through branch predictors • Structural hazards • Increase hardware resources • Superscalar out-of-order execution • Memory organisation EE Summer Camp '06

  3. Memory Organization in processors • Caches inside the chip • Faster – ‘Closer’ • SRAM cells • They contain recently-used data • They contain data in ‘blocks’

  4. Rationale behind caches • Principle of spatial locality • Principle of temporal locality • Replacement policy (LRU, LFU, etc.) • Principle of inclusivity
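The two locality principles on this slide can be made concrete with a small C sketch (illustrative, not from the slides): both functions below compute the same matrix sum, but the row-major walk touches consecutive addresses, so every word of each fetched cache block is used, while the column-major walk strides a full row apart between accesses and wastes most of each block.

```c
#define N 256

/* Row-by-row: consecutive accesses hit consecutive addresses, so a
 * fetched cache block is fully consumed before being evicted
 * (spatial locality). */
long sum_row_major(int m[N][N]) {
    long s = 0;
    for (int i = 0; i < N; i++)
        for (int j = 0; j < N; j++)
            s += m[i][j];
    return s;
}

/* Column-by-column: successive accesses are N*sizeof(int) bytes
 * apart, so on large matrices most of each fetched block goes
 * unused and the miss rate is far higher. */
long sum_col_major(int m[N][N]) {
    long s = 0;
    for (int j = 0; j < N; j++)
        for (int i = 0; i < N; i++)
            s += m[i][j];
    return s;
}
```

The two routines always agree on the result; only their cache behaviour differs, which is exactly why compilers and programmers prefer traversal orders that match the memory layout.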

  5. Outline • Instruction Level Parallelism • Thread-level Parallelism • Fine-Grain multithreading • Simultaneous multithreading • Sharable resources & Non-sharable resources • Chip Multiprocessor • Some design issues

  6. Instruction Level Parallelism • Overlap execution of many instructions • ILP techniques try to reduce data and control dependencies • Issue independent instructions out of order
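What "independent instructions" buys the hardware can be sketched in C (an illustration, not from the slides): in the first loop every add depends on the previous one, forming a serial chain; in the second, the two accumulators have no dependence on each other, so an out-of-order superscalar core can execute both adds in the same cycle.

```c
/* Dependent chain: each add needs the previous result, so the core
 * can retire at most one add per add-latency regardless of issue
 * width. */
long sum_serial(const long *a, int n) {
    long s = 0;
    for (int i = 0; i < n; i++)
        s += a[i];
    return s;
}

/* Two independent accumulators: the adds into s0 and s1 carry no
 * dependence on each other, exposing ILP to the out-of-order core.
 * (Assumes n is even, to keep the sketch short.) */
long sum_unrolled(const long *a, int n) {
    long s0 = 0, s1 = 0;
    for (int i = 0; i < n; i += 2) {
        s0 += a[i];
        s1 += a[i + 1];
    }
    return s0 + s1;
}
```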

  7. Thread Level Parallelism • Instructions from two different threads are mutually independent • Better utilization of functional units • Multi-threaded performance improves drastically
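A minimal POSIX-threads sketch of the idea (assumes pthreads; not from the slides): the two threads below run fully independent instruction streams over disjoint halves of an array, so a multithreaded or multicore machine always has ready instructions to issue even when one stream stalls.

```c
#include <pthread.h>
#include <stddef.h>

typedef struct {
    const long *a;
    int lo, hi;   /* half-open range [lo, hi) this thread sums */
    long sum;
} chunk;

/* Each thread sums its own slice; no data is shared between the
 * two streams, so their instructions are mutually independent. */
static void *sum_chunk(void *p) {
    chunk *c = p;
    c->sum = 0;
    for (int i = c->lo; i < c->hi; i++)
        c->sum += c->a[i];
    return NULL;
}

long sum_two_threads(const long *a, int n) {
    chunk c0 = { a, 0, n / 2, 0 };
    chunk c1 = { a, n / 2, n, 0 };
    pthread_t t0, t1;
    pthread_create(&t0, NULL, sum_chunk, &c0);
    pthread_create(&t1, NULL, sum_chunk, &c1);
    pthread_join(t0, NULL);
    pthread_join(t1, NULL);
    return c0.sum + c1.sum;
}
```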

  8. A simple pipeline source: EV8 DEC Alpha Processor, (c) Intel

  9. Superscalar pipeline source: EV8 DEC Alpha Processor, (c) Intel

  10. Speculative execution source: EV8 DEC Alpha Processor, (c) Intel

  11. Fine Grained Multithreading source: EV8 DEC Alpha Processor, (c) Intel

  12. Simultaneous Multithreading source: EV8 DEC Alpha Processor, (c) Intel

  13. Out of Order Execution source: EV8 DEC Alpha Processor, (c) Intel

  14. SMT pipeline source: EV8 DEC Alpha Processor, (c) Intel

  15. Resources – Replication required • Program counters • Register maps

  16. Resources – Replication not required • Register file (rename space) • Instruction queue • Branch predictor • First and second level caches, etc.
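The split between replicated and shared resources on slides 15–16 can be pictured as a data-structure sketch (hypothetical layout and field names, not from the slides): each hardware thread carries its own PC and rename map, while one physical register file, instruction queue, predictor, and cache hierarchy serve all threads, with rename tags keeping their in-flight results apart.

```c
#define SMT_THREADS 4

/* Per-thread (replicated) state: each hardware thread needs its own
 * program counter and its own map from architectural to physical
 * registers. */
typedef struct {
    unsigned long pc;
    unsigned char reg_map[32];   /* architectural reg -> physical reg */
} thread_ctx;

/* The core: the thread contexts are replicated, everything else is a
 * single shared pool.  (Instruction queue, predictor tables, and
 * caches would sit alongside phys_regs; elided for brevity.) */
typedef struct {
    thread_ctx threads[SMT_THREADS]; /* replicated per thread */
    long phys_regs[128];             /* shared physical register file */
} smt_core;

/* Fetch for thread tid consults that thread's own context... */
thread_ctx *context_for(smt_core *c, int tid) {
    return &c->threads[tid];
}
```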

  17. Chip multiprocessor • Transistor counts keep going up • Put more than one core on the chip • The cores still share the caches

  18. Some design issues • Trade-off in choosing the cache size • Power vs. performance • Superpipelining trade-off • Higher clock frequency vs. speculation penalty + power • Power consumption

  19. Novel techniques for power • Clock gating • Run non-critical elements at a slower clock • Reduce voltage swings (voltage of operation) • Sleep Mode / Standby Mode • Dynamic Voltage and Frequency Scaling (DVFS)
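A toy DVFS policy gives a feel for the last bullet (an illustrative sketch with invented frequency steps, not any real governor): since dynamic power scales roughly with f·V², and a lower frequency permits a lower supply voltage, the governor picks the lowest frequency step whose capacity still covers the observed demand.

```c
/* Hypothetical frequency ladder for the sketch, in MHz. */
static const int freq_steps_mhz[] = { 600, 1200, 1800, 2400 };
#define N_STEPS 4

/* util_pct: observed busy fraction (0..100) at the top frequency.
 * Returns the lowest step that still meets that demand, saving
 * power when the core is underutilized. */
int choose_freq_mhz(int util_pct) {
    int demand = freq_steps_mhz[N_STEPS - 1] * util_pct / 100;
    for (int i = 0; i < N_STEPS; i++)
        if (freq_steps_mhz[i] >= demand)
            return freq_steps_mhz[i];
    return freq_steps_mhz[N_STEPS - 1];
}
```

Real governors (e.g. the Linux cpufreq framework) add hysteresis and per-platform voltage tables, but the core decision is this same demand-to-step mapping.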
