ECE PowerPoint PPT Presentation

  • Uploaded on
  • Presentation posted in: General

Computer Architecture . Instruction Set Architecture (IBM 360)

Download Presentation


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.

- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -

Presentation Transcript

1. ECE/CS 752: Advanced Computer Architecture I Instructor:Mikko H Lipasti Spring 2008 University of Wisconsin-Madison Lecture notes based on slides created by John Shen, Mark Hill, David Wood, Guri Sohi, and Jim Smith

2. Computer Architecture Instruction Set Architecture (IBM 360) … the attributes of a [computing] system as seen by the programmer. I.e. the conceptual structure and functional behavior, as distinct from the organization of the data flows and controls, the logic design, and the physical implementation. -- Amdahl, Blaaw, & Brooks, 1964 Machine Organization (microarchitecture) ALUS, Buses, Caches, Memories, etc. Machine Implementation (realization) Gates, cells, transistors, wires

3. 752 In Context Prior courses 352 – gates up to multiplexors and adders 354 – high-level language down to machine language interface or instruction set architecture (ISA) 552 – implement logic that provides ISA interface CS 537 – provides OS background (co-req. OK) This course – 752 – covers advanced techniques Modern processors that exploit ILP Modern memory systems that exploit MLP Additional courses ECE 757 covers parallel and multiprocessing ECE 755 covers VLSI design

4. Why Take 752? To become a computer designer Alumni of this class helped design your computer To learn what is under the hood of a computer Innate curiosity To better understand when things break To write better code/applications To write better system software (O/S, compiler, etc.) Because it is intellectually fascinating! What is the most complex man-made device?

5. Computer Architecture Exercise in engineering tradeoff analysis Find the fastest/cheapest/power-efficient/etc. solution Optimization problem with 100s of variables All the variables are changing At non-uniform rates With inflection points Only one guarantee: Today’s right answer will be wrong tomorrow Two high-level effects: Technology push Application Pull

6. Technology Push What do these two intervals have in common? 1776-1999 (224 years) 2000-2001 (2 years)

7. Performance Growth Unmatched by any other industry ! [John Crawford, Intel] Doubling every 18 months (1982-1996): 800x Cars travel at 44,000 mph and get 16,000 mpg Air travel: LA to NY in 22 seconds (MACH 800) Wheat yield: 80,000 bushels per acre

8. Technology Push Technology advances at varying rates E.g. DRAM capacity increases at 60%/year But DRAM speed only improves 10%/year Creates gap with processor frequency! Inflection points Crossover causes rapid change E.g. enough devices for single-chip processor Current issues Power consumption Reliability Variability

9. Application Pull Corollary to Moore’s Law: Cost halves every two years In a decade you can buy a computer for less than its sales tax today. –Jim Gray Computers cost-effective for National security – weapons design Enterprise computing – banking Departmental computing – computer-aided design Personal computer – spreadsheets, email, web Pervasive computing – prescription drug labels

10. Application Pull What about the future? E.g. weather forecasting computational demand Must dream up applications that are not cost-effective today Virtual reality, telepresence Web agents, social networking Wireless, location-aware Proactive (beyond interactive) w/ sensors Recognition/Mining/Synthesis (RMS) ??? This is your job!

11. Trends Moore’s Law for device integration Chip power consumption Single-thread performance trend [source: Intel] Sep 18, 2007 Mikko Lipasti-University of Wisconsin

12. Dynamic Power Static CMOS: current flows when active Combinational logic evaluates new inputs Flip-flop, latch captures new value (clock edge)? Terms C: capacitance of circuit wire length, number and size of transistors V: supply voltage A: activity factor f: frequency Future: Fundamentally power-constrained Sep 18, 2007 Mikko Lipasti-University of Wisconsin

13. Sep 18, 2007 Mikko Lipasti-University of Wisconsin Multicore Mania First, servers IBM Power4, 2001 Then desktops AMD Athlon X2, 2005 Then laptops Intel Core Duo, 2006 Soon, your cellphone ARM MPCore, prototypes for a while now

14. Why Multicore Sep 18, 2007 Mikko Lipasti-University of Wisconsin End of the “panacea” part of the talkEnd of the “panacea” part of the talk

15. Amdahl’s Law f – fraction that can run in parallel 1-f – fraction that must run serially Sep 18, 2007 Mikko Lipasti-University of Wisconsin

16. Fixed Chip Power Budget Amdahl’s Law Ignores (power) cost of n cores Revised Amdahl’s Law More cores ? each core is slower Parallel speedup < n Serial portion (1-f) takes longer Also, interconnect and scaling overhead Sep 18, 2007 Mikko Lipasti-University of Wisconsin

17. Fixed Power Scaling Fixed power budget forces slow cores Serial code quickly dominates Sep 18, 2007 Mikko Lipasti-University of Wisconsin

18. Focus of this Course How to make serial portion fast Currently out of vogue, but not for long! State-of-the-art processor design Pipelining review Superscalar, out-of-order processors Branch prediction Advanced memory systems & I/O Multicore and multithreaded processors

19. Instruction Set Processing The ART and Science of Instruction-Set Processor Design [Gerrit Blaauw & Fred Brooks, 1981] ARCHITECTURE (ISA) programmer/compiler view Functional appearance to user/system programmer Opcodes, addressing modes, architected registers, IEEE floating point IMPLEMENTATION (µarchitecture) processor designer view Logical structure or organization that performs the architecture Pipelining, functional units, caches, physical registers REALIZATION (Chip) chip/system designer view Physical structure that embodies the implementation Gates, cells, transistors, wires

20. Iron Law

21. Iron Law Instructions/Program Instructions executed, not static code size Determined by algorithm, compiler, ISA Cycles/Instruction Determined by ISA and CPU organization Overlap among instructions reduces this term Time/cycle Determined by technology, organization, clever circuit design

22. Our Goal Minimize time, which is the product, NOT isolated terms Common error to miss terms while devising optimizations E.g. ISA change to decrease instruction count BUT leads to CPU organization which makes clock slower Bottom line: terms are inter-related

23. Textbooks Required course textbook: John Paul Shen and Mikko H. Lipasti, Modern Processor Design: Fundamentals of Superscalar Processors, First edition, McGraw-Hill. Recommended textbook: Mark Hill, Norm Jouppi, and Guri Sohi. Readings in Computer Architecture. Morgan Kauffman, 1999

24. Expected Background ECE/CS 552 or equivalent Design simple uniprocessor Simple instruction sets Organization Datapath design Hardwired/microprogrammed control Simple pipelining Basic caches High-level programming experience C/UNIX skills – modify simulators

25. Course Context Assume canonical RISC ISA Register-register ALU ops Load from memory (cache) Store to memory Branches, jumps, calls, returns Modern CISC (x86) processors Translate to equivalent primitives Later: how the translation is done © 2005 Mikko Lipasti 25

26. About This Course Readings Posted on website (one list for each midterm) Make sure you keep up with these! Not necessarily discussed in lecture. Lecture Attendance required Some lectures will be delivered on line Homeworks Due at beginning of class time Must be individual effort

27. About This Course Pop Quizzes Not announced ahead of time Will probably drop one for final grade to accommodate occasional absence Make sure you are ahead on readings! Exams Midterm 1: W 3/12 in class Midterm 2: Sun 5/11 2:45PM (final exam time slot) Keep up with reading list!

28. About This Course Course Project Research project Replicate results from a paper Or attempt something novel Final project includes a written report and an oral presentation Reports due 5/9 Presentations during class time 5/5 and 5/7

29. About This Course Grading Homework and Quizzes 20% Midterm 1 25% Midterm 2 25% Project 30% Web Page (check regularly)

30. About This Course Office Hours Prof. Lipasti: EH 4613, T 2-4, W9-10:30 Communication channels E-mail to instructor, class e-mail list Web page Office hours

31. About This Course Other Resources Computer Architecture Colloquium – Tuesday 4-5PM, 1325 CSS Computer Engineering Seminar – Friday 12-1PM, EH4610 Architecture mailing list: WWW Computer Architecture Page

32. About This Course Lecture schedule: MWF 11:00-11:50 Proposed change: MWF 11:00-12:15 Cancel 1 of 3 lectures (on average) As long as no hard conflicts 32

33. Tentative Schedule

34. Wrapping Up No lecture Friday 1/25 View lecture online instead Pipelining Review, 3 lectures with audio narration Be prepared for discussion/pop quiz on Monday 34

  • Login