a simple computer consists of a processor cpu central processing unit memory and i o l.
Download
Skip this Video
Loading SlideShow in 5 Seconds..
A Simple Computer consists of a Processor (CPU-Central Processing Unit), Memory, and I/O PowerPoint Presentation
Download Presentation
A Simple Computer consists of a Processor (CPU-Central Processing Unit), Memory, and I/O

Loading in 2 Seconds...

play fullscreen
1 / 39

A Simple Computer consists of a Processor (CPU-Central Processing Unit), Memory, and I/O - PowerPoint PPT Presentation


  • 810 Views
  • Uploaded on

A Simple Computer consists of a Processor (CPU-Central Processing Unit), Memory, and I/O. Arithmetic Logic Unit. Input. Control Unit. Memory. Output. Registers. Processor Or CPU. I/O. Basic Functional Units of a Computer.

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about 'A Simple Computer consists of a Processor (CPU-Central Processing Unit), Memory, and I/O' - nico


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
a simple computer consists of a processor cpu central processing unit memory and i o
A Simple Computer consists of a Processor (CPU-Central Processing Unit), Memory, and I/O

Arithmetic

Logic

Unit

Input

Control

Unit

Memory

Output

Registers

Processor

Or CPU

I/O

basic functional units of a computer
Basic Functional Units of a Computer
  • Input – accepts coded information from human operators, from electromechanical devices (such as keyboards), or from other digital medium via digital communication lines.
  • The information received is either stored in the memory or immediately used by the arithmetic and logic unit (ALU) to perform the desired operations.
  • The results are sent back out through the output medium.
  • All actions are coordinated through the control unit.
the information
The Information
  • Categorized as either instructions or data
  • Instructions (or machine instructions) are explicit commands that
    • Govern the transfer of information within a computer as well as between the computer and its I/O devices.
    • Specify the arithmetic and logic operations to be performed.
programs
Programs
  • A list of instructions that performs a task is called a program.
  • Usually the program is stored in memory.
  • The program fetches the instructions from memory, one after another, and performs the desired operations.
  • The computer is completely controlled by the stored program, except for possible interruption by an operator or by I/O devices connected to the machine.
  • Data are numbers and encoded characters that are used as operands by the instructions.
inside the cpu
Inside the CPU
  • Control Unit (CU)

coordinates the sequencing of steps involved in executing machine instructions

  • Arithmetic Logic Unit (ALU)

performs arithmetic and logical operations

  • Registers

storage locations

  • Clock

synchronizes the internal operations of the CPU with the other system components

bus structure
Bus Structure
  • Bus - a group of parallel wires that transfer information from one part of the computer to another.
    • Control Bus

synchronizes the actions of all of the devices attached to the system bus.

    • Address Bus

passes the addresses of instructions and data between the CPU and memory (or I/O).

    • Data Bus

transfers instructions and data between the CPU and memory (or I/O).

bus sizes
Bus Sizes
  • For the 8086 Processor
    • Address Bus – 20 bits
      • can access 1M of memory
      • Addresses defined as $00000-$FFFFF
    • Data Bus – 16 bits (16-bit processor)
      • A word is 16 bits
      • Each word is byte addressable
notes from intel family chart
Notes from Intel Family Chart
  • Notice that 386 – Pentium 4 are 32-bit processors (32-bit data bus – 4 bytes)
  • Notice that 386 and beyond have 32-bit address bus can access (4G of memory addresses).
machine cycle
Machine Cycle
  • Most basic unit of time for machine instructions
  • = the time required for one complete clock cycle.
  • Machine instructions require at least 1 clock cycle to execute. Most require more.
  • Wait states – empty clock cycles of machine execution time (due to memory access time being slower than speed of clock).
instruction execution cycle
Instruction Execution Cycle
  • If using Memory operand (mov ax, 0A69Bh)
    • Calculate address of operand
    • Place address of operand on address bus
    • Wait for memory to get operand and pass it on

data bus

the data path of a typical von neumann machine
The data path of a typical von Neumann Machine

A + B

A

Registers

B

ALU Input Register

A

B

ALU Input Bus

ALU

ALU Output Register

A + B

instruction execution cycle the cpu executes each instruction in a series of small steps
Instruction Execution CycleThe CPU executes each instruction in a series of small steps
  • Fetch the next instruction from memory into the instruction register.
  • Change the program counter to point to the next instruction.
  • Decode the instruction.
  • Fetch any memory operands necessary into a CPU register.
  • Execute the instruction.
  • Store output operand into a CPU register.
execution of von neumann machines
Execution of von Neumann Machines

To fetch the next instruction while the first is executing would speed up the machine

Instructions are stored in prefetch buffers (registers), to be accessed more quickly than waiting for fetch from memory. Prefetching divides instruction execution up into two parts: fetching and actual execution.

Pipelining divides up instruction execution into many parts, each one handled by a piece of dedicated hardware, all which can run in parallel.

2 stage pipelining
2-stage Pipelining
  • Execution Unit: executes the microcode instructions.
  • Bus Interface Unit: accesses memory and provides I/O
a five stage pipeline
A Five-stage Pipeline

A five-stage pipeline.

The state of each stage as a function of time.

how fast does this machine run
How Fast Does This Machine Run?
  • Suppose that the cycle time of this machine is 2 nsec.
  • Then it takes 10nsec for an instruction to progress all the way through the five-stage pipeline.
  • Does the machine run at 100MIPS (1/10n)?
  • No, at every clock cycle (2nsec) a new instruction is completed, so the actual rate of processing is 500MIPS.
how many cycles are required to execute n instructions pipelined versus non pipelined systems
How many cycles are required to execute n instructions?(Pipelined Versus Non-Pipelined Systems)
  • For a system with k stages
  • In non-pipelined systems, n instructions require (n*k) cycles to process.
    • 5 instructions require 5 clock cycles
  • Using a pipelined system with k pipeline stages, n instructions require (k + (n-1)) cycles to complete.
    • 5 instructions require (5 + (5-1)) = 9 clock cycles

(*refer to slide #14)

tradeoffs
Tradeoffs

Pipelining allows a tradeoff between

    • Latency
      • How long it takes to execute an instruction
      • Latency = nT nanosec (where cycle time is T nanosec and the number of stages is n)
  • And
    • Processor Bandwidth
      • How many MIPS the CPU has
      • Bandwidth = 1000/T MIPS

*logically we should measure CPU bandwidth in BIPS or GIPS since we are measuring T in nanosec, but nobody does this.

ia 32 processor pipelining 6 stage execution cycle
IA-32 Processor Pipelining(6-stage Execution Cycle)
  • Bus Interface Unit: accesses memory and provides I/O
  • Code Prefetch Unit: receives instructions from the BIU and inserts them into a holding area (instruction queue)
  • Instruction Decode Unit: decodes machine instructions from the prefetch queue and translates them into microcode.
  • Execution Unit: executes the microcode instructions.
  • Segment Unit: translates logical addresses into linear addresses and performs protection checks
  • Paging Unit: translates linear addresses into physical addresses, performs page protection checks and keeps a list of recently accessed pages
superscalar architecture
Superscalar Architecture
  • If one pipeline is good, then two pipelines must be better.
  • Parallel paths exist through which different instructions can be executed in parallel.
  • It is possible to start the execution of several instructions in every clock cycle.
  • The logical correctness of programs must be maintained.
dual five stage pipelines with a common code prefetch unit
Dual five-stage pipelines with a common Code Prefetch Unit

The code prefetch unit fetches pairs of instructions together and puts each one into

Its own pipeline, complete with its own ALU for parallel operation.

superscalar processor with 5 functional units
Superscalar processor with 5 functional units

Four pipelines duplicates too much hardware. Instead, use a single pipeline and give it multiple functional units. This assumes that the S3 stage can issue instructions faster than the S4 stage can execute them. (Pentium II)

parallelism
Parallelism
  • So far we have dealt with instruction-level parallelism.
  • There is also processor-level parallelism
    • Array processors
    • Multiprocessors
    • Multicomputers
cisc complex instruction set computer
CISCComplex Instruction Set Computer
  • A large number of variable length instructions (more than 128)
  • Multiple addressing modes
  • A small number of internal processor registers
  • Instructions that require multiple numbers of clock cycles to execute
8086 a real cisc
8086(A Real CISC)
  • Over 3000 different instruction forms, each requiring anywhere from one to six bytes
  • Nine different addressing modes are supported
  • The processor only has eight general purpose registers
  • Instruction execution times range from 2 clock cycles to more than 80 cycles for ASCII adjust for multiplication instruction.
intel s i860 risc processor
Intel’s i860 RISC Processor
  • 82 instructions, each 32 bits in length
  • Four addressing modes
  • 32 general purpose registers
  • All instructions execute in one clock cycle
why hasn t risc won out
Why hasn’t RISC won out?
  • Backward compatibility (companies have spent billions of dollars on Intel processor software).
  • Intel has built CPU cores with RISC like structure that executes the simplest and most common instructions in a simgle data path, while interpreting the more complex instructions in the usual CISC way.
design principles of modern computers
Design Principles of Modern Computers
  • All instructions are directly executed in hardware
  • Maximize the rate at which instructions are issued
  • Instructions should be easy to decode
  • Only loads and stores should be able to reference memory
  • Provide plenty of registers
application specific microprocessors digital signal processors
Application Specific MicroprocessorsDigital Signal Processors
  • Previously, analog signals had to be handled with discrete circuits (op-amps, capacitors, inductors, and resistors forming filters, amplifiers, etc…)
  • Now low-cost analog-to-digital and digital-to-analog converters are available.
  • => thus we have digital signal processing systems
dsp systems
DSP systems
  • DSPs are used to perform repetitive complex mathematical computations on the converted analog data.
  • One computation may require as many as 500,000 add-multiply operations.
dsp architecture
DSP Architecture
  • Data and instructions are stored in two different memory areas each with their own buses (Harvard Architecture)
  • Hardware multipliers and adders are built into the processor and optimized to perform a calculation in a single clock cycle.
  • Arithmetic pipelining is used so that several instructions can be operated on at once.
  • Hardware DO loops are provided to speed up repetitive operations
  • Multiple (serial) I/O ports are provided for communication with other processors.
dsp applications
DSP Applications
  • Mulitmedia sound cards (used to compress speech and music signals)
  • DSP can be reprogrammed (allows some sound cards to double as a modem
  • Cellular phones
  • Speech and image compression
  • Optical character recognition
  • Video conferencing
operating system
Operating System
  • A collection of programs (a large program), that are used to control the sharing of and interaction among various computer units as they execute application programs.
  • Performs the tasks required to assign computer resources to individual application programs.
    • Assigning memory and magnetic disk space to program and disk files
    • Moving data between memory and disk units
    • Handling I/O operations
slide38
Example of How A Operating System Manages the execution of more than one application program at the same time
  • Application program has been compiled from a high level language form into machine language form and is stored on disk
  • Assume somewhere in the program, a data file must be read, perform some computation on the data, and print results .
    • Transfer file into memory
    • When transfer is complete, begin execution
    • When point in program is reached that data file is needed, the program requests the operating system to transfer the data file from the disk to memory.
      • The OS performs this task and passes execution control back to the application program, which then proceeds to perform the required computation.
      • When the computation is completed and the results are ready to be printed,