1 / 46

School of Computing Science Simon Fraser University CMPT 300: Operating Systems I

School of Computing Science Simon Fraser University CMPT 300: Operating Systems I Ch 8: Memory Management Dr. Mohamed Hefeeda. Objectives. To provide a detailed description of various ways of organizing memory hardware

csilla
Download Presentation

School of Computing Science Simon Fraser University CMPT 300: Operating Systems I

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. School of Computing Science Simon Fraser University CMPT 300: Operating Systems I Ch 8: Memory Management Dr. Mohamed Hefeeda

  2. Objectives • To provide a detailed description of various ways of organizing memory hardware • To discuss various memory-management techniques, including paging and segmentation • To provide a detailed description of the Intel Pentium, which supports both pure segmentation and segmentation with paging

  3. Background • Program must be brought (from disk) into memory and placed within a process to be run • Main memory and registers are the only storage that CPU can access directly • CPU generates a stream of addresses • Memory does not distinguish between instructions and data • Register is accessed in one CPU clock • Main memory can take many cycles • Cache sits between main memory and CPU registers to accelerate memory access

  4. Binding of Instructions and Data to Memory • Address binding can happen at: • Compile time: • If memory location is known apriori, absolute codecan be generated • must recompile code if starting location changes • Load time: • Must generate relocatable codeif memory location is not known at compile time • Execution time: • Binding delayed until run time • process can be moved during its execution from one memory segment to another • Need hardware support for address maps • Most common (more on this later)

  5. Multi-step Processing of a User Program

  6. Logical vs. Physical Address Space • Logical address • generated by the CPU • also referred to as virtual address • User programs deal with logical addresses; never see the real physical addresses • Physical address • address seen by the memory unit • Both are the same if address binding is done in • Compile time or • Load time • But they differ if address binding is done in • Execution time •  we need to map logical addresses to physical ones

  7. Memory Management Unit (MMU) • MMU: Hardware device that maps virtual address to physical address • MMU also ensures memory protection • Protect OS from user processes, and protect user processes from one another • Example: Relocation register

  8. Memory Protection: Base and Limit Registers • A pair of base and limit registers define the logical address space • Later, we will see other mechanisms (paging hardware)

  9. Memory Allocation • Main memory is usually divided into two partitions: • Part for the resident OS • usually held in low memory with interrupt vector • Another for user processes • OS allocates memory to processes • Contiguous memory allocation • Process occupies a contiguous space in memory • Non-contiguous memory allocation (paging) • Different parts (pages) of the process can be scattered in the memory

  10. Contiguous Allocation (cont’d) • When a process arrives, OS needs to find a large-enough hole in memory to accommodate it • OS maintains information about: • allocated partitions • free partitions (holes) • Holes have different sizes and scattered in the memory • Which hole to choose for a process? OS OS OS OS process 5 process 5 process 5 process 5 process 9 process 9 process 8 process 10 process 2 process 2 process 2 process 2

  11. Contiguous Allocation (cont’d) • Which hole to choose for a process? • First-fit: Allocate first hole that is big enough • Best-fit: Allocate smallest hole that is big enough • must search entire list, unless ordered by size • Produces the smallest leftover hole • Worst-fit: Allocate the largest hole; • must also search entire list • Produces the largest leftover hole • Performance: • First-fit and best-fit perform better than worst-fit in terms of speed and storage utilization • What are pros and cons of contiguous allocation? • Pros: Simple to implement • Cons: Memory fragmentation

  12. Contiguous Allocation: Fragmentation • External Fragmentation • Total memory space exists to satisfy a request, but it is not contiguous • Solutions to reduce external fragmentation? • Memory compaction • Shuffle memory contents to place all free memory together in one large block • Compaction is possible only if relocation is dynamic, and is done at execution time • Internal Fragmentation • Occurs when memory has fixed-size partitions (pages) • Allocated memory may be larger than requested • the difference is internal to the allocated partition, and cannot being used

  13. Paging: Non-contiguous Memory Allocation • Process is allocated memory wherever it is available • Divide physical memory into fixed-sized blocks called frames • size is power of 2, between 512 bytes and 8,192 bytes • OS keeps track of all free frames • Divide logical memory into blocks of same size called pages • To run a program of size n pages, need to find n free frames and load program • Set up a page table to translate logical to physical addresses

  14. page number page offset p d m - n n Address Translation Scheme • Address generated by CPU is divided into: • Page number (p) – used as an index into a pagetable which contains base address of each page in physical memory • Page offset (d) – combined with base address to define the physical memory address that is sent to the memory unit • Address pace = 2mentries • Pagesize =2n entries • Number of page = 2 (m-n) entries

  15. Paging Hardware

  16. Paging Model of Logical and Physical Memory

  17. Paging Example Page size = 4 bytes Memory size = 8 pages = 32 bytes Logical address 0: page = 0/4 = 0, offset = 0%4 = 0 maps to  frame 5 + offset 0  physical address 20 Logical address 13: page = 13/4 = 3, offset = 13%4 = 1 maps to  frame 2 + offset 1  physical address 9

  18. Free Frames After allocation Before allocation Note: Every process must have its own page table

  19. Implementation of Page Table • Page table is kept in main memory • Page-table base register (PTBR) points to page table • Page-table length register (PRLR) indicates size of page table • What is the downside of keeping P.T. in memory? • Memory slow down. Every data/instruction access requires two memory accesses: One for page table and one for data/instruction • Solution? • use a special fast-lookup associative memory to accelerate memory access • Called Translation Look-aside Buffer (TLB) • Stores part of page table (of currently running process) • TLBs store process identifier (address space identifier) in each TLB entry  provide address-space protection for that process

  20. Paging Hardware With TLB

  21. Effective Access Time • Associative Lookup =  time unit • Assume memory cycle time is tm time unit • Typically  << tm • Hit ratio  • percentage of times a page is found in associative memory • Effective Access Time (EAT) = ? EAT = ( + tm)  + ( + tm + tm) (1 – ) =  + (2 – ) tm As  approaches 1, EAT approaches tm

  22. Memory Protection • Memory protection implemented by associating protection bits with each frame • Can specify whether a page is: read only, write, read-write • Another bit (valid-invalid) may be used • “valid” indicates whether a page is in the process’ address space, i.e., a legal page to access • “invalid” indicates that the page is not in the process’ address space

  23. Valid (v) or Invalid (i) Bit In A Page Table

  24. Shared Pages • Suppose that you have code (e.g., emacs) that are being used by several processes at the same time. Does every process need to have a separate copy of that code in memory? • NO. Put the code in “Shared Pages” • One copy of read-only (reentrant) code shared among processes, e.g., text editors, compilers, … • Shared code must appear in same location in the logical address space of all processes • What if the shared code creates per-process data? • Each process keeps “private pages” for data • Private pages can appear anywhere in address space

  25. Example: Shared/Private Pages Private page for process P1 Shared pages

  26. Structure of the Page Table • Assume that we have 32-bit address space, and page size is 1024 byes • How many pages do we have? • 222 pages • What is the maximum size of the page table, assuming that each entry takes 4 bytes? • 16 Mbytes • Anything wrong with this number? • Huge size (for each process, mostly unused). Solutions? • Hierarchical Paging • Hashed Page Tables • Inverted Page Tables

  27. Hierarchical Page Tables • Break up the logical address space into multiple page tables • A simple technique is a two-level page table

  28. Two-Level Page-Table Scheme

  29. Two-Level Paging Example • Logical address (on 32-bit machine with 1K page size) is divided into: • a page number consisting of 22 bits • a page offset consisting of 10 bits • Since page table is paged, page number is further divided into: • a 12-bit page number • a 10-bit page offset • Thus, a logical address is as follows: • where pi is an index into the outer page table, and p2 is the displacement within the page of the outer page table page number page offset p2 pi d 10 10 12

  30. Address Translation in 2-Level Paging Note: OS creates the outer page table and one page of the inner page table. More pages of the inner table are created on-demand.

  31. Three-level Paging Scheme • For 64-bit address space, we may have a page table like: • But the outer page table is huge, we may have 3-level: • The outer page table is still large!

  32. Hashed Page Tables • Common in address spaces > 32 bits • Fixed-size hash table • Page number is hashed into a page table • Collisions may occur  • An entry in page table may contain a chain of elements that hash to same location

  33. Hashed Page Table • Page number is hashed • If multiple entries, search for the correct one •  cost of searching, linear in worst-case!

  34. Inverted Page Table • One entry for each frame of physical memory • Entry has: page # stored in that frame, and info about process (PID) that owns that page

  35. Inverted Page Table (cont’d) • Pros and Cons of inverted page table? • Pros • Decreases memory needed to store each page table, • Cons • Increases time needed to search the table when a page reference occurs • Can be alleviated by using a hash table to reduce the search time • Difficult to implement shared pages • Because only one entry for each frame in the page table, i.e., one virtual address for each frame • Examples • 64-bit UltraSPARC and PowerPC

  36. Segmentation • Memory management scheme that supports user view of memory • A program is a collection of segments • A segment is a logical unit such as: main program, procedure, function, method, object, local variables, global variables, common block, stack, symbol table, arrays

  37. 1 1 4 2 3 4 2 3 user space physical memory space Logical View of Segmentation

  38. Segmentation Architecture • Logical address has the form: • <segment-number, offset> • Segment table • maps logical address to physical address • Each entry has • Base: starting physical address of segment • Limit: length of segment • Segment-table base register (STBR) points to location of segment table in memory

  39. Segmentation Hardware

  40. Segmentation Example

  41. Segmentation Architecture (cont’d) • Protection • With each entry in segment table associate: • validation bit = 0  illegal segment • read/write/execute privileges • Protection bits associated with segments  code sharing occurs at segment level (more meaningful) • Segments vary in length  • Memory allocation is a dynamic storage-allocation problem  Problem? • External fragmentation. Solution? • Segmentation with paging: divide a segment into pages, pages could be anywhere in memory • Get benefits of segmentation (user’s view) and paging (reduced fragmentation)

  42. Example: Intel Pentium • Supports segmentation and segmentation with paging • CPU generates logical address • Given to segmentation unit  produces linear address • Linear address given to paging unit  generates physical address in main memory

  43. Pentium Segmentation Unit • Segment size up to 4 GB • Max #segs per process 16 K • 8 K private segs  use local descriptor table • 8 K shared segs  use global descriptor table

  44. Pentium Paging Unit • Page size either: • 4 KB  two-level paging, or • 4 MB  one-level paging p2 d p1

  45. Linux on Pentium • Linux is designed to run on various processors • three-level paging (to support 64-bit architectures) • On Pentium, the middle directory size is set to 0 • Limited use of segmentation • On Pentium, Linux uses only six segments

  46. Summary • CPU generates logical (virtual) addresses • Binding to physical addresses can be static (compile time) or dynamic (load or run time) • Contiguous memory allocation • First-, Best-, and worst-fit • Fragmentation (external) • Paging: noncontiguous memory allocation • Logical address space is divided into pages, which are mapped using page table to memory frames • Page table (one table for each process) • Access time: use cache (TLB) • Size: use two- and three-levels for 32 bits address spaces; use hashed and inverted page tables for > 32 bits • Segmentation: variable size, user’s view of memory • Segmentation + paging: example Intel Pentium

More Related