Computer Organization: Hierarchical Speed

Download as pdf or txt
Download as pdf or txt
You are on page 1of 25

Computer organization

MODULE 4

MEMORY

 Memory is used for storing programs and data. The memory unit that communicates
directly with the processor is called main memory
 Devices that provide backup storage are called auxiliary devices. Examples include
magnetic disks and tapes.
 Only programs and data currently needed by the processor reside in main memory. All
other information is stored in auxiliary memory and transferred to main memory when
needed.
 The entire computer memory can be viewed as a hierarchy. The memory hierarchy consists
of all storage devices employed in a computer system

MEMORY HIERARCHY

The hierarchical arrangement of memory units ina computer system is called the memory
hierarchy. Each level of the hierarchy has the properties of higher speed, smaller size, and
lower cost than lower levels.The overall goal of memory hierarchy is to obtain the highest
possible average access speed while minimizing the total cost of the entire memory system.
The hierarchical arrangement of memory units are shown below:

1
Computer organization

Processor registers are located inside the processor. Each register typically holds a word of data.
The fastest access is to data held in processor registers.

Processor cache

 Intermediate stage between the ultra fast registers and much slower main memory
 Implemented directly on the processor chip. This cache is small because it competes for space
on the processor chip which must implement many other functions.
 The primary cache is also referred to as level 1 or L1 cache.

Secondary cache

 A larger secondary cache s placed between the primary cache and rest of the memory.
 It is referred to as level 2 or L2 cache and is usually implemented using SRAM chips.

Main memory

 Central storage unit in a computer system. Main memory is much larger but significantly
slower than cache memory

2
Computer organization

 The access time for main memory is about ten times longer than the access time for L1 cache
memory.

Secondary memory

 Secondary storage is the slowest and cheapest form of memory.


 Secondary storage devices include magnetic disks like hard drives and floppy disks, optical
disks such as CDs and CDROMs.

Principle of inclusion
Information stored in memory hierarchy satisfies 3 important properties:

1. Inclusion
2. Coherence
3. Locality

Inclusion property

The inclusion property implies that all information items are originally stored in the outermost
level, Mn. During processing, subsets of Mn are copied into Mn-1. Similarly subsets of Mn-1 are
copied into Mn-2 and so on. In other words, if an information is found in Mi, then copies of the
same can also be found in all upper levels Mi+1,Mi+2…

Coherence property

The coherence property requires that copies of the same information item at successive memory
levels be consistent. If a word is modified in cache, copies of that must be updated immediately or
eventually at all higher levels.

Locality of reference

The memory hierarchy was developed based on a program behavior known as locality of
references. Memory references are generated by the CPU for either instruction or data access.
These accesses tend to be clustered in certain regions in time, space and ordering.

Locality of reference has 3 dimensions namely:

 Temporal
 Spatial
 Sequential

Temporal locality

3
Computer organization

Recently executed items (instruction or data) are likely to be referenced again in near future. Eg :
iterative loops.

Spatial locality

This refer to the tendency for a process to access items whose addresses are near one another. Eg :
operation on arrays.

Sequential locality

In typical programs, the execution of instructions follows a sequential order unless branch
instructions create out-of-order executions.

Memory interleaving
 Here instead of organizing the memory as a single unit it is organized as many modules.

 Each module will have its own Address Buffer Register (ABR) and Data Buffer Register
(DBR).

 Now memory access operations can be done in more than one module at the same time.
Thus the rate of transmission of words to and from the main memory can be increased.

 The number of modules that can be kept busy depends on how the individual addresses are
distributed over the modules. If consecutive words are kept in different modules, the rate of
transmission can be increased.

The low order k bits of the memory address select a module and the high order m bits name a
location within that module. So consecutive addresses will be in successive modules. A request for

4
Computer organization

access to consecutive locations can keep several modules busy at the same time resulting in faster
access to a block of data and higher average utilization of the memory system.

Main memory

Internal organization of memory chips:


 Memory cells are usually organized in the form of array, in which each cell is capable of
storing one bit of in formation.
 Each row of cells constitute a memory word and all cells of a row are connected to a
common line called as word line.
 The cells in each column are connected to Sense / Write circuit by two bit lines.
 The Sense / Write circuits are connected to data input or output lines of the chip.
 During a write operation, the sense / write circuit receive input information and store it in
the cells of the selected word.
 During a write operation, the sense/write circuit receives input information and stores it in
cells of the selected word.
 Figure shows a very small memory chip consisting 16 words of 8 bits each. This is referred
to as a 16x8 organization.
 The data input and the output of each sense/write circuit are connected to a single
bidirectional data line that can be connected to the data bus of a computer.
 Two control lines R/W and CS are provided in addition to address and data lines. The
read/write input specifies the required operation and chip select input selects a given chip
in a multichip memory system

5
Computer organization

SRAM(Static RAM)
Memories that consists of circuits capable of retaining their state as long as power is applied are
known as static memory.

 Two inverters are cross connected to form a latch .The latch is connected to two bit lines
by transistors T1 and T2.
 These transistors act as switches that can be opened / closed under the control of the word
line.

6
Computer organization

 When the word line is at ground level, the transistors are turned off and the latch retain its
state.
Read Operation:
 In order to read the state of the SRAM cell, the word line is activated to close switches T1 and
T2. If the cell is in state 1, the signal on bit line b is high and the signal on the bit line b is
low. Thus b and b are complement of each other.
 The Sense / write circuit at the end of the bit line monitors the state of b and b’ and set
the output accordingly

Merit :

 It has low power consumption because the current flows in the cell only when the cell is being
activated accessed.

 Static RAMs can be accessed quickly. It access time is few nanoseconds.

Demerit:

 SRAMs are said to be volatile memories because their contents are lost when the power is
interrupted.

DRAM

 The information stored in a dynamic memory cell in the form of a charge on a capacitor and this
charge can be maintained only for tens of Milliseconds.

 The contents must be periodically refreshed by restoring by restoring this capacitor charge to its
full value.

7
Computer organization

 In order to store information in the cell, the transistor T is turned „on‟ & the appropriate
voltage is applied to the bit line, which charges the capacitor.

 After the transistor is turned off, the capacitor begins to discharge which is caused by the
capacitors own leakage resistance.

 Hence the information stored in the cell can be retrieved correctly before the threshold value of
the capacitor drops down.

 During a read operation, the transistor is turned „on‟ & a sense amplifier connected to the bit
line detects whether the charge on the capacitor is above the threshold value.

 If charge on capacitor > threshold value -> Bit line will have logic value „1.
 If charge on capacitor < threshold value -> Bit line will set to logic value „0.

READ ONLY MEMORY(ROM):

 Both SRAM and DRAM chips are volatile, which means that they lose the stored information if
power is turned off.

 Many applications require non-volatile memory (which retain the stored information if power is
turned off). Eg: Operating System software has to be loaded from disk to memory which requires
the program that boots the Operating System i.e it requires non-volatile memory.

8
Computer organization

 Non-volatile memory is used in embedded system. Since the normal operation involves only
reading of stored data , a memory of this type is called ROM.

 At Logic value ‘0’ Transistor(T) is connected to the ground point(P).Transistor switch is


closed & voltage on bitline nearly drops to zero.
 At Logic value ‘1’ Transistor switch is open.The bitline remains at high voltage.To read the
state of the cell,the word line is activated. A Sense circuit at the end of the bitline generates the
proper output value.

Types of ROM

PROM:-Programmable ROM:
 PROM allows the data to be loaded by the user.
 Programmability is achieved by inserting a „fuse at point P in a ROM cell.
 Before it is programmed, the memory contains all 0s

9
Computer organization

 The user can insert 1s at the required location by burning out the fuse at these locations using high-
current pulse.
 This process is irreversible.

Merit:
 It provides flexibility.
 It is faster.
 It is less expensive because they can be programmed directly by the user.

EPROM:-Erasable reprogrammable ROM:


 EPROM allows the stored data to be erased and new data to be loaded.
Merits:
 It provides flexibility during the development phase of digital system.
 It is capable of retaining the stored information for a long time.

Demerits:
 The chip must be physically removed from the circuit for reprogramming and its entire
contents are erased by UV light.
EEPROM:-Electrically Erasable ROM:
 A significant disadvantage of EPROMs is that a chip must be physically removed from the
circuit for reprogramming and that its entire contents are erased by the ultraviolet light.
 EEPROMs are another version of erasable PROMs that can be both programmed and erased
electrically.
 They do not have to be removed for erasure and it is possible to erase the cell contents
selectively
 The only disadvantage of EEPROMs is that different voltages are needed for erasing,
writing and reading the stored data

Merits:
 It can be both programmed and erased electrically.
 It allows the erasing of cell contents selectively.

Demerits:
 It requires different voltage for erasing ,writing and reading the stored data.

Cache memory
The cache memory is a small and fast memory that is placed between the main memory
and the CPU. It holds the currently active and more frequently used segments of program
and its data.

10
Computer organization

 The effectiveness of cache mechanism is based on the property of Locality of reference’.


 Many instructions in the localized areas of the program are executed repeatedly
during some time period and remainder of the program is accessed relatively
infrequently is called locality of reference. It manifests itself in 2 ways. They
are:

 Temporal(The recently executed instruction are likely to be executed again


very soon.)

 Spatial(The instructions in close proximity to recently executed instruction are


also likely to be executed soon.)
 If the active segment of the program is placed in cache memory, then the total execution time can
be reduced significantly.
 The correspondence between main memory block and the block in cache memory is specified by a
mapping function.
 The Cache control hardware decides which block should be removed to create space for the new
block that contains the referenced word.The collection of rules for making this decision is called
the replacement algorithm.
 The cache control circuit determines whether the requested word currently exists in the cache.If it
exists, then Read/Write operation will take place on appropriate cache location. In this case
Read/Write hit will occur.
 In a Read operation, the memory will not involve.
 The write operation is proceed in 2 ways. They are,

 Write-through protocol :Here the cache location and the main memory locations are
updated simultaneously.

11
Computer organization


 Write-back protocol :This technique is to update only the cache location and to
mark it as with associated flag bit called dirty/modified bit. The word in the main
memory will be updated later, when the block containing this marked word is to be
removed from the cache to make room for a new block.
 If the requested word currently not exists in the cache during read operation, read miss will
occur. To overcome the read miss Load –through / Early restart protocol is used. After the
entire block is loaded into cache,the particular word requested is forwarded to the processor.
 If the requested word not exists in the cache during write operation,then Write Miss will
occur.
 If Write through protocol is used,the information is written directly into main memory.
 If Write back protocol is used then block containing the addressed word is first brought intothe
cache and then the desired word in the cache is over-written with the new information.

Mapping Function:

Direct Mapping:

 It is the simplest technique in which block j of the main memory maps onto block „j modulo 128 of
the cache.
 Thus whenever one of the main memory blocks 0,128,256 is loaded in the cache, it is stored in
block 0.
 Blocks 1,129,257 are stored in cache block 1 and so on.
.

12
Computer organization

Placement of block in the cache is determined from memory address

 The memory address is divided into 3 fields. They are,

Low Order 4 bit field (word)Selects one of 16 words in a block.


7 bit cache block fieldWhen new block enters cache, 7 bit determines the cache position in
which this block must be stored.
5 bit Tag fieldThe high order 5 bits of the memory address of the block is stored in 5 tag bits
associated with its location in the cache.
 As execution proceeds, the high order 5 bits of the address is compared with tag bits associated
with that cache location.
 If they match, then the desired word is in that block of the cache.

 If there is no match, then the block containing the required word must be first read from the main
memory and loaded into the cache.

13
Computer organization

Merit:
 It is easy to implement.

Demerit:
 It is not very flexible.

Associative Mapping:

 In this method, the main memory block can be placed into any cache block position.

 12 tag bits will identify a memory block when it is resolved in the cache.
 The tag bits of an address received from the processor are compared to the tag bits of each block of
the cache to see if the desired block is present. This is called associative mapping.
 It gives complete freedom in choosing the cache location.
 A new block that has to be brought into the cache has to replace (eject)an existing block if the
cache is full.
 In this method, the memory has to determine whether a given block is in the cache. A search of this
kind is called an associative Search.
Merit:
14
Computer organization

 It is more flexible than direct mapping technique.

Demerit:
 Its cost is high.

Set-Associative Mapping:

 Hybrid between a direct mapped cache and set associative cache.


 Combines the simplicity of direct mapping with the flexibility of associative mapping.
 Blocks of cache are grouped into sets and the mapping allows a block of main memory to
reside in any block of a specified set
 The position of a memory block in cache is given by:

(block number) MOD(number of sets in cache)

Levels of cache

1. Level 1(primary) cache

15
Computer organization

Level 1 or primary cache is the fastest memory on the PC and is referred to as internal cache. It is
built directly into the processor itself. This cache is very small, ranging from 8 KB to 64 KB, but it
is extremely fast. It runs at the same speed as the processor. If the processor requests information
and can find it in the level 1 cache that is the best case because information is there immediately
and the system does not have to wait.

2. Level 2(secondary) cache

The level 2 cache is a secondary cache to the level 1 cache. It is referred to as external cache and
is larger and slightly slower. It is used to catch recent accesses that is not caught by level 1 cache
and is usually 64 KB to 2 MB in size. A level 2 cache is found either on the motherboard or a
daughter board that inserts into the motherboard.

Main memory update policies

The read policies are:

 Read Through - reading a word from main memory to CPU


 No Read Through - reading a block from main memory to cache and then from cache to CPU

 Write back policy

In write back policy, the cache is modified on a write and the main memory is updated only
when the line in the cache is removed.

 Write through policy

The main memory is updated at the same time as the cache ie whenever there is a write to a
particular address location, both the cache and the main memory are written into. This implies
that no book keeping need be done in order to determine which of the lines in the cache need to
be written back to main memory as in the case of write back. It increases the memory traffic,
since there is a main memory access on every write.

Disk memory

16
Computer organization

 Magnetic Disk system consists o one or more disk mounted on a common spindle.
 A thin magnetic film is deposited on each disk, usually on both sides.
 The disks are placed in a rotary drive so that the magnetized surfaces move in close proximity
to read /write heads.
 Each head consists of magnetic yoke & magnetizing coil.

 Digital information can be stored on the magnetic film by applying the current pulse of suitable
polarity to the magnetizing coil.
 Only changes in the magnetic field under the head can be sensed during the Read operation.
Therefore if the binary states 0 & 1 are represented by two opposite states of magnetization, a
voltage is induced in the head only at 0-1 and at 1-0 transition in the bit stream.
 A consecutive (long string) of 0‟s & 1‟s are determined by using the clock which is mainly
used for synchronization.
 The Read/Write heads must be maintained at a very small distance from the moving disk
surfaces in order to achieve high bit densities.
 When the disk are moving at their steady state, the air pressure develops between the disk
surfaces & the head & it forces the head away from the surface.
 The flexible spring connection between head and its arm mounting permits the head to fly at
the desired distance away from the surface.

Winchester Technology
 Read/Write heads are placed in a sealed, air –filtered enclosure called the Winchester
Technology.
 In such units, the read/write heads can operate closure to magnetic track surfaces because the
dust particles which are a problem in unsealed assemblies are absent.
 It has a larger capacity for a given physical size.
 The data intensity is high because the storage medium is not exposed to contaminating
elements.

17
Computer organization

The disk system has 3 parts. They are :

Disk Platter(Usually called Disk)


Disk Drive(spins the disk & moves Read/write heads)
Disk Controller(controls the operation of the system.)

Data organization on disk

 Each surface is divided into concentric tracks. Each track is divided into sectors.
 The set of corresponding tracks on all surfaces of a stack of disk form a logical cylinder.
 The data are accessed by specifying the surface number, track number and the sector number.
 The Read/Write operation start at sector boundaries. Data bits are stored serially on each track.
Each sector usually contains 512 bytes.
 An unformatted disk has no information on its tracks. The formatting process divides the disk
physically into tracks and sectors
 The disk is divided into logical partitions. They are,
Primary partition
Secondary partition

Disk performance

Disk performance is measured in terms of following performance factors:

Seek time

 The time required to move the read/write head to the proper track. This depends on the
initial position of the head relative to the track specified in the address.
 Measured in milliseconds. Average values are in 5 to 8 ms range.

Rotational delay/ latency

18
Computer organization

 Amount of time that the drive takes for the platter to spin, bringing the sector to the right
position.
 Measured in milliseconds
 Faster the platter spins, lower the latency

Access time

 The sum of seek time and latency is called the disk access time
Access time = seek time + latency
 Total delay between the beginning of a read or write operation and the time when the drive
actually begins reading or writing data.
 Measured in milliseconds

Rotational delay

 Speed at which the disk platters spin, in revolutions per minute.


 Average value is 5000 to 7200 rpm.

Areal density

Specifies the amount of data that can be stored on the drive.

Disk caching

 Disk caching is the mechanism for improving the time it takes to read from or write to a
herd disk.
 Disk cache holds the data that has been recently read and adjacent data areas that are
likely to be addressed next.
 A disk drive is connected to the rest of a computer system using some standard
interconnection scheme.
 Normally a standard bus, such as SCSI bus is used. The SCSI bus is capable of
transferring data at much higher rates than the rate at which data can be read from disk
tracks.
 An efficient way to deal with possible differences in transfer rates between the disk and the
SCSI bus is to include a data buffer/cache in the disk unit.This buffer is a semiconductor
memory, capable of storing a few megabytes of data.
 When a read request arrives at the disk, the controller can first check to see if the desired
data are already available in the cache.
 If so, data can be accessed and placed on the SCSI bus in microseconds rather than
milliseconds.
 Otherwise, the data are read from the disk in the usual way and stored in the cache.

19
Computer organization

20
Computer organization

Module 5

Virtual memory

 Techniques that automatically move program and data blocks into the physical main memory when
they are required for execution is called the Virtual Memory.
 The binary address that the processor issues either for instruction or data are called the virtual /
Logical address.

21
Computer organization

 The virtual address is translated into physical address by a combination of hardware and software
components. This kind of address translation is done by MMU(Memory Management Unit).
 When the desired data are in the main memory , these data are fetched /accessed immediately.
 If the data are not in the main memory, the MMU causes the Operating system to bring the data
into memory from the disk.

Virtual Memory Organisation

Address Translation:
 In address translation,all programs and data are composed of fixed length units called Pages.
 The Page consists of a block of words that occupy contiguous locations in the main memory.
 The pages are commonly range from 2K to 16K bytes in length.
 The cache bridge speed up the gap between main memory and secondary storage and it is
implemented in software techniques.
 Each virtual address generated by the processor contains virtual Page number(Low order bit)
and offset(High order bit)

 OffsetSpecifies the location of a particular byte (or word) within a page

22
Computer organization

.
Page Table:

It contains the information about the main memory address where the page is stored & the
current status of the page.
Page Frame:
 An area in the main memory that holds one page is called the page frame.

Page Table Base Register:


 It contains the starting address of the page table.

 Virtual Page Number+Page Table Base registerGives the address of the corresponding
entry in the page table.ie it gives the starting address of the page if that page currently resides
in memory.
Control Bits in Page Table:
 The Control bits specifies the status of the page while it is in main memory.

 The control bit indicates the validity of the page ie)it checks whether the page is actually
loaded in the main memory.
 It also indicates that whether the page has been modified during its residency in the
memory;this information is needed to determine whether the page should be written back to the
disk before it is removed from the main memory to make room for another page.

Fig:Virtual Memory Address Translation

 The Page table information is used by MMU for every read & write access.

23
Computer organization

 The Page table is placed in the main memory but a copy of the small portion of the page table is
located within MMU.
 This small portion or small cache is called Translation LookAside Buffer(TLB).
 This portion consists of the page table entries that corresponds to the most recently accessed
pages and also contains the virtual address of the entry.
 Given a virtual address ,the MMU looks in TLB for the referenced page.
 If the page table entry for this page is found in TLB,the physical address is obtained
immediately.
 If there is a miss in TLB ,then the required entry is obtained from the page table in the main
memory & TLB is updated.
 When a program generates an access request to a page that is not in the main memory ,then Page
Fault will occur.
 The whole page must be brought from disk into memory before an access can proceed.
 When it detects a page fault, the MMU asks the operating system to generate an interrupt.
 The operating System suspend the execution of the task that caused the page fault and begin
execution of another task whose pages are in main memory because the long delay occurs while
page transfer takes place.
 When the task resumes either the interrupted instruction must continue from the point of
interruption or the instruction must be restarted.
 If a new page is brought from the disk when the main memory is full,it must replace one of the
resident pages. In that case, it uses LRU algorithm which removes the least referenced Page.

A modified page has to be written back to the disk before it is removed from the main memory. In that
case,write –through protocol is used

Overlay
 overlaying means replacement of a block of stored instructions or data with another.
Overlaying is a programming method that allows programs to be larger than the
computer's main memory.

24
Computer organization

Transfer of data between disk and main memory is performed using DMA

25

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy