0% found this document useful (0 votes)
36 views

DATA STORAGE and Compression

Uploaded by

VengaiChiseva
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
36 views

DATA STORAGE and Compression

Uploaded by

VengaiChiseva
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 5

DATA STORAGE

 Data storage is measured in a variety of units, each representing a


different size of storage capacity. The smallest unit of measurement is
the bit, which represents a single binary digit (either 0 or 1)
 A nibble is a group of 4 bits, while a byte is a group of 8 bits
 Kibibyte (KiB), mebibyte (MiB), gibibyte (GiB), tebibyte (TiB), pebibyt
e (PiB), and exbibyte (EiB) are all larger units of measurement
 Specifically, 1 KiB is equal to 210 bytes, 1 MiB is equal to 220 bytes, 1 GiB is
equal to 230 bytes, 1 TiB is equal to 240 bytes, 1 PiB is equal to 250 bytes,
and 1 EiB is equal to 260 bytes

To calculate the file size of an image file:

 Determine the resolution of the image in pixels (width x height)


 Determine the colour depth in bits (e.g. 8 bits for 256 colours)
 Multiply the number of pixels by the colour depth to get the total number
of bits
 Divide the total number of bits by 8 to get the file size in bytes
 If necessary, convert to larger units like kibibytes, mebibytes, etc

Calculating image file size walkthrough:

An image measures 100 by 80 pixels and has 128 colours (so this must use 7
bits)

100 x 80 x 7 = 56000 bits ÷ 8 = 7000 bytes ÷ 1024 = 6.84 kibibytes

To calculate the file size of a sound file:

 Determine the sample rate in Hz (e.g. 44,100 Hz)


 Determine the sample resolution in bits (e.g. 16 bits)
 Determine the length of the track in seconds
 Multiply the sample rate by the sample resolution to get the number of
bits per second
 Multiply the number of bits per second by the length of the track to get
the total number of bits
 Divide the total number of bits by 8 to get the file size in bytes
 If necessary, convert to larger units like kibibytes, mebibytes, etc

Calculating sound file size walkthrough:

A sound clip uses 48KHz sample rate, 24 bit resolution and is 30 seconds long.

48000 x 24 = 1152000 bits per second x 30 = 34560000 bits for the whole clip

34560000 ÷ 8 = 4320000 bytes ÷ 1024 = 4218.75 kibibytes ÷ 1024 = 4.12


mebibytes
COMPRESSION

Compression is reducing the size of a file. This is done to reduce the amount of
storage space it takes up or to reduce the bandwidth when sending a file. There
are 2 types of compression:

 Lossless Compression:
o A compression algorithm is used to reduces the file
size without permanently removing any data
o Repeated patterns in the file are identified and indexed
o The data is replaced with the index and positions stored
o The number of times the pattern appears is also stored
o Techniques like run-length encoding (RLE) and Huffman
encoding are used
o RLE replaces sequences of repeated characters with a
code that represents the character and the number of times it
is repeated
o Huffman encoding replaces frequently used characters with shorter
codes and less frequently used characters with longer codes

 Lossy Compression:
o Lossy compression reduces the file size by permanently
removing some data from the file
o This method is often used for images and audio files where minor
details or data can be removed without significantly impacting
the quality
o Techniques like down sampling, reducing resolution or colour
depth, and reducing the sample rate or resolution are used for
lossy compression
o The amount of data removed depends on the level of compression
selected and can impact the quality of the final file

 Overall:
o Compression is necessary to reduce the size of large
files for storage, transmission, and faster processing
o The choice between lossy and lossless compression methods
depends on the type of file and its intended use
o Lossy compression is generally used for media files where minor
data loss is acceptable while lossless compression is used for text,
code, and archival purposes

Character Sets
 Text is a collection of characters that can be represented in binary, which
is the language that computers use to process information
 To represent text in binary, a computer uses a character set, which is
a collection of characters and the corresponding binary codes that
represent them
 One of the most commonly used character sets is the American Standard
Code for Information Interchange (ASCII), which assigns a unique 7-bit
binary code to each character, including uppercase and lowercase
letters, digits, punctuation marks, and control characters
 E.g. The ASCII code for the uppercase letter 'A' is 01000001, while the
code for the character '?' is 00111111
 ASCII has limitations in terms of the number of characters it can represent,
and it does not support characters from languages other than
English
 To address these limitations, Unicode was developed as a character
encoding standard that allows for a greater range of characters and
symbols than ASCII, including different languages and emojis
 Unicode uses a variable-length encoding scheme that assigns a
unique code to each character, which can be represented in binary form
using multiple bytes
 E.g. The Unicode code for the heart symbol is U+2665, which can be
represented in binary form as 11100110 10011000 10100101
 As Unicode requires more bits per character than ASCII, it can result
in larger file sizes and slower processing times when working with text-
based data

REPRESENTING SOUND

 Sound is a type of analog signal that is captured and converted into


digital form to be processed by a computer.
 To convert sound into digital form, a process called sampling is used.
This involves taking measurements of the sound wave at regular
intervals and converting these measurements into binary data
 The quality of the digital sound depends on the sample rate, which is
the number of samples taken per second. A higher sample
rate results in a more accurate representation of the original sound
wave, but also increases the file size of the digital sound
 E.g. A typical CD-quality digital sound has a sample rate of 44.1 kHz,
which means that 44,100 samples are taken per second
 The sample resolution is another factor that affects the quality of the
digital sound. This refers to the number of bits per sample, which
determines the level of detail and accuracy of each sample
 A higher sample resolution results in a more accurate
representation of the sound wave, but also increases the file size of
the digital sound
 E.g. A CD-quality digital sound typically has a sample resolution of 16
bits, which means that each sample is represented by a 16-bit binary
number
 It's important to choose the appropriate sample rate and resolution based
on the specific requirements of the digital sound application. E.g. A high-
quality music recording may require a higher sample rate and resolution
than a voice recording for a podcast

 Two examples of sound files are:


o MIDI
 Musical Instrument Digital Interface (file)
 Stores a set of instructions (for how the sound should be
played)
 It does not store the actual sounds
 Data in the file has been recorded using digital instruments
 Specifies the note to be played
 Specifies when each note plays and stops playing
 Specifies the duration of the note
 Specifies the volume of the note
 Specifies the tempo
 Specifies the type of instrument
 Individual notes can be edited

o MP3
 MP3 is a format for digital audio
 MP3 is an actual recording of the sound
 MP3 is a (lossy) compression format
 It is recorded using a microphone

REPRESENTING IMAGES

 A bitmap image is made up of a series of pixels, which are small dots of


colour that are arranged in a grid. Each pixel can be represented by
a binary code, which is processed by a computer
 The resolution of an image refers to the number of pixels in the
image. A higher resolution image has more pixels and is,
therefore, sharper and more detailed but also requires more storage
space
 The colour depth of an image refers to the number of bits used to
represent each colour. A higher colour depth means that more
colours can be represented, resulting in a more realistic image but
also requires more storage space
 E.g. an 8-bit colour depth allows for 256 different colours to be
represented (28=256), while a 24-bit colour depth allows for over 16
million different colours to be represented (224=16,777,216)
 The file size of an image increases as the resolution and colour
depth increase. This is because more pixels and colours require more
binary data to represent them
 The quality of an image also increases as the resolution and colour
depth increase. However, it's important to balance the desired quality
with the practical limitations of storage space

Worked example
An image has a resolution of 600 x 400 and a colour depth of 1 byte. Calculate
the file size of the image giving your answer in bytes. Show your working

[2]

 [2]

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy