
C10 - Ai - Computer Vision

The document provides an overview of Computer Vision (CV) as a domain of Artificial Intelligence, detailing its objectives, applications, and fundamental concepts such as image representation, feature extraction, and object detection. It highlights various applications of CV, including facial recognition, self-driving cars, and medical imaging, while explaining the basics of images, pixels, and color representation. Additionally, it discusses tasks associated with CV, such as classification and segmentation, and emphasizes the importance of features in image processing.

Uploaded by

abbad2160


INDIAN SCHOOL MUSCAT

CLASS X
ARTIFICIAL INTELLIGENCE
UNIT 4
COMPUTER VISION (CV)
OBJECTIVES
• Introduction to Computer Vision
• Applications of CV
• Concepts of Computer Vision
• Understand the basic concepts of image representation, feature extraction, object detection, and segmentation
• Understanding CV Concepts
• Computer Vision Tasks
• Basics of Images: Pixel, Resolution, Pixel value, Grayscale and RGB images
Computer Vision
As humans, we can see things, analyse them and then take the required
action on the basis of what we see.

But can machines do the same? Can machines have the eyes that
humans have? If you answered yes, then you are absolutely right.
The Computer Vision domain of Artificial Intelligence enables
machines to see through images or visual data, and to process and
analyse them using algorithms and methods in order to
understand real-world phenomena.
The concept of computer vision was first introduced in the 1970s.
Applications of Computer Vision
Facial Recognition: With the advent of smart cities and
smart homes, Computer Vision plays a vital role in making
the home smarter. Security is the most important
application here, and it uses Computer Vision for facial
recognition, either to recognise guests or to maintain a log
of visitors.
Facial recognition is also used in schools for attendance
systems that identify students by their faces.
Face Filters: Modern apps like Instagram and
Snapchat have many features based on
Computer Vision. Face filters are one
of them. Through the camera, the
algorithm identifies the facial dynamics of the
person and applies the selected filter.
Google's Search by Image: Most searches
on Google's search engine use
textual data, but it also has an interesting
feature for getting search results from an image. This
uses Computer Vision: it analyses various features of
the input image, compares them with a database of images
and gives us the search results.
Computer Vision in Retail: Retail has been one
of the fastest growing fields, and it uses
Computer Vision to improve the customer
experience. Retailers can use Computer Vision techniques to
track customers' movements through stores, analyse
navigational routes and detect walking patterns.
Inventory management is another such application.
By analysing security camera images, a Computer
Vision algorithm can generate an accurate estimate of
the items available in the store. It can also analyse the use
of shelf space to identify suboptimal configurations and
suggest better item placement.
Self-Driving Cars: Computer Vision is the fundamental
technology behind autonomous vehicles. Most
leading car manufacturers in the world are investing
in artificial intelligence to develop
on-road versions of hands-free driving technology.
This involves identifying objects, working out
navigational routes and, at the same time, monitoring
the environment.
Medical Imaging: For the last few decades, computer-supported
medical imaging applications have been a
trustworthy help for physicians. They not only create and
analyse images, but also act as an assistant that helps
doctors with interpretation. Such an application can
read 2D scan images and convert them into interactive 3D
models that enable medical professionals to gain a
detailed understanding of a patient's health condition.
Google Translate App: All you need to do to read signs in
a foreign language is point your phone's camera at the
words and let the Google Translate app tell you what they
mean in your preferred language almost instantly. It uses
optical character recognition to read the image and
augmented reality to overlay the translation, which makes
it a convenient tool built on Computer Vision.
How does Computer Vision work?

Computer Vision is a domain of Artificial Intelligence that
deals with images. It combines the concepts of image
processing and machine learning models to build
Computer Vision based applications.
Computer Vision Tasks

The various applications of Computer Vision are based on
a certain number of tasks, which are performed to get
information from the input image. This information can be
used directly for prediction or form the base for further
analysis.
The tasks used in a computer vision application are:

For Single Objects:
• Classification
• Classification + Localisation
For Multiple Objects:
• Object Detection
• Instance Segmentation
Single objects
Classification
Image classification is the task of assigning an input image one
label from a fixed set of categories. This is one of the core problems in CV
that, despite its simplicity, has a large variety of practical applications.
Classification + Localisation
This task involves both identifying what object is
present in the image and identifying where in the image
that object is located. It is used only for single objects.
Multiple objects
Object Detection
Object detection is the process of finding instances of real-world objects such as faces,
bicycles, and buildings in images or videos. Object detection algorithms typically use
extracted features and learning algorithms to recognise instances of an object category.
It is commonly used in applications such as image retrieval and automated vehicle
parking systems.
Instance Segmentation
Instance segmentation is the process of detecting instances of objects, giving them a
category and then giving each pixel a label on the basis of that. A segmentation
algorithm takes an image as input and outputs a collection of regions (or segments).
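To make the idea of "regions" and "location" concrete, here is a minimal sketch in Python. It is not a real instance segmentation model: it assumes the image has already been reduced to a tiny binary mask (1 = object pixel, 0 = background) and uses simple connected-component labelling as a stand-in. The names label_regions and bounding_box are made up for this illustration.

```python
from collections import deque


def label_regions(mask):
    """Give every 4-connected group of 1-pixels its own region number."""
    height, width = len(mask), len(mask[0])
    labels = [[0] * width for _ in range(height)]  # 0 means background
    count = 0
    for r in range(height):
        for c in range(width):
            if mask[r][c] == 1 and labels[r][c] == 0:
                # Found an unlabelled object pixel: start a new region
                count += 1
                labels[r][c] = count
                queue = deque([(r, c)])
                while queue:
                    y, x = queue.popleft()
                    for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                        inside = 0 <= ny < height and 0 <= nx < width
                        if inside and mask[ny][nx] == 1 and labels[ny][nx] == 0:
                            labels[ny][nx] = count
                            queue.append((ny, nx))
    return labels, count


def bounding_box(labels, region_id):
    """Return (top, left, bottom, right) of one region: a simple 'location'."""
    rows = [r for r, row in enumerate(labels) for v in row if v == region_id]
    cols = [c for row in labels for c, v in enumerate(row) if v == region_id]
    return min(rows), min(cols), max(rows), max(cols)


# Hypothetical 4 x 5 mask containing two separate objects
mask = [
    [1, 1, 0, 0, 0],
    [1, 1, 0, 0, 1],
    [0, 0, 0, 1, 1],
    [0, 0, 0, 1, 1],
]
labels, count = label_regions(mask)
print(count)                    # 2
print(bounding_box(labels, 1))  # (0, 0, 1, 1)
print(bounding_box(labels, 2))  # (1, 3, 3, 4)
```

The labels grid plays the role of a per-pixel label map, and each bounding box is the kind of location that localisation or object detection would report. Real systems learn these outputs from data instead of using a fixed rule.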
Basics of Images
We all see a lot of images around us and use them daily, either
through our mobile phones or computer systems. But do we ask
ourselves some basic questions about them while we use them on
such a regular basis?
Basics of Pixels
• The word "pixel" means a picture element. Every photograph,
in digital form, is made up of pixels.
• They are the smallest unit of information that make up a
picture.
• Usually round or square, they are typically arranged in a
2-dimensional grid.
Pixels
In the image below, one portion has been magnified many times over so that you can
see its individual composition in pixels. As you can see, the pixels approximate the
actual image.
The more pixels you have, the more closely the image resembles the original.
Resolution

The number of pixels in an image is sometimes called the
resolution. When the term is used to describe pixel count, one
convention is to express resolution as the width by the height, for
example a monitor resolution of 1280×1024. This means there are
1280 pixels from one side to the other, and 1024 from top to
bottom.
Another convention is to express the number of pixels as a
single number, like a 5 megapixel camera (a megapixel is a
million pixels). This means the pixels along the width multiplied
by the pixels along the height of the image taken by the camera
equals 5 million pixels. In the case of our 1280×1024 monitor, the
resolution could also be expressed as 1280 × 1024 = 1,310,720 pixels,
or 1.31 megapixels.
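The arithmetic above can be checked with a few lines of Python. This is only an illustrative sketch; the function names pixel_count and megapixels are made up for this example.

```python
def pixel_count(width, height):
    """Total number of pixels in an image of the given width and height."""
    return width * height


def megapixels(width, height):
    """Pixel count expressed in millions of pixels."""
    return pixel_count(width, height) / 1_000_000


total = pixel_count(1280, 1024)
print(total)                             # 1310720
print(round(megapixels(1280, 1024), 2))  # 1.31
```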
Pixel value
• Each of the pixels that represents an image stored inside a
computer has a pixel value which describes how bright that
pixel is, and/or what colour it should be.
• The most common pixel format is the byte image, where this
number is stored as an 8-bit integer giving a range of possible
values from 0 to 255.
• Typically, zero is taken to be no colour or black, and 255 is
taken to be full colour or white.
Why do we have a value of 255?
In computer systems, data is stored in the form of ones and
zeros, which we call the binary system. Each bit in a computer
system can have either a zero or a one (one byte = 8 bits).
Each pixel uses 1 byte of an image, which is equivalent to 8 bits
of data. Since each bit can have two possible values, 8 bits can
have 2^8 = 256 possible values, which start from 0 and end at 255.

Number of bits   Different patterns                        No. of patterns
1                0, 1                                      2^1 = 2
2                00, 01, 10, 11                            2^2 = 4
3                000, 001, 010, 011, 100, 101, 110, 111    2^3 = 8
...
8                00000000 to 11111111                      2^8 = 256
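The counting argument above can be reproduced with a short Python sketch. The helper name bit_patterns is made up for this illustration; it simply lists every pattern of zeros and ones for a given number of bits.

```python
from itertools import product


def bit_patterns(n_bits):
    """List every pattern of 0s and 1s that n_bits can hold, in counting order."""
    return ["".join(bits) for bits in product("01", repeat=n_bits)]


for n in (1, 2, 3):
    patterns = bit_patterns(n)
    print(n, len(patterns), patterns)

eight_bit = bit_patterns(8)
print(len(eight_bit))                               # 256
print(int(eight_bit[0], 2), int(eight_bit[-1], 2))  # 0 255
```

There are 256 patterns, but because counting starts at 0, the largest value a byte can hold is 255.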
Grayscale Images
• Grayscale images are images which have a range of shades of gray
without apparent colour. The darkest possible shade is black, which
is the total absence of colour, or a pixel value of zero.
• The lightest possible shade is white, which is the total presence of
colour, or a pixel value of 255.
• Intermediate shades of gray are represented by equal brightness levels
of the three primary colours.
• In a grayscale image, each pixel is 1 byte in size and the pixels form a
single plane, stored as a 2D array. The size of a grayscale image is
defined as the Height x Width of that image.
Here is an example of a grayscale image. As you can check, the pixel values are within the range of 0-255.
Computers store the images we see in the form of these numbers.
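As a rough sketch of how such an image might look in memory, the Python snippet below stores a tiny grayscale image as a 2D list of numbers. The pixel values are invented for illustration and are not taken from the image in the slides.

```python
# A hypothetical 2 x 3 grayscale image: one number (0-255) per pixel
image = [
    [0, 64, 128],
    [192, 255, 32],
]

height = len(image)             # number of rows
width = len(image[0])           # number of pixels in each row
size_in_bytes = height * width  # one byte per pixel

# Packing the values into a bytes object only works because every
# value fits in one byte (0 to 255)
packed = bytes(value for row in image for value in row)

print(height, width)   # 2 3
print(size_in_bytes)   # 6
print(len(packed))     # 6
```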
RGB Images
Most of the images that we see around us are coloured images. These
images are made up of three primary colours: Red, Green and Blue.
All the colours that are present can be made by combining different
intensities of red, green and blue.
How do computers store RGB images?
Every RGB image is stored in the form of three different channels
called the R channel, G channel and the B channel.
Each channel (or plane) has its own grid of pixels, with each pixel value
varying from 0 to 255. All three planes, when combined
together, form a colour image. This means that in an RGB image,
each pixel has a set of three different values which together give
colour to that particular pixel.
As you can see, each colour
image is stored in the form of
three different channels, each
having a different intensity. All
three channels combine
together to form the colour we
see.
In the image above, if we split the image into three
different channels, namely Red (R), Green (G) and Blue (B), the
individual layers will have the following colour intensities for
the individual pixels.
These individual layers, when stored in memory, look like the
image on the extreme right.
Each channel looks like a grayscale image because each pixel has an
intensity value from 0 to 255 and, as studied earlier, 0 is considered
as black or no presence of colour and 255 means white or full
presence of colour. These three individual RGB values, when
combined together, form the colour of each pixel.
Therefore, each pixel in the RGB image has three values to form
the complete colour.
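The channel idea can be sketched in plain Python. The 2 x 2 image below is hypothetical, and the helper names split_channels and merge_channels are made up for this example. Each pixel is written as a (R, G, B) triple.

```python
# A hypothetical 2 x 2 colour image: each pixel is an (R, G, B) triple
pixels = [
    [(255, 0, 0), (0, 255, 0)],    # a red pixel, a green pixel
    [(0, 0, 255), (255, 255, 0)],  # a blue pixel, a yellow pixel
]


def split_channels(image):
    """Separate an RGB image into three single-channel planes."""
    red = [[p[0] for p in row] for row in image]
    green = [[p[1] for p in row] for row in image]
    blue = [[p[2] for p in row] for row in image]
    return red, green, blue


def merge_channels(red, green, blue):
    """Combine three planes back into one RGB image."""
    return [
        [(red[y][x], green[y][x], blue[y][x]) for x in range(len(red[0]))]
        for y in range(len(red))
    ]


red, green, blue = split_channels(pixels)
print(red)    # [[255, 0], [0, 255]]
print(green)  # [[0, 255], [0, 255]]
print(blue)   # [[0, 0], [255, 0]]
print(merge_channels(red, green, blue) == pixels)  # True
```

Each plane on its own is just a grid of numbers from 0 to 255, which is why a single channel looks like a grayscale image when displayed.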
Image Features
In computer vision and image processing, a feature is a piece of
information which is relevant for solving the computational
task related to a certain application. Features may be specific
structures in the image such as points, edges or objects.
Good Features

In this image, how would we determine the exact
location of each patch?
The blue patch is a flat area and is difficult to find
and track. Wherever you move the blue patch, it
looks the same. The black patch has an edge.
If you move it along the edge (parallel to the edge), it looks
the same. The red patch is a corner. Wherever
you move the patch, it looks different, therefore
it is unique. Hence, corners are considered to be
good features in an image.
Image Features
Conclusion
In image processing, we can get a lot of features from the image. A feature can be
a blob (binary large object), an edge or a corner. These features help
us to perform various tasks and then carry out the analysis required by
the application. The question that arises is: which of these are
good features to use?
As you saw in the previous activity, features with corners are easy
to find because they occur only at a particular location in the image,
whereas patches taken from an edge look the same
all along that edge. This tells us that corners are generally the best features to extract
from an image, followed by edges.
ACTIVITY 1:
Go to this online link:
https://www.w3schools.com/colors/colors_rgb.asp. Using
this online tool, try to answer all the
questions below.
1) What is the output colour when you put R=G=B=255?

2) What is the output colour when you put R=G=B=0?

3) How does the colour vary when you set any one of the three to 0 and then keep varying the other two?

4) How does the output colour change when all three colours are varied in the same proportion?

5) What is the RGB value of your favourite colour from the colour palette?
Image Features Activity-2
Imagine that your security camera is capturing an image.
At the top of the image we are given six small patches of
images. Our task is to find the exact location of those
image patches in the image.
Take a pencil and mark the exact location of those
patches in the image.
Image Features
Were you able to find the exact location of all the patches?

Which one was the most difficult to find?

Which one was the easiest to find?

Image Features
Let us consider the patches one pair at a time and
check their exact locations.
For Patch A and B: Patches A and B are flat surfaces in the
image and are spread over a lot of area. They can be present at
any location in a given area in the image.
For Patch C and D: Patches C and D are simpler compared
to A and B. They are edges of a building, and we can find an
approximate location of these patches, but finding the exact
location is still difficult. This is because the pattern is the same
everywhere along the edge.
For Patch E and F: Patches E and F are the easiest to find in
the image, because they are corners of the
building. At a corner, the patch looks different
wherever we move it. So corners are good image features.
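The reasoning about flat areas, edges and corners can be tested with a small experiment. The sketch below is a simplified illustration, not a real corner detector, although it is similar in spirit to classic shift-and-compare corner measures. It assumes a hypothetical 10 x 10 image with one bright square in the bottom-right area, moves a 3 x 3 patch by one pixel in each direction, and measures how much the patch changes using the sum of squared differences. All names in it are made up for this example.

```python
def make_image(size=10, start=5):
    """Dark image with a bright square covering rows and columns >= start."""
    return [[255 if r >= start and c >= start else 0 for c in range(size)]
            for r in range(size)]


def patch(image, top, left, size=3):
    """Cut out a size x size patch whose top-left pixel is (top, left)."""
    return [row[left:left + size] for row in image[top:top + size]]


def ssd(a, b):
    """Sum of squared differences between two patches of equal size."""
    return sum((x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb))


SHIFTS = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}


def shift_differences(image, top, left, size=3):
    """How much a patch changes when it is moved one pixel in each direction."""
    original = patch(image, top, left, size)
    return {name: ssd(original, patch(image, top + dy, left + dx, size))
            for name, (dy, dx) in SHIFTS.items()}


image = make_image()
flat = shift_differences(image, 1, 1)    # patch in the dark, flat area
edge = shift_differences(image, 4, 6)    # patch on the top edge of the square
corner = shift_differences(image, 4, 4)  # patch on the corner of the square

print("flat:  ", flat)    # 0 in every direction
print("edge:  ", edge)    # 0 for left and right, large for up and down
print("corner:", corner)  # large in every direction
```

The flat patch never changes, the edge patch changes only when moved across the edge, and the corner patch changes in every direction. This is why the corner patch can be matched to exactly one location.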
Activities:
● Game- Emoji Scavenger Hunt
https://emojiscavengerhunt.withgoogle.com/
● RGB Calculator:
https://www.w3schools.com/colors/colors_rgb.asp
● Create your own pixel art:
www.piskelapp.com
● Create your own convolutions:
http://setosa.io/ev/image-kernels/
Thank you
