0% found this document useful (0 votes)
29 views

Computer Vision - Human Visual System

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
29 views

Computer Vision - Human Visual System

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 81

Computer vision

Light and human visual system

Le Thanh Ha, Ph.D


Assoc. Prof. at University of Engineering and Technology,
Vietnam National University
ltha@vnu.edu.vn; lthavnu@gmail.com; 0983 692 592
Lecture content
• Light
• Human eye
• Perceiving objects and scenes
• Illutions

2/6/2023 Le Thanh Ha, Lab of HMI 2


LIGHT

2/6/2023 Le Thanh Ha, Lab of HMI 3


Light

c
= • : wavelength
f
• c: speed of light (2.998x108m/s)
• f: frequency

2/6/2023 Le Thanh Ha, Lab of HMI 4


Isaac Newton prism

• Sunlight consists of continuous spectrum of


colors ranging from violet to red

2/6/2023 Le Thanh Ha, Lab of HMI 5


Light and Electromagnetic Spectrum

2/6/2023 Le Thanh Ha, Lab of HMI 6


Light properties
• Light is a particular type of EM radiation:
– Can be presented by frequency or wavelength
– Natural white light (include all frequencies) comes from the sun (S. I.
Newton’s glass prism)
– When an object is radiated by while light:
• Object absorbs some ranges of light’s frequency and reflects the others
• Human eyes perceive the reflected light and assume it as the object’s color.
• Eg: green objects reflect ligth with wavelengths primarily in [500,570] nm range and
absorb most of the energy at other wavelengths.

Why we still perceive scene around even thought there is no sun?

2/6/2023 Le Thanh Ha, Lab of HMI 7


Light properties
• Light that is advoid of color is called monochromatic light (ML).
– Intensity of ML varies from black to gray, and white → gray level, or just
intensity.

• Color light:
– Luminance: amount of energy perceived.
– Chrominance:
• Frequencies of wavelight
• Hue
• Purity

2/6/2023 Le Thanh Ha, Lab of HMI 8


Light properties

• Frequency (hue, color)


• Luminance (sum of area under the freq. curve)
• Purity: (ED-EW)/ ED

2/6/2023 Lê Thanh Hà 9
Human Visual Perception
• Human perception encompasses both
– physiological aspects
– psychological aspects.
HUMAN EYE

11
Human Visual System

Exposure
Image formation Detection Processing
Control

•Cornea •Iris/pupil •Retina •Brain


•Lens •Photoreceptor •Rods
sensitivity •Cones
Human Eye
Ciliary Muscle
Sclera Ear side (Temporal)
Iris Vitreous Humor

Fovea
Pupil Eyelens
Retina

Optic Nerve
Cornea
Nose side (Nasal)
Aqueous Humor

Suspensory ligament Choroid

• Human eye is a complete imaging system.


Image Formation

Object Image

• The curved surfaces of the eye focus the image


onto the back surface of the eye.
Cornea
• The outer wall of the eye is formed by
Sclera the hard, white sclera.
Cornea • Cornea is the clear portion of the sclera.
• 2/3 of the refraction takes place at the
cornea.
Iris and Pupil
Iris
• Colored iris controls the size of the opening
(pupil) where the light enters.
Pupil
• Pupil determines the amount of light, like
the aperture of a camera.

Iris open Iris closed


Dilated pupil Constricted pupil
Lens
Ciliary muscle
• Eye lens is made of transparent fibers in a clear
membrane.
Lens

• Suspended by suspensory ligament.


Suspensory
Ligament
• Used as a fine focusing mechanism by the eye;
Transparent
provides 1/3 of eye’s total refracting power.
Fibers

• Non-uniform index of refraction.

Cross section of the eye lens


Accommodation
Distant object
• The suspensory ligaments attach the lens to the
ciliary muscle.

• When the muscle contracts, the lens bulges out


Relaxed muscle in the back, decreasing its focal length.
Taut ligaments
Near object
• The process by which the lens changes shape to
focus is called accommodation.

Contracted muscle
Slack ligaments
Aqueous Humor and Vitreous Humor
• Transparent gelatinous liquid filling the
Vitreous Humor
eye.

• Provides nutrients to the cornea and eye


lens.

Aqueous Humor • Also helps maintain the eyeball shape with


its pressure.
Retina
• Retina is the photosensitive “detector” for the eye.

Retina • Two types of receptors in the retina: rods for low light
level, and cones for color.

Fovea
• Located at the center of the retina, fovea contains a
greater concentration of cones.
Optic Nerve

• Signals from the receptors leave through the optic nerve


to the brain.
Rods and Cones
Synaptic endings
Cell nucleus
Inner segments

Outer segments

Rod Cone
• Highly sensitive to low light Sensitive to high light level
level or scotopic conditions. or photopic conditions.
• Black and white. Three types of cones
• Dispersed in the periphery of responsible for color vision.
the retina. Concentrated in the fovea.
Adaptation
• Why can’t you see immediately after you enter a
Photopic (cones)
movie theater from daylight?
Threshold of detection
(log scale)

Scotopic (rods) • The threshold of detection changes with overall


light level.

• The switch is quite gradual, until the sensitivities


0 5 10 15 20 25 30
of cones and rods cross over at about 7 minutes
in the dark.
Time in dark (minutes)
Distribution of Photoreceptors
Visual Axis
Temporal Nasal
• Cones are concentrated in the fovea.
80 º 80º
60 º 60 º
40 º 40 º • Rods predominate the periphery.
Number of receptors per mm2

20 º 20 º
0º Blind spot
160 • There is a blind spot where there are no
140 Rods
120 photoreceptors, at the point where the
100 nerves exit the eye (optic nerve).
80
60
40 Cones
20
60 º 40 º 20 º 0 º 20 º 40 º 60 º 80 º
Angle
Human Vision
– Human Cone Response to Color
• three cone types (S,I,L) correspond to B,G,R

S I L

Relative response

400 460 490 500 530 600 650 700

Wavelength (nm)

Blue Cyan Green Red


Retina
Light Cones

Rods

To optic nerve Bipolar cells

Amicrine cells

Ganglion cells

Horizontal
• The retina is made of network of nerve cells. cells
• The network works together to reduce the amount of information
in a process called lateral inhibition.
Eye Defects
Object at infinity
• Image focuses on the
retina for a normal eye.
Normal
• Distant objects look blurry
for a myopic (near sighted)
eye.
Myopic

• Near objects look blurry for


a hyperopic (far sighted)
Hyperopic
eye.

Eyes at relax state.


Myopia - Near sightedness
Far object • Distant objects look blurry
because the eye cannot
Myopic eye relaxed
Blurry
relax any farther so that
the image is focused
Near object before the retina.
Myopic eye relaxed
In focus
• Near object in focus
without accommodation.
Far object • Corrected with a negative
Myopia corrected
lens.
with a negative
lens

The virtual image from the diverging lens appears to be closer.


Hyperopia - Far sightedness
Far object
• Near objects look blurry
Hyperopic eye because the eye cannot
Partially accommodated accommodate enough
In focus for near objects.
Near object
• Far object in focus.
Hyperopic eye
Fully accommodated • Corrected with a positive
Blurry lens.
Near object

Hyperopia
corrected with a
positive lens

Light from the converging lens looks as though it is coming from the distance.
PERCEIVING OBJECTS AND SCENES
The Challenge of Object Perception

• Human perceptual system


• The stimulus on the receptors is ambiguous.
– Inverse projection problem: An image on the retina can be
caused by an infinite number of objects. How many round
objects are there? How large? How far away?
• Objects can be hidden or blurred.
– Occlusions are common in the environment.
Figure 5.3 The principle behind the inverse projection problem. The
page of the book that is near the eye creates a rectangular image on
the retina. However, this image could also have been created by the
larger more distant square, by the tilted trapezoid and by many other
stimuli. This is why we say that the image on the retina is ambiguous.
Viewpoint changes image

• Objects look different from different viewpoints


– Viewpoint invariance: the ability to recognize an object
regardless of the viewpoint
– This is a difficult task for computers to perform
The Structuralist Approach

• Approach established by Wundt (late 1800s)


– States that perceptions are created by combining elements
called sensations
Figure 5.9 According to structuralism, a number of sensations
(represented by the dots) adds up to create our perception of the face.
Birth of Gestalt Psychology

Structuralism could not explain


apparent movement produced
by a toy stroboscope.
Stimulated the founding of
Gestalt psychology in the
1920s by Wertheimer, Koffka,
and Kohler

Max Wertheimer
VL 5.2
Figure 5.10 (a) Wertheimer’s demonstration of movement perception.
The Gestalt Approach

• The whole differs from the sum of its parts.


– Perception is not built up from sensations, but is a result of
perceptual organization.
• Develop some principles of perceptual organization.
Figure 5.11 (a) This can be seen as a cube floating in front of eight
discs or as a cube seen through eight holes. In the first case, the
edges of the cube appear as illusory contours.
(b) The cube without the black circles.
Some Gestalt “Laws”

– Pragnanz – “good figure”, every stimulus is seen as simply as


possible
– Similarity - similar things are grouped together
Figure 5.13 (a) This is usually perceived as five circles, not as the
nine shapes in (b). An example of good figure.
Figure 5.14 (a) Perceived as horizontal rows or vertical columns or
both. (b) Perceived as vertical columns. An example of similarity.
More Gestalt Laws

• Good continuation - connected points resulting in straight or


smooth curves belong together
– Lines are seen as following the smoothest path
• Proximity - things that are near to each other are grouped
together
• Common region - elements in the same region tend to be grouped
together
Figure 5.17 Because of good continuation, we perceive
this pattern as continuous interwoven strands.
Principles of Perceptual Organization - continued

• Meaningfulness or familiarity - things form groups if they appear


familiar or meaningful
• Uniform connectedness - connected region of visual properties
are perceived as single unit
• Synchrony - elements occurring at the same time are seen as
belonging together
• Common fate - things moving in same direction are grouped
together
Proximity

Common Region

Connectedness

Synchrony

Figure 5.18 Grouping by (a) proximity; (b) common region; (c)


connectedness; and (d) synchrony. Synchrony occurs when yellow
lights blink on and off together.
Figure 5.20 The Forest Has Eyes by Bev Doolittle (1984).
Can you find the 12 faces in this picture?
Perceptual Segregation

• Figure-ground segregation - determining what part of environment


is the figure so that it “stands out” from the background
Figure 5.21 A version of Rubin’s reversible face-vase
figure.
Properties of figure and ground

• The figure is more “thinglike” and more memorable than


ground.
• The figure is seen in front of the ground.
• The ground is more uniform and extends behind figure.
• The contour separating figure from ground belongs to the
figure (border ownership).
Figure 5.22 (a) When the vase is perceived as figure, it is
seen in front of a homogeneous dark background.
(b) When the faces are seen as figure, they are seen in
front of a homogeneous light background.
Horizontal vs Vertical. Which is figure and which is
ground for each?
Figure 5.24 (a) Stimuli from Vecera et al. (2002).
(b) Percentage of trials on which lower or left areas were seen as figure.
Figure-Ground Segregation - continued

• Factors that determine which area is figure:


– Elements located in the lower part of displays
– Units that are symmetrical
– Elements that are small
– Units that are oriented vertically
– Elements that have meaning
PERCEIVING SCENES AND OBJECTS IN
SCENES AND FACES (FURTHER READING)

2/6/2023 Le Thanh Ha, Lab of HMI 54


Perceiving Scenes and Objects in Scenes

• A scene contains:
– background elements.
– objects organized in meaningful ways with each other and the
background.
• Difference between objects and scenes
– A scene is acted within
– An object is acted upon
Perceiving Scenes and Objects in Scenes - continued

• Research on perceiving gists of scenes


– Potter showed that people can do this when a picture is only
presented for 1/4 second
– Fei-Fei used masking to show that the overall gist is perceived
first followed by details.
– VL 5.17
Global image features of scences

• Global image features of scenes


– Degree of naturalness (forest vs street)
– Degree of openness (visible horizon)
– Degree of roughness (beach vs forest)
– Degree of expansion (or convergence)
– Color (blue for ocean, green for lawn)
• Such features are holistic and perceived rapidly
Regularities in the Environment

• Physical regularities - regularly occurring physical properties


– Oblique effect - people perceive horizontals and vertical more
easily than other orientations
– Uniform connectedness - objects are defined by areas of the
same color or texture
– Light-from-above heuristic - light in natural environment comes
from above us
Figure 5.37 (a) Some of these discs are perceived as jutting out and
some are perceived as indentations. Why? Light coming from above
would illuminate (b) the top of a shape that is jutting out and (c) the
bottom of the indentation.
Target object (barbell)

Figure 5.39 (a) One of the scenes shown to Hollingworth’s (2005)


observers. In this picture the target object is the barbell, although
observers do not know this when they are viewing the scene. “Non-
target” scenes are the same but do not include the target. Ignore circles.
Semantic regularities

--characteristics associated with the functions of scenes


– Study by Hollingworth (2005)
• Observers were presented with a scene either with or without
a target object.
• They then saw the target followed by a blank screen and
were asked where the object was in the scene or where they
would expect it to be.
• Both groups could accurately predict where the object would
be.
Figure 5.39 Circles indicate the observers’ judgments of the position
of the target object for trials in which they had seen the object in the
scene (small circle) and trials in which the object had not appeared in
the scene (larger circle).
Regularities in the Environment

• Palmer experiment
– Observers saw a context scene flashed briefly, followed by a
target picture.
– Results showed that:
• Targets congruent with the context were identified 80% of the
time .
• Targets that were incongruent were only identified 40% of the
time.
Scene presented first Which object?

Figure 5.40 Stimuli used in Palmer’s (1975) experiment. The scene at


the left is presented first, and the observer is then asked to identify
one of the objects on the right briefly flashed.
Role of Inference in Perception

• Theory of “unconscious” inference


– Created by Helmholtz (1866/1911) to explain why stimuli can
be interpreted in more than one way
– Main Principle - perceptions are result of “unconscious
assumptions” or inferences about the environment
– Likelihood principle - objects are perceived based on what is
most likely to have caused the pattern
Which if the best (most likely) display?

Figure 5.42 The display in (a) is usually interpreted as being (b) -- a


blue rectangle in front of a red rectangle. It could, however, be (c) -- a
blue rectangle and an appropriately positioned six-sided red figure.
Percieving Faces
Brain Modules Involved in Perceiving Faces

• Fusiform face area (FFA) - responds only to faces


• Amygdala (AG) - activated by emotional aspects of faces
• Superior temporal sulcus (STS) - responds to where the person is
looking and to mouth movements
• Frontal Cortex (FC) - activated when evaluating facial
attractiveness
Figure 5.45 The human brain, showing some of the areas in involved in perceiving faces: FC = frontal
cortex; STS = superior temporal sulcus; FG = fusiform gyrus; OC = occipital cortex; AG = amygdala. Note
that the labels indicate a general area of cortex, but not the overall extent of the area. Also, the amygdala is
located deep inside the cortex, approximately under the label shown here.
Perceiving an Object - Face or House?

• Experiment by Tong et al.


– Binocular rivalry used again with people
– Picture of a house shown to one eye and a face to another
– Participants pushed button to indicate perception.
– fMRI showed an increase in activity in
• Parahippocampal place area for the house
• Fusiform face area for the face
Observers in Tong and
coworkers (1998) experiment
viewed the overlapping red
house and green face
through red-green glasses,
so the house image was
presented to the right eye
and the face image to the left House→ PPA (right and left)
eye. Face-→ FFA (left hemi only)
When the observers
perceived the house, activity
occurred in the
parahippocampal place area
(PPA), in the left and right
hemispheres (red ellipses).
When the observers
perceived the face, activity
occurred in the fusiform face
area (FFA) in the left House→ PPA (right and left)
hemisphere (green ellipse). Face-→ FFA (left hemi only)
ILLUTION

2/6/2023 Le Thanh Ha, Lab of HMI 73


GIÁC QUAN + NIỀM TIN = THỰC TẠI

GIÁC QUAN + NIỀM TIN = ẢO GIÁC


Các dạng ảo ảnh thị giác

• Ảo ảnh liên quan đến tổ chức nhận thức

• Ảo ảnh độ sâu và nhận thức chuyển động

• Ảo ảnh mù độ cong

• Ảo ảnh nhận thức tương lai

• Ảo ảnh bất biến mầu sắc – độ sáng


Ảo ảnh liên quan đến tổ chức nhận thức
(Perceptual organization)

figure ground Rabbbit-duck


Kanizsa ‘s triangle
illusion illusion
Độ sâu và nhận thức chuyển động
(Depth and motion perception)

Vertical Horizontal
Ponzo illusion
illision
Ảo ảnh mù độ cong
(the illumination "Curvature Blindness" illusion)
Ảo ảnh nhận thức tương lai
(Future perception)

Hering illusion
Ảo ảnh bất biến mầu sắc – độ sáng
(color and brightness constancies)

Simultaneous
Contrast Illusion
Hermann Grid While’s Effect
2/6/2023 Le Thanh Ha, Lab of HMI 81

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy