DL Unit 4
AUTOENCODERS:
Denoising Autoencoder
Sparse Autoencoder
Deep Autoencoder
Contractive Autoencoder
Undercomplete Autoencoder
Convolutional Autoencoder
Variational Autoencoder
DENOISING AUTOENCODER:
Autoencoders are neural networks that are commonly used for feature
selection and extraction. However, when there are more nodes in the
hidden layer than there are inputs, the network risks learning the
so-called "identity function", meaning that the output simply equals the
input, which makes the autoencoder useless.
Denoising autoencoders solve this problem by corrupting the data on
purpose, randomly turning some of the input values to zero. In general,
about 50% of the input nodes are set to zero; other sources suggest a
lower fraction, such as 30%. The right amount depends on how much data
and how many input nodes you have.
Specifically, if the autoencoder has too much capacity, it can simply
memorize the data, so that the output equals the input and no useful
representation learning or dimensionality reduction takes place.
When calculating the loss function, it is important to compare the output
values with the original input, not with the corrupted input. That way,
the network cannot succeed by learning the identity function and is
forced to extract meaningful features instead.
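A minimal sketch of this training step in PyTorch (the layer sizes, the
30% corruption rate, and all variable names are illustrative assumptions,
not values fixed by these notes):

    import torch
    import torch.nn as nn

    encoder = nn.Sequential(nn.Linear(784, 128), nn.ReLU())
    decoder = nn.Sequential(nn.Linear(128, 784), nn.Sigmoid())
    model = nn.Sequential(encoder, decoder)
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
    loss_fn = nn.MSELoss()

    def train_step(x):                             # x: batch of clean inputs
        mask = (torch.rand_like(x) > 0.3).float()  # zero out ~30% of inputs
        x_corrupted = x * mask
        x_reconstructed = model(x_corrupted)
        loss = loss_fn(x_reconstructed, x)         # compare with the ORIGINAL input
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
        return loss.item()

Note that the loss on the last line of train_step is computed against the
clean x, exactly as the paragraph above requires.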
Advantages-
Because the zeroed-out values can only be recovered from the remaining
inputs, the network is forced to learn relationships between the input
features rather than the identity function, which yields a more robust
representation.
SPARSE AUTOENCODER:
Sparse autoencoders have more hidden nodes than input nodes, yet they can
still discover important features in the data.
In the usual visualization of a generic sparse autoencoder, the shading
of a node corresponds to its level of activation.
A sparsity constraint is introduced on the hidden layer to prevent the
output layer from simply copying the input data.
Sparsity may be obtained by additional terms in the loss function during
the training process, either by comparing the probability distribution of
the hidden unit activations with some low desired value, or by manually
zeroing all but the strongest hidden unit activations.
Some of the most powerful AIs of the 2010s involved sparse autoencoders
stacked inside deep neural networks.
The structure of an SAE, and what makes it different from an
undercomplete AE:
By sparsity, we mean that fewer neurons are allowed to be active at the
same time, creating an information bottleneck similar to that of an
undercomplete AE.
Advantages-
Sparse autoencoders apply a sparsity penalty on the hidden layer in
addition to the reconstruction error, keeping the hidden activations
close to zero but not exactly zero. This helps prevent overfitting.
They keep the highest activation values in the hidden layer and zero out
the rest of the hidden nodes. This prevents the autoencoder from using
all of the hidden nodes at once and forces only a reduced number of
hidden nodes to be used, as in the sketch below.
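A minimal sketch of the penalty term described above, using an L1 penalty
on the hidden activations (a KL-divergence penalty toward a low target
activation probability is the common alternative). The layer sizes and
the value of sparsity_weight are assumptions for illustration:

    import torch
    import torch.nn as nn

    encoder = nn.Linear(784, 1024)   # overcomplete: more hidden nodes than inputs
    decoder = nn.Linear(1024, 784)
    sparsity_weight = 1e-4           # strength of the sparsity penalty

    def sparse_loss(x):
        h = torch.relu(encoder(x))                  # hidden activations
        x_hat = torch.sigmoid(decoder(h))
        reconstruction = ((x_hat - x) ** 2).mean()  # reconstruction error
        sparsity = h.abs().mean()                   # pushes activations toward zero
        return reconstruction + sparsity_weight * sparsity

The returned loss is backpropagated as usual; the L1 term drives most
hidden activations to zero, so only a few nodes stay active per input.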
Drawbacks-
The desired sparsity level and the weight of the penalty term are
additional hyperparameters that have to be tuned.
CONTRACTIVE AUTOENCODER:
The objective of a contractive autoencoder is to learn a robust
representation that is less sensitive to small variations in the data.
This robustness is obtained by adding a penalty term to the loss
function.
The contractive autoencoder is another regularization technique, just
like sparse and denoising autoencoders. However, its regularizer
corresponds to the Frobenius norm of the Jacobian matrix of the encoder
activations with respect to the input.
The Frobenius norm of the Jacobian of the hidden layer is calculated with
respect to the input; the penalty is the sum of the squares of all
elements of this matrix (the squared Frobenius norm). A sketch of this
penalty follows the reference below.
https://www.geeksforgeeks.org/contractive-autoencoder-cae/
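A minimal sketch of this penalty in PyTorch, assuming a single sigmoid
hidden layer h = sigmoid(Wx + b). For that case the squared Frobenius
norm of the Jacobian dh/dx has the well-known closed form
sum_j (h_j (1 - h_j))^2 * sum_i W_ji^2. The sizes and the weight lam are
illustrative assumptions:

    import torch
    import torch.nn as nn

    W = nn.Parameter(torch.randn(128, 784) * 0.01)
    b = nn.Parameter(torch.zeros(128))
    decoder = nn.Linear(128, 784)
    lam = 1e-4                                # weight of the contractive penalty

    def contractive_loss(x):                  # x: (batch, 784)
        h = torch.sigmoid(x @ W.T + b)        # hidden activations, (batch, 128)
        x_hat = decoder(h)
        reconstruction = ((x_hat - x) ** 2).mean()
        dh = h * (1 - h)                      # sigmoid derivative, (batch, 128)
        w_sq = (W ** 2).sum(dim=1)            # sum of squared weights per hidden unit
        jac_fro = ((dh ** 2) * w_sq).sum(dim=1).mean()  # squared Frobenius norm
        return reconstruction + lam * jac_fro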
Advantages-
The learned representation is explicitly trained to be insensitive to
small perturbations of the input, which makes contractive autoencoders
well suited for robust feature extraction.
Drawbacks-
The penalty requires computing the Jacobian of the hidden activations
with respect to the input for every training example, which makes each
training step more expensive.
Denoising Autoencoders (from Srihari's Deep Learning slides)
• A denoising autoencoder (DAE) receives a corrupted data point as input
and is trained to predict the original, uncorrupted data point as its
output.
• Traditional autoencoders minimize L(x, g(f(x))), where L is a loss
function penalizing g(f(x)) for being dissimilar from x, such as the
squared L2 norm of the difference (mean squared error).
• A DAE instead minimizes L(x, g(f(x̃))), where x̃ is a copy of x that has
been corrupted by some form of noise.
• The autoencoder must undo this corruption rather than simply copy its
input.
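Written out compactly, the two objectives above are (a restatement in
standard textbook notation, where C denotes the corruption process):

    \min_{f,g}\; L\bigl(x,\, g(f(x))\bigr), \qquad
        L(x,\hat{x}) = \lVert x - \hat{x}\rVert_2^2
        \quad \text{(traditional autoencoder)}

    \min_{f,g}\; \mathbb{E}_{\tilde{x}\sim C(\tilde{x}\mid x)}\,
        L\bigl(x,\, g(f(\tilde{x}))\bigr)
        \quad \text{(denoising autoencoder)}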
• A DAE learns a vector field: the reconstruction g(f(x̃)) − x̃ points
from corrupted points back toward the data manifold.