Activation
Types of AF:
Activation functions can be broadly divided into 3 types:
1. Binary step Activation Function
2. Linear Activation Function
3. Non-linear Activation Functions
TanH is similar to the logistic sigmoid, but improves on it: the range of the
TanH function is from -1 to +1.
TanH is often preferred over the sigmoid neuron because it is zero-centred.
The advantage is that negative inputs are mapped strongly negative
and zero inputs are mapped near zero on the tanh graph.
tanh(x) = 2 * sigmoid(2x) - 1
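As a quick sanity check of this identity, here is a minimal NumPy sketch (the helper names sigmoid and tanh_via_sigmoid are illustrative, not taken from any library):

import numpy as np

def sigmoid(x):
    # Logistic sigmoid: maps any real input into (0, 1)
    return 1.0 / (1.0 + np.exp(-x))

def tanh_via_sigmoid(x):
    # Rescaled sigmoid, using the identity tanh(x) = 2 * sigmoid(2x) - 1
    return 2.0 * sigmoid(2.0 * x) - 1.0

x = np.linspace(-3, 3, 7)
print(np.allclose(np.tanh(x), tanh_via_sigmoid(x)))  # True
print(np.tanh(x))  # all values lie in (-1, +1) and are zero-centred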
The ReLU is the most widely used activation function. It is used in almost all
convolutional neural networks, though only in the hidden layers.
The ReLU is half rectified (from the bottom):
f(z) = 0, if z < 0
f(z) = z, otherwise
i.e. R(z) = max(0, z)
The range is 0 to inf.
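A minimal NumPy sketch of ReLU and the gradient used during backpropagation (the names relu and relu_grad are illustrative). Note that the gradient is exactly zero for negative inputs, which is the cause of the 'Dying ReLU' problem discussed below:

import numpy as np

def relu(z):
    # R(z) = max(0, z), applied element-wise
    return np.maximum(0.0, z)

def relu_grad(z):
    # Derivative used in backpropagation: 1 for z > 0, 0 otherwise
    return (z > 0).astype(float)

z = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
print(relu(z))       # [0.  0.  0.  0.5 2. ]
print(relu_grad(z))  # [0. 0. 0. 1. 1.]  (zero gradient for negative inputs)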
Advantages
Mitigates the vanishing gradient problem: the gradient is constant for positive inputs.
Computationally efficient: allows the network to converge very quickly.
Non-linear: although it looks like a linear function, ReLU has a derivative and allows for backpropagation.
Disadvantages
Can only be used in the hidden layers, not as an output activation.
Hard to train on small datasets; it needs a lot of data to learn non-linear behavior.
The Dying ReLU problem: when inputs approach zero or are negative, the gradient of the function becomes zero, so the network cannot perform backpropagation through those units and cannot learn.
The Leaky ReLU activation function was introduced to solve the 'Dying ReLU' problem.
In Leaky ReLU we do not set all negative inputs to zero but to a value near zero, which solves the major issue of the ReLU activation function.
R(z) = max(0.1*z,z)
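A minimal NumPy sketch of Leaky ReLU with the 0.1 slope used in the formula above (the names leaky_relu and leaky_relu_grad are illustrative; other implementations use a smaller slope such as 0.01):

import numpy as np

def leaky_relu(z, alpha=0.1):
    # max(alpha*z, z): negative inputs keep a small slope instead of being zeroed
    return np.where(z > 0, z, alpha * z)

def leaky_relu_grad(z, alpha=0.1):
    # Gradient is 1 for positive inputs and alpha otherwise,
    # so backpropagation still updates neurons that receive negative inputs
    return np.where(z > 0, 1.0, alpha)

z = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
print(leaky_relu(z))       # [-0.2  -0.05  0.    0.5   2.  ]
print(leaky_relu_grad(z))  # [0.1 0.1 0.1 1.  1. ]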
Advantages
Prevents the dying ReLU problem: this variation of ReLU has a small positive slope in the negative region, so it still enables backpropagation, even for negative input values.
Otherwise behaves like ReLU.
Disadvantages
Results not consistent: leaky ReLU does not provide consistent predictions for negative input values.
3.4 Softmax:
sigma(z)_i = e^{z_i} / sum_{j=1}^{K} e^{z_j}
where:
sigma = the softmax function
z_i = the i-th element of the input vector z
e^{z_i} = standard exponential function applied to the input element
K = number of classes in the multi-class classifier
e^{z_j} = standard exponential function applied to each element of the input vector; the sum over j normalizes the outputs
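A minimal NumPy sketch of softmax matching the formula above (the softmax name is illustrative; subtracting the maximum before exponentiating is a standard numerical-stability trick and does not change the result):

import numpy as np

def softmax(z):
    # Exponentiate each score and divide by the sum of exponentials
    e = np.exp(z - np.max(z))  # shift by the max for numerical stability
    return e / np.sum(e)

z = np.array([2.0, 1.0, 0.1])   # raw scores (logits) for K = 3 classes
p = softmax(z)
print(p)        # approximately [0.659 0.242 0.099]
print(p.sum())  # 1.0, the outputs form a probability distribution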
Advantages
Able to handle multiple classes, where other activation functions handle only one class: it normalizes the output for each class to between 0 and 1, dividing by the sum of the exponentials so that the probabilities sum to 1, and gives the probability of the input value belonging to a specific class.
Useful for output neurons: Softmax is typically used only in the output layer, for neural networks that need to classify inputs into multiple categories.
These models are called feedforward because information flows through the
function being evaluated from x, through the intermediate computations