
Activation Functions

Dr. Chiradeep Mukherjee


Department of CST and CSIT
University of Engineering and Management Kolkata
Activation Functions
Definition: An activation function decides whether a neuron should be activated or not. In other words, it decides whether a neuron's input to the network is important for the prediction, using simple mathematical operations.

Types of Activation Function: Activation functions can be broadly divided into two types:
i) Linear Activation Function
ii) Non-linear Activation Functions

Linear Activation Function: It does not help with the complexity or the various parameters of the usual data that is fed to the neural network.

Non-linear Activation Functions: The non-linear activation functions are the most widely used. They make it easy for the model to generalize or adapt to a variety of data and to differentiate between the outputs. The main terminologies needed to understand non-linear functions are:

Derivative or Differential: the change along the y-axis with respect to the change along the x-axis; also known as the slope.

Monotonic function: a function which is either entirely non-increasing or entirely non-decreasing.
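To make these terms concrete, here is a minimal NumPy sketch (not part of the original slides; the sample points are illustrative) that estimates the slope of a linear activation and of the sigmoid by a finite difference, and checks monotonicity on a sampled grid:

import numpy as np

def linear(z):
    return z                             # linear activation: slope is 1 everywhere

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))      # non-linear: slope changes with z

def slope(f, z, h=1e-6):
    """Derivative of f at z, approximated by a central finite difference."""
    return (f(z + h) - f(z - h)) / (2 * h)

print("slope of linear at z=-2 and z=2 :", slope(linear, -2.0), slope(linear, 2.0))
print("slope of sigmoid at z=-2 and z=2:", slope(sigmoid, -2.0), slope(sigmoid, 2.0))

# Monotonic: the values never decrease as z increases (sigmoid passes this check).
z = np.linspace(-10, 10, 1001)
print("sigmoid is monotonic (non-decreasing):", bool(np.all(np.diff(sigmoid(z)) >= 0)))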
Purpose of Activation Functions
• The purpose of an activation function in a neural network is to introduce non-linearity into the model. Here's a more detailed
breakdown of its role:
• 1. Introducing Non-linearity
• Without an activation function, a neural network would essentially be a linear model, no matter how many layers it has, because a composition of linear functions is still a linear function (a small sketch after this slide's bullets demonstrates this).
• The activation function enables the network to learn complex, non-linear relationships in the data. By applying non-linearity, the
network can model a wider range of patterns and behaviors, making it capable of solving more complex tasks.
• 2. Enabling Neural Networks to Learn Complex Patterns
• Real-world data (like images, speech, text, etc.) is often non-linear, and the relationships between inputs and outputs can be
intricate. Without non-linear activation functions, the neural network would be limited in its capacity to model such relationships.
• Activation functions allow the neural network to learn hierarchical patterns, making it suitable for complex tasks such as image
recognition, natural language processing, and more.
• 3. Control of Output Range
• Activation functions can also serve to control the range of the output of a neuron. For example, functions like sigmoid or tanh
compress the output to a specific range (e.g., between 0 and 1 for sigmoid or -1 to 1 for tanh), which can be useful for certain tasks
like binary classification.
• Some activation functions, like ReLU (Rectified Linear Unit), do not compress the output but rather set a lower bound, which can
help with issues like vanishing gradients.
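As a minimal demonstration of point 1 above (the layer sizes and random weights are illustrative, not from the slides), the sketch below stacks two layers with no activation function and shows that the composition is still a single linear map:

import numpy as np

rng = np.random.default_rng(0)

# Two "layers" with no activation: y = W2 @ (W1 @ x + b1) + b2
W1, b1 = rng.normal(size=(4, 3)), rng.normal(size=4)
W2, b2 = rng.normal(size=(2, 4)), rng.normal(size=2)

x = rng.normal(size=3)
two_layer_out = W2 @ (W1 @ x + b1) + b2

# The same mapping collapses into one linear layer: y = W @ x + b
W = W2 @ W1
b = W2 @ b1 + b2
one_layer_out = W @ x + b

print(np.allclose(two_layer_out, one_layer_out))  # True: still a linear model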
Purpose of Activation Functions
• 4. Gradient Flow
• In backpropagation, gradients are propagated backward through the network to adjust the weights. The behavior of the activation
function affects how well gradients flow through the network.
• Activation functions like ReLU (and its variants like Leaky ReLU or Parametric ReLU) help mitigate issues such as vanishing
gradients, which can occur with functions like sigmoid or tanh when the gradients become very small and prevent effective
learning.
• Common Activation Functions:
• Sigmoid: Outputs values between 0 and 1, typically used for binary classification.
• Tanh: Outputs values between -1 and 1, often used when both positive and negative outputs are needed.
• ReLU (Rectified Linear Unit): Outputs the input directly if positive, and zero otherwise. It's widely used because it helps mitigate
the vanishing gradient problem and is computationally efficient.
• Leaky ReLU: A variation of ReLU that allows a small, non-zero gradient when the input is negative, which can help during
training by preventing "dead neurons."
• Softmax: Used in the output layer for multi-class classification tasks, converting the output into a probability distribution.
• In summary, the activation function plays a key role in enabling a neural network to learn and approximate complex functions by
introducing non-linearity, ensuring efficient gradient flow, and controlling the output values of the neurons.
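For reference, here is a minimal NumPy sketch (not from the original slides) of the activation functions listed above; the Leaky ReLU slope alpha = 0.01 is an illustrative default:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))       # output in (0, 1)

def tanh(z):
    return np.tanh(z)                     # output in (-1, 1)

def relu(z):
    return np.maximum(0.0, z)             # 0 for z < 0, z otherwise

def leaky_relu(z, alpha=0.01):            # alpha is an illustrative choice
    return np.where(z > 0, z, alpha * z)  # small slope for z < 0 avoids "dead neurons"

def softmax(z):
    e = np.exp(z - np.max(z))             # subtract max for numerical stability
    return e / e.sum()                    # probabilities that sum to 1

z = np.array([-2.0, -0.5, 0.0, 1.0, 3.0])
print(relu(z))
print(leaky_relu(z))
print(softmax(z))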
Non-Linear Activation Functions
i) Sigmoid function: σ(z) = 1 / (1 + e^(-z))

ii) Hyperbolic Tangent (tanh) function: g(z) = tanh(z) = (e^z - e^(-z)) / (e^z + e^(-z))

[Figure: graphs of σ(z) and g(z)]

Note: The tanh activation function often works better because its output lies between -1 and +1 and is zero-centred. Fixing the mean at 0 makes the next layer's decision easier.
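A quick numerical check of the zero-mean point in the note above (the symmetric input range is an illustrative choice, not from the slides):

import numpy as np

z = np.linspace(-5, 5, 1001)                                       # inputs symmetric around 0
print("mean of sigmoid outputs:", (1 / (1 + np.exp(-z))).mean())   # ~0.5, not zero-centred
print("mean of tanh outputs:   ", np.tanh(z).mean())               # ~0.0, zero-centred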
Non-Linear Activation Functions

iii) Rectified Linear Unit (ReLU): ReLU(z) = max(0, z)

iv) Leaky ReLU Function: PReLU(z) = max(α*z, z), where α is a small positive constant (e.g. 0.01); when α is learned during training, this is the Parametric ReLU (PReLU).

[Figure: graphs of ReLU(z) and PReLU(z)]
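A minimal sketch (not from the slides) contrasting the gradients of ReLU and Leaky ReLU for negative inputs, which is why Leaky ReLU helps avoid "dead neurons"; α = 0.01 is an illustrative value:

import numpy as np

def relu_grad(z):
    return np.where(z > 0, 1.0, 0.0)      # gradient is exactly 0 for negative inputs

def leaky_relu_grad(z, alpha=0.01):
    return np.where(z > 0, 1.0, alpha)    # keeps a small gradient (alpha) for negative inputs

z = np.array([-3.0, -0.1, 0.5, 2.0])
print("ReLU gradients:      ", relu_grad(z))        # zeros for the negative inputs
print("Leaky ReLU gradients:", leaky_relu_grad(z))  # alpha for the negative inputs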
Choice of Activation Functions
i) Why and when sigmoid function:
• The main reason why we use the sigmoid function is that its output lies between 0 and 1. Therefore, it is especially used for models where we have to predict a probability as the output. Since the probability of anything exists only in the range 0 to 1, sigmoid is the right choice for the OUTPUT layer of a binary classification problem.
• The function is differentiable, which means we can find the slope of the sigmoid curve at any point.
• The function is monotonic but function’s derivative is not.
• The logistic sigmoid function can cause a neural network to get stuck during training. EXCEPT FOR BINARY CLASSIFICATION, WE SHOULD AVOID IT.

ii) Why and when tanh Function:


• The range of the tanh function is from -1 to 1.
• The advantage is that the negative inputs will be mapped strongly negative and the zero inputs will be mapped
near zero in the tanh graph.
• The function is differentiable.
• The function is monotonic while its derivative is not monotonic.
• The tanh function is mainly used for classification between two classes.
• Both tanh and logistic sigmoid activation functions are used in feed-forward nets, AND IT IS RECOMMENDED TO USE tanh IN THE HIDDEN LAYERS.
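Putting the two recommendations together, here is a minimal forward-pass sketch (layer sizes and weights are illustrative, not from the slides) that uses tanh in the hidden layer and sigmoid at the output of a binary classifier:

import numpy as np

rng = np.random.default_rng(42)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Illustrative 3 -> 4 -> 1 network for binary classification.
W1, b1 = rng.normal(size=(4, 3)), np.zeros(4)
W2, b2 = rng.normal(size=(1, 4)), np.zeros(1)

x = np.array([0.2, -1.0, 0.5])
h = np.tanh(W1 @ x + b1)        # tanh in the hidden layer (zero-centred outputs)
p = sigmoid(W2 @ h + b2)        # sigmoid at the output: a probability in (0, 1)
print("predicted probability of class 1:", float(p))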
Derivative of Activation Functions
i) When we implement backpropagation for a neural network, we need to compute the slope, or the derivative, of the activation function.
ii) When updating the model during training, we need to know in which direction and by how much to change the weights, and that depends on the slope. That is why we use differentiation in almost every part of Machine Learning and Deep Learning.

Derivative of Sigmoid Function:

σ(z) = 1 / (1 + e^(-z))
σ'(z) = d/dz [1 / (1 + e^(-z))] = e^(-z) / (1 + e^(-z))² = σ(z)(1 - σ(z))
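A minimal numerical check of this identity (the test point z = 0.7 is illustrative):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

z0, h = 0.7, 1e-6
numerical = (sigmoid(z0 + h) - sigmoid(z0 - h)) / (2 * h)   # finite-difference slope
analytic = sigmoid(z0) * (1 - sigmoid(z0))                  # sigma(z)(1 - sigma(z))
print(abs(numerical - analytic) < 1e-8)                     # True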


Derivative of Activation Functions
Derivative of tanh Function:

g(z) = tanh(z) = (e^z - e^(-z)) / (e^z + e^(-z))
g'(z) = d/dz tanh(z) = 1 - tanh²(z) = 1 - (g(z))²
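And a matching numerical check for the tanh identity (the test point z = -0.3 is illustrative):

import numpy as np

z0, h = -0.3, 1e-6
numerical = (np.tanh(z0 + h) - np.tanh(z0 - h)) / (2 * h)   # finite-difference slope
analytic = 1.0 - np.tanh(z0) ** 2                           # 1 - (g(z))^2
print(abs(numerical - analytic) < 1e-8)                     # True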

Plots of Derivatives of Activation Functions:

[Figure: graphs of σ'(z) and g'(z)]