Lec08-1 Activation Functions
Activation functions
• They introduce non-linear properties to the network.
• A linear function is a polynomial of degree one and always forms a
straight line.
• If we add more dimensions, it forms a plane or hyperplane, but the
shape is still perfectly straight, never curved.
• Polynomials of higher degree are non-linear and produce curves.
• Linear equations are easy to solve, but they are too limited for a NN to
represent arbitrary functions, i.e. to act as a “universal function approximator”.
• If we do not use any non-linear fn., the NN behaves as a single-layer
network no matter how many layers are used, because composing linear
layers just produces another linear network (as sketched below).
Such a network is not powerful enough to model arbitrary data.
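A minimal NumPy sketch of this point (the layer sizes and random weights are arbitrary assumptions): two stacked linear layers with no activation compute exactly the same mapping as one linear layer.

import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=(4, 3))        # a batch of 4 inputs with 3 features
W1 = rng.normal(size=(3, 5))       # weights of the first "layer"
W2 = rng.normal(size=(5, 2))       # weights of the second "layer"

two_layers = x @ W1 @ W2           # two linear layers, no activation in between
one_layer = x @ (W1 @ W2)          # a single equivalent linear layer

print(np.allclose(two_layers, one_layer))   # True: the extra layer added nothing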
Activation function
• With a non-linear activation, the mapping from input
to output becomes non-linear.
• The activation fn. should be differentiable so that its
derivative can be computed during back-propagation, the
optimization strategy that follows the gradient in order to
learn complex, non-linear behaviour.
• The idea behind activation is to model how neurons
communicate with each other in the brain.
• Each neuron is activated through its action potential: it fires
only if its input reaches a certain threshold, otherwise it stays
inactive (a minimal sketch follows).
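A minimal sketch of the action-potential idea (the weights and threshold are made-up values): the unit fires, i.e. outputs 1, only if its weighted input reaches the threshold.

import numpy as np

def threshold_neuron(x, w, threshold=0.5):
    # Fire (return 1) if the weighted sum reaches the threshold, else stay silent (0).
    return 1 if np.dot(w, x) >= threshold else 0

w = np.array([0.4, 0.6])
print(threshold_neuron(np.array([1.0, 1.0]), w))   # 1: the neuron fires
print(threshold_neuron(np.array([0.2, 0.1]), w))   # 0: below threshold, no firing

Note that this hard threshold is not differentiable at the threshold itself, which is why smooth activations such as the sigmoid are preferred when training with back-propagation.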
Why Activation fn.s in Neural Nets
• They are used to keep the outputs within given bounds, usually 0 to 1.
• They impart non-linearity, which is an important factor for the
effectiveness and accuracy of the model.
• So we must know about them.
Activation functions (AF)
• Various threshold functions can be considered as AFs.
• Identity: f(x) = x
• Threshold: f(x) = 0 for x < 0 and f(x) = 1 for x >= 0 (useful in classifiers)
• The most popular are (sketched below):
• Sigmoid
• tanh
• ReLU
• Leaky ReLU
• Maxout
• Softmax (also used for classification by computing class probabilities)
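Minimal NumPy sketches of the listed functions (Maxout is omitted since it needs extra learned weights rather than a fixed formula; the leaky slope 0.01 is a common default assumed here):

import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))          # squashes to (0, 1)

def tanh(x):
    return np.tanh(x)                        # squashes to (-1, 1), zero-centered

def relu(x):
    return np.maximum(0.0, x)                # 0 for negatives, identity for positives

def leaky_relu(x, alpha=0.01):
    return np.where(x > 0, x, alpha * x)     # small slope instead of 0 for negatives

def softmax(x):
    e = np.exp(x - np.max(x))                # shift for numerical stability
    return e / e.sum()                       # outputs sum to 1, usable as probabilities

z = np.array([-2.0, 0.0, 3.0])
print(sigmoid(z), tanh(z), relu(z), leaky_relu(z), softmax(z), sep="\n")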
tanh
• Squashes its input to the range -1 to 1, so unlike the Sigmoid its output is zero-centered.
Sigmoid
• Takes any real number and squashes it into the range 0 to 1 (even a
very large positive value is bounded, which avoids unbounded growth of
activations through the NN), so the output can be interpreted as how
strongly a neuron fires.
• 0 means no firing, and
• 1 means fully saturated firing.
• Easy to understand, and thus popular.
• But it has two problems:
1. It causes the gradient to vanish.
• When the neuron's activation saturates close to either 0 or 1, the gradient
becomes very close to 0.
• During back-propagation this local gradient is multiplied by the gradient
of the gate's output with respect to the whole objective.
• If this local gradient is very small, it makes the overall gradient slowly vanish,
and almost no signal flows through the neuron to its weights and, recursively,
to its data (see the numeric sketch after this list).
2. Its output is not zero-centered.
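A small numeric sketch of the vanishing-gradient point (the layer count of 10 is an arbitrary assumption): the sigmoid's local gradient, sigmoid(x) * (1 - sigmoid(x)), is at most 0.25 and nearly 0 once the unit saturates, so repeated multiplication during back-propagation drives the signal towards 0.

import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sigmoid_grad(x):
    s = sigmoid(x)
    return s * (1.0 - s)

print(sigmoid_grad(0.0))    # 0.25, the largest possible local gradient
print(sigmoid_grad(10.0))   # ~4.5e-05, a saturated unit passes almost no gradient

# Even the best-case local gradient, multiplied across 10 layers:
print(0.25 ** 10)           # ~9.5e-07, the signal has effectively vanished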
Problems with Sigmoid
• Its output always lies between 0 and 1.
• That means the values after the fn. are always positive, which makes the
gradients of the weights become either all positive or all negative.
• This forces the gradient updates to zig-zag rather than move directly towards
the optimum, which makes optimization harder (a small demonstration follows).
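A tiny sketch of this, assuming a single unit whose inputs are outputs of a previous sigmoid layer and hence all positive (the values of x and the upstream gradient delta are made up): each weight's gradient is delta * x_i, so all weight gradients share the sign of delta and the update can only move in an all-positive or all-negative direction.

import numpy as np

x = np.array([0.2, 0.7, 0.9])   # all positive, as sigmoid outputs always are
delta = -1.3                    # upstream gradient flowing into the unit

grad_w = delta * x
print(grad_w)                   # every entry is negative: same sign as delta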
1. But if a lot of neurons die, then consider Leaky ReLU, Maxout, or other variants (see the sketch below).
2. But don’t consider Sigmoid or tanh.
3. There is still much room for improvement.
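A minimal sketch of why Leaky ReLU helps with dying units (the slope 0.01 is a common choice assumed here): for a negative input, ReLU's gradient is exactly 0, so the unit stops learning, while Leaky ReLU keeps a small gradient flowing.

import numpy as np

def relu_grad(x):
    return np.where(x > 0, 1.0, 0.0)

def leaky_relu_grad(x, alpha=0.01):
    return np.where(x > 0, 1.0, alpha)

z = np.array([-3.0, -0.5, 2.0])
print(relu_grad(z))          # [0.   0.   1.]   no gradient for negative inputs
print(leaky_relu_grad(z))    # [0.01 0.01 1.]   a small gradient keeps flowing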