Untitled Document
Perfectly labelled data sets are often hard to find. In such
situations, the data used to train the machine are neither labelled nor
classified. Unsupervised learning is an ML technique in which we do not
supply labelled data; instead, we allow the machine learning model
(algorithm) to discover the patterns on its own. The task of the machine
is to group unsorted information according to similarities,
patterns and differences without any prior training on labelled data.
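As a minimal sketch of how a machine can group unlabelled data by resemblance, the example below uses k-means clustering on a single numeric feature. The notes do not name a specific algorithm, so k-means is only one illustrative choice; the function name kmeans_1d, the height values and the starting centers are all made up for this example.

```python
def kmeans_1d(values, centers, iterations=10):
    """Group unlabelled numbers around the nearest of the given starting centers."""
    centers = list(centers)
    groups = []
    for _ in range(iterations):
        # Assignment step: put each value into the group of its nearest center.
        groups = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            groups[nearest].append(v)
        # Update step: move each center to the mean of its group
        # (an empty group keeps its old center).
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return centers, groups


# Made-up, unlabelled data: nobody told the program which values belong together.
heights_cm = [150, 152, 155, 180, 182, 185]
centers, groups = kmeans_1d(heights_cm, centers=[150, 185])
print(groups)  # [[150, 152, 155], [180, 182, 185]]
```

No labels are supplied anywhere: the two groups emerge only from how close the values are to each other.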
Deep learning is inspired by the human brain and the neurons in it.
Input Node: This is the layer where information or initial data from the
real world is introduced into the neural network. The information is
then passed on to the hidden nodes, where computation begins.
Hidden Node: There is no connection to the real world at this stage. This
is where the machine takes the information received from the
input nodes and carries out computation and processing on it. There can
be more than one hidden layer.
Output Node: This is the final stage, where the computations conclude
and the results are made available to the output layer, from where they are
transferred back into the real-world environment.
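The flow from input nodes through hidden nodes to output nodes can be sketched as a small forward pass. This is only an illustration under stated assumptions: the sigmoid activation, the layer sizes and every weight and bias value below are hypothetical and do not come from the notes.

```python
import math


def sigmoid(x):
    """Squash any number into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def layer(inputs, weights, biases):
    """Each node computes a weighted sum of its inputs plus a bias, then applies sigmoid."""
    return [
        sigmoid(sum(w * x for w, x in zip(node_weights, inputs)) + b)
        for node_weights, b in zip(weights, biases)
    ]


def forward(inputs, hidden_w, hidden_b, output_w, output_b):
    """Pass data from the input nodes through one hidden layer to the output nodes."""
    hidden = layer(inputs, hidden_w, hidden_b)   # hidden nodes: no contact with the real world
    return layer(hidden, output_w, output_b)     # output nodes: result handed back to the user


# Hypothetical weights: 2 input nodes -> 2 hidden nodes -> 1 output node.
hidden_w = [[0.5, -0.5], [0.3, 0.8]]
hidden_b = [0.0, 0.1]
output_w = [[1.0, -1.0]]
output_b = [0.0]

prediction = forward([1.0, 2.0], hidden_w, hidden_b, output_w, output_b)
print(prediction)
```

Adding more hidden layers would mean calling layer once more for each extra layer before the output step.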
Chapter - 2
Rule-based Chatbot: This is the simpler form of chatbot, which follows
a set of pre-defined rules when responding to users' questions. For
example, a chatbot installed at a school reception area can retrieve data
from the school's archive to answer queries on the school fee structure,
courses offered, pass percentage, etc.
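A rule-based chatbot of this kind can be sketched as a list of keyword rules with a fallback reply. The keywords and the canned answers below are hypothetical placeholders; a real system would look the answers up in the school's archive instead.

```python
# Hypothetical rules: (keyword, canned answer). Placeholders only.
RULES = [
    ("fee", "The fee structure is available at the school office."),
    ("course", "The list of courses offered is on the notice board."),
    ("pass", "The pass percentage is published in the annual report."),
]
FALLBACK = "Sorry, I can only answer questions about fees, courses and pass percentage."


def reply(question):
    """Return the answer of the first rule whose keyword appears in the question."""
    text = question.lower()
    for keyword, answer in RULES:
        if keyword in text:
            return answer
    # No pre-defined rule matched, so the bot cannot answer.
    return FALLBACK


print(reply("What is the fee structure for class 10?"))
print(reply("Can you tell me a joke?"))
```

The second question falls outside the pre-defined rules, which shows the main limitation of this type of chatbot: it cannot handle anything its rules do not cover.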
Algorithm: While an algorithm does not inject bias in itself, it can amplify
the biases present in the data. For example, consider an image classifier
trained on images available in the public domain, which happen to show more
women than men in the kitchen. AI algorithms are designed to
maximize accuracy, so an AI algorithm may decide that all people in the
kitchen are women, despite the fact that the training data has some images
of men in the kitchen. It thereby incorporates a gender bias into the AI
system.
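The amplification effect can be shown with a few lines of arithmetic. The sketch below assumes a deliberately simplified classifier that sees only "this person is in a kitchen" and must pick one label for everyone; the 70/30 split is made up for illustration and is not a real statistic.

```python
from collections import Counter

# Made-up training labels for kitchen images (illustrative numbers only).
kitchen_labels = ["woman"] * 70 + ["man"] * 30


def most_accurate_single_guess(labels):
    """Return the one label that maximises accuracy when every image gets the same answer."""
    return Counter(labels).most_common(1)[0][0]


guess = most_accurate_single_guess(kitchen_labels)
predictions = [guess] * len(kitchen_labels)

share_in_data = kitchen_labels.count("woman") / len(kitchen_labels)
share_in_predictions = predictions.count("woman") / len(predictions)
accuracy = sum(p == y for p, y in zip(predictions, kitchen_labels)) / len(kitchen_labels)

print(share_in_data, share_in_predictions, accuracy)  # 0.7 1.0 0.7
```

Women make up 70% of the training images but 100% of the predictions: maximizing accuracy has turned a skew in the data into an absolute rule.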
People: The last issue is with the people who develop the AI
system, i.e. engineers, scientists, developers, etc. They aim to get the
most accurate results with the available data and are often less
focused on the broader context. It is rightly said that ethics and bias are
not the problem of the machine but that of the humans behind the
machine.
Chapter - 5
Storytelling is defined as the art of narrating stories to engage an
audience. It originated in ancient times with visual stories, such as
cave drawings, and then shifted to oral traditions, in which stories were
passed down from generation to generation by word of mouth. Later,
words were formed into narratives, which included written, printed and typed
stories. Written language, as we see it now, was arguably the first
technological innovation that gave us as a species the power to convey
a story in a physical format, and thus to visualize, archive and share that
data with community members and future generations. Storytelling encourages
people to use their imagination and inventiveness (creativity) to
express themselves (verbal skills), which makes it a lot more than just a
recitation of facts and events.
The steps involved in telling an effective data story are given below:
1- Understanding the audience
2- Choosing the right data and visualisations
3- Drawing attention to key information
4- Developing a narrative
5- Engaging your audience
Chapter - 8
Regression can be defined as a method or algorithm in Machine
Learning that models a target value based on independent predictors. It
is essentially a statistical tool used to find the relationship
between a dependent variable and one or more independent variables. This method
comes into play in forecasting and in finding cause-and-effect
relationships between variables.
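As a minimal sketch, the simplest case (one independent variable, fitted with a straight line by ordinary least squares) can be written in a few lines. The function name fit_line and the hours/marks data are made up for illustration; the marks were chosen to lie exactly on the line 50 + 2 * hours so the result is easy to check.

```python
def fit_line(xs, ys):
    """Fit y = intercept + slope * x by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope = covariance of x and y divided by variance of x.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up data: hours studied (independent) and marks scored (dependent).
hours = [1, 2, 3, 4, 5]
marks = [52, 54, 56, 58, 60]

slope, intercept = fit_line(hours, marks)
print(slope, intercept)           # 2.0 50.0

# Forecasting: predict the marks for 6 hours of study.
print(intercept + slope * 6)      # 62.0
```

The slope describes how much the dependent variable changes for each unit of the independent variable, and the fitted line is what gets used for forecasting.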
A scatter plot (also called a scatter chart or scatter graph) uses dots to represent
values for two different numeric variables. The position of each dot on
the horizontal and vertical axes indicates the values for an individual data
point. Scatter plots are used to observe relationships between variables