Revision Questions

The document provides answers to several questions related to machine learning and natural language processing concepts. It defines artificial intelligence as machines that can make decisions based on training data. It also discusses different types of intelligences according to multiple intelligence theory and explains how neural networks work by automatically extracting features from data without programmer input. The document further demonstrates the steps for text normalization including tokenization, stopword removal, case conversion and stemming/lemmatization.

Uploaded by

Om Dixit
Copyright
© © All Rights Reserved

Revision Questions

Q. A corpus contains 12 documents. How many document vectors will be there for that corpus?

a. 12 b. 1 c. 24 d. 1/12

Ans. a. 12. Each document in the corpus gets its own document vector.

Q. How do you understand whether a machine/application is AI based or not? Explain with the help of an example.

Ans. Any machine that has been trained with data and can make decisions/predictions on its own can be termed AI.
Eg: A bot or automation machine that has not been trained with any data is not AI, while a chatbot that understands and
processes human language is AI.

Q. If you do an image search for vacations on a popular search engine, the first few results are mostly
pictures of beaches. What is the concern here? Explain.

Ans. In the given scenario, the concern is bias. A machine is artificial and cannot think on its own. It can have
intelligence, but we cannot expect it to have any biases of its own. Any bias it shows, such as equating vacations
with beaches, has transferred from the developer to the machine while the algorithm was being developed.

Q. What will be the result of converting the term ‘happily’ by stemming and by lemmatization?
Which process takes longer to execute?

Ans. Stemming: happi

Lemmatization: happy

The process that takes longer to execute is lemmatization.
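
The difference can be sketched in Python with toy rules. This is only an illustration: the suffix list and the lookup table below are assumptions invented for this example, not the rules of any real stemmer or lemmatizer. Stemming simply chops off a suffix, so the result need not be a real word. Lemmatization looks for a meaningful base form, and that extra lookup is why it takes longer.

```python
# Toy illustration only: real stemmers and lemmatizers use much larger
# rule sets and dictionaries. Both tables below are hypothetical.
SUFFIXES = ("ly", "ing", "ed", "s")       # assumed suffix list
LEMMA_DICTIONARY = {"happily": "happy"}   # assumed lookup table


def toy_stem(word: str) -> str:
    """Strip the first matching suffix without checking the result is a word."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word


def toy_lemmatize(word: str) -> str:
    """Return the dictionary base form, falling back to the word itself."""
    return LEMMA_DICTIONARY.get(word, word)


print(toy_stem("happily"))       # happi
print(toy_lemmatize("happily"))  # happy
```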

Q. What do we get from the “bag of words” algorithm?

Ans. Bag of words gives us two things:
1. A vocabulary of words for the corpus.
2. The frequency of these words (the number of times each word has occurred in the whole corpus).
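
A minimal Python sketch of these two outputs is given below. The function name `bag_of_words` and the tiny corpus are hypothetical, chosen only for illustration, and the documents are assumed to be already normalised into lists of tokens.

```python
from collections import Counter


def bag_of_words(documents):
    """Return (vocabulary, frequency) for a list of normalised token lists."""
    frequency = Counter(token for doc in documents for token in doc)
    vocabulary = list(frequency)  # unique words, in order of first appearance
    return vocabulary, frequency


# Hypothetical normalised corpus, for illustration only.
corpus = [["cats", "like", "milk"], ["dogs", "like", "bones"]]
vocabulary, frequency = bag_of_words(corpus)

print(vocabulary)         # ['cats', 'like', 'milk', 'dogs', 'bones']
print(frequency["like"])  # 2
```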

Q. Ashwat is amazed to learn about his sister Ananya who is multi-talented and has excelled in academics, music,
dancing, sports and painting. He was quite curious when Ananya told him that he too possessed all these
intelligences like every human being does, but only at different levels. He wondered which intelligence she was
talking about. Can you help Ashwat in learning about different types of intelligences by naming and explaining any
four types of intelligences?

Ans. All humans possess 9 types of intelligence, but at different levels. They are:

1. Mathematical Logical Reasoning: ability to regulate, measure, and understand numerical symbols, abstraction and
logic.

2. Linguistic Intelligence: language processing skills, both in understanding and in expressing oneself in writing or verbally.

3. Spatial Visual Intelligence: ability to perceive the visual world and the relationship of one object to another.

4. Kinesthetic Intelligence: ability to use one's limbs in a skilled manner.

5. Musical Intelligence: ability to recognize and create sounds, rhythms, and sound patterns.

6. Intrapersonal Intelligence: level of self-awareness, from recognizing one's own weaknesses and strengths
to understanding one's own feelings.

7. Existential Intelligence: an additional category of intelligence relating to religious and spiritual awareness.

8. Naturalist Intelligence: an additional category of intelligence relating to the ability to process information about the
environment around us.

9. Interpersonal Intelligence: ability to communicate with others by understanding other people's feelings and the influence
of the person.

Q. Neural networks are said to be modelled on the way neurons in the human brain behave. A similar system is
mimicked by the AI machine to perform certain tasks. Explain how neural networks work in an AI model and
mention any three features of Neural Networks.

Ans. Neural networks are loosely modelled after how neurons in the human brain behave. The features of a neural
network are:

1. They are able to extract data features automatically without needing the input of the programmer.

2. A neural network is essentially a system of organizing machine learning algorithms to perform certain tasks.

3. It is a fast and efficient way to solve problems for which the dataset is very large, such as in images.

Q. Samiksha, a student of class X was exploring the Natural Language Processing domain. She got stuck while
performing the text normalisation. Help her to normalise the text on the segmented sentences given below:

Document 1: Akash and Ajay are best friends.

Document 2: Akash likes to play football but Ajay prefers to play online games.

Ans. 1. Tokenisation

Document 1: Akash, and, Ajay, are, best, friends
Document 2: Akash, likes, to, play, football, but, Ajay, prefers, to, play, online, games

2. Removal of stopwords

Document 1: Akash, Ajay, best, friends
Document 2: Akash, likes, play, football, Ajay, prefers, play, online, games

3. Converting text to a common case

Document 1: akash, ajay, best, friends
Document 2: akash, likes, play, football, ajay, prefers, play, online, games

4. Stemming/Lemmatisation

Document 1: akash, ajay, best, friend
Document 2: akash, like, play, football, ajay, prefer, play, online, game
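
The four steps can be sketched in Python as follows. This is a simplified illustration, not a standard library pipeline: the stopword list contains only the words removed in step 2, and `toy_stem` is a hypothetical one-rule stemmer that strips a trailing "s", which happens to be enough for these two sentences.

```python
import re

# Assumed stopword list: only the words removed in step 2 of the answer.
STOPWORDS = {"and", "are", "to", "but"}


def tokenise(sentence):
    """Step 1: split the sentence into word tokens (punctuation is dropped)."""
    return re.findall(r"[A-Za-z]+", sentence)


def remove_stopwords(tokens):
    """Step 2: drop tokens that appear in the stopword list."""
    return [t for t in tokens if t.lower() not in STOPWORDS]


def to_common_case(tokens):
    """Step 3: convert every token to lower case."""
    return [t.lower() for t in tokens]


def toy_stem(token):
    """Step 4 (toy rule): strip a single trailing 's' from longer words."""
    if len(token) > 3 and token.endswith("s") and not token.endswith("ss"):
        return token[:-1]
    return token


def normalise(sentence):
    """Run the four steps in the same order as the answer."""
    tokens = to_common_case(remove_stopwords(tokenise(sentence)))
    return [toy_stem(t) for t in tokens]


print(normalise("Akash and Ajay are best friends."))
print(normalise("Akash likes to play football but Ajay prefers to play online games."))
```

A real project would normally take its stopword list and its stemmer or lemmatizer from an NLP library instead of hand-written rules like these.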

Q.

Document 1: Aman and Anil are stressed

Document 2: Aman went to a therapist

Document 3: Anil went to download a health chatbot

Ans. Step 1:

Here are three documents having one sentence each. After text normalisation, the text becomes:

Document 1: [aman, and, anil, are, stressed]

Document 2: [aman, went, to, a, therapist]

Document 3: [anil, went, to, download, a, health, chatbot]

Note that no tokens have been removed in the stopwords removal step. This is because we have very little data, and
since the frequency of all the words is almost the same, no word can be said to have less value than any other.

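Step 2: Create Dictionary

List every unique word of the corpus once, in order of first appearance:

[aman, and, anil, are, stressed, went, to, a, therapist, download, health, chatbot]

Step 3: Create document vector

For each word in the dictionary, write down how many times it occurs in the document. For Document 1 this gives:

Document 1: [1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0]

Step 4: Repeat for all documents

The same exercise has to be done for all the documents. Hence, the table becomes:

Document 1: [1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
Document 2: [1, 0, 0, 0, 0, 1, 1, 1, 1, 0, 0, 0]
Document 3: [0, 0, 1, 0, 0, 1, 1, 1, 0, 1, 1, 1]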
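
Steps 2 to 4 can be sketched in Python as follows. The function names are hypothetical, and the dictionary is assumed to list words in order of first appearance. The input is the normalised text from Step 1.

```python
# Normalised documents from Step 1.
documents = [
    ["aman", "and", "anil", "are", "stressed"],
    ["aman", "went", "to", "a", "therapist"],
    ["anil", "went", "to", "download", "a", "health", "chatbot"],
]


def create_dictionary(docs):
    """Step 2: list each unique word once, in order of first appearance."""
    dictionary = []
    for doc in docs:
        for word in doc:
            if word not in dictionary:
                dictionary.append(word)
    return dictionary


def document_vector(doc, dictionary):
    """Step 3: count how often each dictionary word occurs in one document."""
    return [doc.count(word) for word in dictionary]


dictionary = create_dictionary(documents)

# Step 4: repeat for all documents to get one row per document.
table = [document_vector(doc, dictionary) for doc in documents]

print(dictionary)
for number, row in enumerate(table, start=1):
    print(f"Document {number}: {row}")
```

Each row has one entry per dictionary word, so all three vectors have length 12 here.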

Q. For the above given documents show the steps to calculate the TFIDF value of the words.

Ans.

Inverse Document Frequency:

Document Frequency is the number of documents in which the word occurs, irrespective of how many times it has
occurred in those documents.

For inverse document frequency, we need to put the document frequency in the denominator while the total number of
documents is the numerator. Here, the total number of documents is 3, hence the inverse document frequency of a word
becomes 3 divided by its document frequency: 3/2 for aman, anil, went, to and a (each occurs in two documents), and
3/1 for every other word.
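
The calculation can be sketched in Python as follows. The final formula is not shown in the text above, so this sketch assumes the commonly taught form TFIDF(W) = TF(W) × log(IDF(W)), with a base-10 logarithm and TF taken as the raw count of the word in the document. Both choices are assumptions, and the function names are hypothetical.

```python
import math

# Normalised documents from the question above.
documents = [
    ["aman", "and", "anil", "are", "stressed"],
    ["aman", "went", "to", "a", "therapist"],
    ["anil", "went", "to", "download", "a", "health", "chatbot"],
]


def document_frequency(word, docs):
    """Number of documents that contain the word at least once."""
    return sum(1 for doc in docs if word in doc)


def inverse_document_frequency(word, docs):
    """Total number of documents divided by the document frequency.

    Assumes the word occurs in at least one document.
    """
    return len(docs) / document_frequency(word, docs)


def tfidf(word, doc, docs):
    """Assumed formula: TF(W) * log10(IDF(W)), with TF as a raw count."""
    term_frequency = doc.count(word)
    return term_frequency * math.log10(inverse_document_frequency(word, docs))


print(inverse_document_frequency("aman", documents))      # 1.5
print(inverse_document_frequency("stressed", documents))  # 3.0
print(round(tfidf("stressed", documents[0], documents), 3))
```

Under these assumptions a word that occurs in fewer documents, such as "stressed", scores higher than a word that occurs in more of them, such as "aman".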
****IMPORTANT POINTERS FOR TFIDF:
