Revision Questions
Revision Questions
Q. A corpus contains 12 documents. How many document vectors will be there for that corpus?
a. 12 b. 1 c. 24 d. 1/12
Ans. 12.
Q. How do you understand whether a machine/application is AI based or not? Explain with the help of an example.
Ans. Any machine that has been trained with data and can make decisions/predictions on its own can be termed as AI.
Eg: The bot or the automation machine is not trained with any data is not an AI while a chatbot that understands and
processes human language is an AI.
Q. If you do an image search for vacations on a popular search engine, the first few searches mostly return the
picture of beaches. What is the concern here? Explain.
Ans. In the given scenario, we are concerned about the bias. When we talk about a machine, we know that it is
artificial and cannot think on its own. It can have intelligence, but we cannot expect a machine to have any biases of its
own. Any bias can transfer from the developer to the machine while the algorithm is being developed.
Q. What will be the results of conversion of the term, ‘happily’ in the process of stemming and lemmatization?
Which process takes longer time for execution?
Lemmatization- happy
Ans. Bag of words gives us two things: 1. A vocabulary of words for the corpus 2. The frequency of these words
(number of times it has occurred in the whole corpus)
Q. Ashwat is amazed to learn about his sister Ananya who is multi-talented and has excelled in academics, music,
dancing, sports and painting. He was quite curious when Ananya told him that he too possessed all these
intelligences like every human being does, but only at different levels. He wondered which intelligence she was
talking about. Can you help Ashwat in learning about different types of intelligences by naming and explaining any
four types of intelligences?
Ans. All humans possess 9 types of intelligence but at different levels. They are:
1. Mathematical Logical Reasoning: ability to regulate, measure, and understand numerical symbols, abstraction and
logic.
3. Spatial Visual Intelligence : ability to perceive the visual world and the relationship of one object to another.
4. Kinesthetic Intelligence : ability that is related to how a person uses his limbs in a skilled manner. 5. Musical
Intelligence : ability to recognize and create sounds, rhythms, and sound patterns.
6. Intrapersonal Intelligence : Describes how high the level of self -awareness someone has is. Starting from realizing
weakness, strength, to his own feelings.
7. Existential Intelligence : An additional category of intelligence relating to religious and spiritual awareness.
8. Naturalist Intelligence : An additional category of intelligence relating to the ability to process information on the
environment around us.
9. Interpersonal intelligence : ability to communicate with others by understanding other people's feelings & influence
of the person
Q. Neural networks are said to be modelled the way how neurons in the human brain behave. A similar system is
mimicked by the AI machine to perform certain tasks. Explain how neural networks work in an AI model and
mention any three features of Neural Networks.
Ans. Neural networks are loosely modelled after how neurons in the human brain behave. The features of a neural
network are : 1. They are able to extract data features automatically without needing the input of the programmer. 2.
A neural network is essentially a system of organizing machine learning algorithms to perform certain tasks. 3. It is a
fast and efficient way to solve problems for which the dataset is very large, such as in images.
Q. Samiksha, a student of class X was exploring the Natural Language Processing domain. She got stuck while
performing the text normalisation. Help her to normalise the text on the segmented sentences given below:
Document 2: Akash likes to play football but Ajay prefers to play online games.
Ans. 1. Tokenisation
Akash, and, Ajay, are, best, friends Akash, likes, to, play, football, but, Ajay, prefers, to, play, online, games
2. Removal of stopwords
Akash, Ajay, best, friends Akash, likes, play, football, Ajay, prefers, play, online, games
akash, ajay, best, friends akash, likes, play, football, ajay, prefers, play, online, games
4. Stemming/Lemmatisation
akash, ajay, best, friend akash, like, play, football, ajay, prefer, play, online, game
Q.
Ans. Step 1:
Here are three documents having one sentence each. After text normalisation, the text becomes: Document 1: [aman,
and, anil, are, stressed]
Note that no tokens have been removed in the stopwords removal step. It is because we have very little data and
since the frequency of all the words is almost the same, no word can be said to have lesser value than the other.
Step 4: Repeat for all documents Same exercise has to be done for all the documents. Hence, the table becomes:
Q. For the above given documents show the steps to calculate the TFIDF value of the words.
Ans.
Document Frequency is the number of documents in which the word occurs irrespective of how many times it has
occurred in those documents
inverse document frequency, we need to put the document frequency in the denominator while the total number of
documents is the numerator. Here, the total number of documents are 3, hence inverse document frequency
becomes:
****IMPORTANT POINTERS FOR TFIDF: