Q ClassX AI Ch7

This document is a question bank for Class 10 Artificial Intelligence focusing on Natural Language Processing (NLP). It includes various types of questions, such as definitions, comparisons, and applications related to NLP concepts like chatbots, stemming, lemmatization, and TFIDF. Additionally, it provides practical exercises for creating document vector tables and normalizing text.


ARTIFICIAL INTELLIGENCE

QUESTION BANK – CLASS 10

CHAPTER 7: NATURAL LANGUAGE PROCESSING

One (01) Mark Questions

1. What is a Chatbot?
A chatbot is a computer program designed to simulate human conversation through
voice commands, text chats, or both. E.g.: Mitsuku Bot, Jabberwacky, etc.
OR
A chatbot is a computer program that can learn over time how best to interact with
humans. It can answer questions and troubleshoot customer problems, evaluate and
qualify prospects, generate sales leads and increase sales on an e-commerce site.

2. While working with NLP, what is the meaning of:

a. Syntax
b. Semantics

Syntax: Syntax refers to the grammatical structure of a sentence.
Semantics: Semantics refers to the meaning of the sentence.

3. What is the difference between stemming and lemmatization?


Stemming is a technique used to extract the base form of the words by removing affixes
from them. It is just like cutting down the branches of a tree to its stems. For example,
the stem of the words eating, eats, eaten is eat.
Lemmatization is the process of grouping together the different inflected forms of a
word so they can be analysed as a single item. In search queries, lemmatization allows
end users to query any version of a base word and get relevant results.
OR
Stemming is the process in which the affixes of words are removed and the words are
converted to their base form.

In lemmatization, the word we get after affix removal (known as the lemma) is a
meaningful one. Lemmatization makes sure that the lemma is a word with meaning,
and hence it takes longer to execute than stemming.
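The difference can be seen in a small sketch. This is an illustrative toy, not NLTK: the suffix list and the lemma lookup table below are made-up assumptions, used only to show why stems can be non-words while lemmas are always real words.

```python
# Illustrative sketch (not NLTK): a crude suffix-stripping stemmer versus a
# dictionary-based lemmatizer.

SUFFIXES = ["ing", "es", "ed", "s"]  # affixes the toy stemmer chops off
LEMMA_DICT = {"eating": "eat", "eats": "eat", "eaten": "eat",
              "caring": "care"}      # hypothetical lookup table

def crude_stem(word):
    """Strip the first matching suffix; the result may not be a real word."""
    for suffix in SUFFIXES:
        if word.endswith(suffix):
            return word[: -len(suffix)]
    return word

def lemmatize(word):
    """Look the word up; the lemma is guaranteed to be a meaningful word."""
    return LEMMA_DICT.get(word, word)

print(crude_stem("caring"))   # "car"  - stemming can produce a non-word
print(lemmatize("caring"))    # "care" - lemmatization returns a real word
```

Because lemmatization consults a vocabulary instead of just cutting off affixes, it is slower but always yields a dictionary word.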

4. What is the full form of TFIDF?


Term Frequency-Inverse Document Frequency

5. What is meant by a dictionary in NLP?


A dictionary in NLP is a list of all the unique words occurring in the corpus. Even if
some words are repeated in different documents, they are written just once while
creating the dictionary.
6. What is term frequency?
Term frequency is the frequency of a word in one document.

7. Which package is used for Natural Language Processing in Python programming?


Natural Language Toolkit (NLTK).

8. What is a document vector table?


Document Vector Table is used while implementing the Bag of Words algorithm.
If a document contains a particular word, it is represented by 1; the absence of the
word is represented by 0.
OR
Document Vector Table is a table containing the frequency of each word of the
vocabulary in each document.

9. What do you mean by corpus?


A corpus is a large and structured set of machine-readable texts that have been
produced in a natural communicative setting.
OR
A corpus can be defined as a collection of text documents. It can be thought of as just a
bunch of text files in a directory, often alongside many other directories of text files.

Two (02) Mark Questions

1. Differentiate between a script-bot and a smart-bot. (Any 2 differences)

Script-bot:
● A scripted chatbot doesn't carry even a glimpse of AI.
● Script bots are easy to make.
● Script bot functioning is very limited as they are less powerful.
● Script bots work around a script which is programmed in them.
● No or little language processing skills.
● Limited functionality.

Smart-bot:
● Smart bots are built on NLP and ML.
● Smart bots are comparatively difficult to make.
● Smart bots are flexible and powerful.
● Smart bots work on bigger databases and other resources directly.
● NLP and machine learning skills are required.
● Wide functionality.

2. What is inverse document frequency?


To understand inverse document frequency, we first need to understand document
frequency.
Document Frequency is the number of documents in which a word occurs, irrespective
of how many times it has occurred in those documents.
For inverse document frequency, we put the document frequency in the denominator
and the total number of documents in the numerator.
For example, if the document frequency of the word "AMAN" is 2 in a corpus of 3
documents, then its inverse document frequency will be 3/2.

3. Mention some applications of Natural Language Processing.


Natural Language Processing Applications:
● Sentiment Analysis
● Chatbots & Virtual Assistants
● Text Classification
● Text Extraction
● Machine Translation
● Text Summarization
● Market Intelligence
● Auto-Correct

4. Explain the concept of Bag of Words.


Bag of Words is a Natural Language Processing model which helps in extracting
features out of text that can be used in machine learning algorithms. In Bag of Words,
we count the occurrences of each word and construct the vocabulary for the corpus.

Bag of Words simply creates a set of vectors containing the count of word occurrences
in each document. Bag of Words vectors are easy to interpret.
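The two steps above can be sketched in plain Python (no external libraries). The two-document corpus below is a made-up mini example; lower-casing and sorting the vocabulary are choices made for readability, not requirements of the model.

```python
# Minimal Bag of Words sketch: build the vocabulary from a corpus,
# then turn each document into a vector of word counts.

corpus = [
    "We are going to Mumbai",
    "Mumbai is a famous place",
]

# Step 1: tokenize and build the vocabulary (each unique word written once)
tokens = [doc.lower().split() for doc in corpus]
vocabulary = sorted(set(word for doc in tokens for word in doc))

# Step 2: one count vector per document
vectors = [[doc.count(word) for word in vocabulary] for doc in tokens]

print(vocabulary)
for vec in vectors:
    print(vec)
```

Each vector has one position per vocabulary word, so all documents are mapped to vectors of the same length, which is exactly what a machine learning algorithm needs.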

5. What are stop words? Explain with the help of examples.


“Stop words” are the most common words in a language like “the”, “a”, “on”, “is”, “all”.
These words do not carry important meaning and are usually removed from texts. It is
possible to remove stop words using Natural Language Toolkit (NLTK), a suite of
libraries and programs for symbolic and statistical natural language processing.
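A sketch of the removal step, assuming a small hand-made stop-word list. (NLTK ships a fuller list via `nltk.corpus.stopwords`, but that requires the library and its data to be installed; the hard-coded set below keeps the example self-contained.)

```python
# Stop-word removal with an illustrative, hand-made stop-word list.
STOP_WORDS = {"the", "a", "an", "on", "is", "all", "and", "are"}

def remove_stop_words(text):
    """Keep only the tokens that are not in the stop-word list."""
    return [w for w in text.lower().split() if w not in STOP_WORDS]

print(remove_stop_words("The sun is shining on all the fields"))
# common words like "the", "is", "on", "all" are dropped
```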

6. Differentiate between Human Language and Computer Language.


Humans communicate through language which we process all the time. Our brain keeps
on processing the sounds that it hears around itself and tries to make sense out of them
all the time.
On the other hand, the computer understands the language of numbers. Everything that
is sent to the machine has to be converted to numbers. And while typing, if a single
mistake is made, the computer throws an error and does not process that part. The
communications made by the machines are very basic and simple.

Four (04) Mark Questions

1. Create a document vector table for the given corpus:


Document 1: We are going to Mumbai
Document 2: Mumbai is a famous place.
Document 3: We are going to a famous place.
Document 4: I am famous in Mumbai.
             We  are  going  to  Mumbai  is  a  famous  place  I  am  in
Document 1    1   1     1     1     1     0   0    0      0    0   0   0
Document 2    0   0     0     0     1     1   1    1      1    0   0   0
Document 3    1   1     1     1     0     0   1    1      1    0   0   0
Document 4    0   0     0     0     1     0   0    1      0    1   1   1

Perfect Syntax, no Meaning: Sometimes a statement can have perfectly correct syntax
but still not mean anything. In human language, a balance of syntax and semantics is
important for understanding.
These are some of the challenges we might face if we try to teach computers how to
understand and interact in human language.

2. Through a step-by-step process, calculate TFIDF for the given corpus and mention
the word(s) having highest value.
Document 1: We are going to Mumbai
Document 2: Mumbai is a famous place.
Document 3: We are going to a famous place.
Document 4: I am famous in Mumbai.

Term Frequency
Term frequency is the frequency of a word in one document. Term frequency can easily
be found from the document vector table as in that table we mention the frequency of
each word of the vocabulary in each document.

             We  are  going  to  Mumbai  is  a  famous  place  I  am  in
Document 1    1   1     1     1     1     0   0    0      0    0   0   0
Document 2    0   0     0     0     1     1   1    1      1    0   0   0
Document 3    1   1     1     1     0     0   1    1      1    0   0   0
Document 4    0   0     0     0     1     0   0    1      0    1   1   1

Inverse Document Frequency


The other half of TFIDF is Inverse Document Frequency. For this, let us first understand
what document frequency means. Document Frequency is the number of documents in
which the word occurs, irrespective of how many times it has occurred in those
documents. The document frequency for the exemplar vocabulary would be:

             We  are  going  to  Mumbai  is  a  famous  place  I  am  in
Doc. freq.    2   2     2     2     3     1   2    3      2    1   1   1

Talking about inverse document frequency, we put the document frequency in the
denominator and the total number of documents in the numerator. Here, the total
number of documents is 4, hence the inverse document frequency becomes:

             We   are  going  to   Mumbai  is   a    famous  place  I    am   in
IDF          4/2  4/2  4/2    4/2  4/3     4/1  4/2  4/3     4/2    4/1  4/1  4/1

The formula of TFIDF for any word W becomes:

TFIDF(W) = TF(W) * log(IDF(W))

Applying this formula (log base 10), the highest values go to the words that occur in
only one document: is, I, am and in, each scoring 1 × log(4/1) ≈ 0.602. Words such as
Mumbai and famous occur in three documents and therefore score only
1 × log(4/3) ≈ 0.125 per occurrence.
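The whole calculation can be reproduced in a few lines of Python. This is an illustrative sketch of the chapter's formula TFIDF(W) = TF(W) × log(N / DF(W)); lower-casing the text, stripping the full stops, and using log base 10 are assumptions made here, not requirements stated in the question.

```python
import math

# Step-by-step TFIDF for the four-document corpus.
corpus = [
    "We are going to Mumbai",
    "Mumbai is a famous place.",
    "We are going to a famous place.",
    "I am famous in Mumbai.",
]
docs = [d.lower().replace(".", "").split() for d in corpus]
vocab = sorted(set(w for d in docs for w in d))
N = len(docs)  # total number of documents = 4

# Document frequency: in how many documents each word occurs
df = {w: sum(1 for d in docs if w in d) for w in vocab}

# TFIDF per document: term frequency times log(N / document frequency)
for i, d in enumerate(docs, start=1):
    scores = {w: d.count(w) * math.log10(N / df[w]) for w in set(d)}
    print(f"Document {i}:", {w: round(s, 3) for w, s in scores.items()})
```

Words that appear in every few documents get a small logarithm, so under this formula the rarest words (document frequency 1) always receive the largest TFIDF values.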

3. Normalize the given text and comment on the vocabulary before and after the
normalization:
Raj and Vijay are best friends. They play together with other friends. Raj likes to
play football but Vijay prefers to play online games. Raj wants to be a footballer.
Vijay wants to become an online gamer.

Normalization of the given text:


Sentence Segmentation:
1. Raj and Vijay are best friends.
2. They play together with other friends.
3. Raj likes to play football but Vijay prefers to play online games.
4. Raj wants to be a footballer.
5. Vijay wants to become an online gamer.

Tokenization:

Raj and Vijay are best friends. → Raj | and | Vijay | are | best | friends | .
They play together with other friends. → They | play | together | with | other | friends | .

The same will be done for all sentences.

Removing Stop Words, Special Characters and Numbers:

In this step, the tokens which are not necessary are removed from the token list.
So the words and, are, to, an and the punctuation (.) will be removed.

Converting text to a common case:

After the stop words removal, we convert the whole text into a similar case, preferably
lower case.
Here we don’t have words in different cases, so this step is not required for the given text.
Stemming:

In this step, the remaining words are reduced to their root words. In other words,
stemming is the process in which the affixes of words are removed and the words are
converted to their base form.
Word      Affixes   Stem
likes     -s        like
prefers   -s        prefer
wants     -s        want

Lemmatization is not required for the given text.
Given Text
Raj and Vijay are best friends. They play together with other friends. Raj likes to play
football but Vijay prefers to play online games. Raj wants to be a footballer. Vijay wants to
become an online gamer.
Normalized Text
Raj and Vijay best friends They play together with other friends Raj likes to play football
but Vijay prefers to play online games Raj wants to be a footballer Vijay wants to become
an online gamer
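The pipeline worked through above (sentence segmentation, tokenization, stop-word and punctuation removal, lower-casing, stemming) can be sketched end to end. The stop-word set and the "strip a trailing s" stemming rule below are deliberately crude illustrative assumptions, not NLTK's behaviour, so the output will differ slightly from the hand-worked answer.

```python
import string

STOP_WORDS = {"and", "are", "to", "an", "a", "with", "but", "be"}  # assumed list

def normalize(text):
    # Sentence segmentation on full stops
    sentences = [s.strip() for s in text.split(".") if s.strip()]
    tokens = []
    for sentence in sentences:
        for word in sentence.split():                      # tokenization
            word = word.strip(string.punctuation).lower()  # case + punctuation
            if word and word not in STOP_WORDS:            # stop-word removal
                # Crude stemming: drop a trailing "s" from longer words
                if word.endswith("s") and len(word) > 3:
                    word = word[:-1]
                tokens.append(word)
    return tokens

text = ("Raj and Vijay are best friends. They play together with other friends. "
        "Raj likes to play football but Vijay prefers to play online games.")
print(normalize(text))
```

Comparing the vocabulary before and after shows the point of normalization: the token list shrinks and variants such as "likes"/"prefers" collapse toward their base forms.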
