0% found this document useful (0 votes)

2 views4 pages

Phase 2 File

The document outlines the Phase-2 submission for a project focused on developing an NLP-powered chatbot for customer support, specifically targeting intent classification. It details the problem statement, project objectives, data description, preprocessing steps, exploratory data analysis, model building, and performance metrics. The project aims to enhance customer service efficiency through accurate intent recognition, utilizing various models including Logistic Regression, Random Forest, and BERT.

Uploaded by

mdnafeed29

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views4 pages

Phase 2 File

Uploaded by

mdnafeed29

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Phase-2 Submission Template

Student Name: Harish Ragavendra S

Register Number: 510623243010

Institution: C.Abdul Hakeem College of Engineering & Technology

Department: Btech Ai&Ds

Date of Submission: 09.05.2025

Github Repository Link: https://github.com/madhuhansa/NLP-Intent-

classification-chatbot

1. Problem Statement
In the digital age, customers expect instant, accurate, and 24/7 support across
various platforms. Traditional customer service models rely heavily on human
agents, resulting in increased operational costs, inconsistent responses, and delays
during high-demand periods. To overcome these limitations, businesses are
increasingly turning to intelligent automation solutions like chatbots.

In Phase-1, we proposed a basic NLP-powered chatbot for customer support. Upon

further exploration of the dataset and chatbot interactions, we have refined the
problem to focus specifically on intent classification—accurately identifying the
user's intent from their message and generating a relevant, automated response.

This is fundamentally a multi-class classification problem, where the chatbot must

classify input queries into predefined categories such as “Order Status,” “Returns,”
“Technical Support,” etc. The quality of classification directly impacts the
effectiveness of the automated response.

Solving this problem has practical and wide-reaching implications. An accurate

intent-classifying chatbot reduces human workload, improves customer satisfaction,
enables round-the-clock support, and allows businesses to scale their support
infrastructure efficiently. It has relevance across e-commerce, healthcare, banking,
and more, making it a vital solution for modern customer service operations.
2. Project Objectives
- Refine the chatbot to improve intent recognition and response generation
accuracy.
- Enhance user experience through improved NLP pipelines.
- Achieve high model performance in terms of accuracy and F1-score.
- Adapt and evolve objectives based on insights from EDA and initial trials.

3. Flowchart of the Project Workflow

4. Data Description
- Dataset: Custom and open-source chatbot datasets (e.g., Cornell Movie Dialogues,
Kaggle FAQs).
- Type: Text (Unstructured)
- Number of Records: ~10,000 conversation pairs (questions and responses)
- Number of Features: 2 main columns – user_input and intent
- Dataset Type: Static
- Target Variable: intent (used for classification)
5. Data Preprocessing
- Removed missing and duplicate entries.
- Normalized text (lowercasing, punctuation removal).
- Tokenized sentences and applied lemmatization.
- Applied label encoding on target variable.
- Vectorized inputs using TF-IDF and BERT embeddings.

6. Exploratory Data Analysis (EDA)

Univariate Analysis:
- Countplots and pie charts to visualize intent distribution.
- Word clouds for most common words.
- Boxplots of message lengths.

Bivariate / Multivariate Analysis:

- Bar plots: Intent vs. average message length.
- Cosine similarity heatmaps for intent overlap.

Insights Summary:
- Common intents dominate dataset.
- Keyword-based patterns support model separability.

7. Feature Engineering
- Created features: message length, keyword flags.
- TF-IDF vectorization and BERT embeddings used.
- PCA used on TF-IDF (optional dimensionality reduction).
- Features helped improve classification accuracy.

8. Model Building
Models Used:
1. Logistic Regression – baseline with TF-IDF.
2. Random Forest – handles sparse data, interpretable.
3. BERT – transformer model with high accuracy.

Train-Test Split: 80-20, stratified.

Metrics: Accuracy, Precision, Recall, F1-score.

Performance:
| Model | Accuracy | Precision | Recall | F1-Score |
|--------------------|----------|-----------|--------|----------|
| Logistic Regression| 84.5% | 0.83 | 0.84 | 0.835 |
| Random Forest | 88.2% | 0.87 | 0.88 | 0.875 |
| BERT Transformer | 94.1% | 0.94 | 0.94 | 0.94 |

9. Visualization of Results & Model Insights

- Confusion Matrix for all models showed class-wise performance.
- ROC Curves confirmed BERT's superior prediction confidence.
- Feature Importance plots from Random Forest explained top keywords.
- Visual comparisons proved BERT outperformed others in both precision and
recall.

10. Tools and Technologies Used

- Programming Language: Python
- IDE: Google Colab, Jupyter Notebook, VS Code
- Libraries: pandas, numpy, seaborn, matplotlib, scikit-learn, transformers,
TensorFlow
- Visualization Tools: Plotly, seaborn

11. Team Members and Contributions

- Harish Ragavendra S – Data Cleaning, Model Development, Report Writing
- Justin Rishi S B – EDA, Feature Engineering,
- Mohammed Raquess – Model Evaluation, Deployment

- Mohammed Naveed - GitHub Management, Visualizations

Beyond Imagination: Spiritual Interpretation of Numbers
100% (2)
Beyond Imagination: Spiritual Interpretation of Numbers
172 pages
Text Classification On Call Center Data Using BERT
No ratings yet
Text Classification On Call Center Data Using BERT
4 pages
Python Chatbot Project
No ratings yet
Python Chatbot Project
10 pages
Edunet Week 1 Submission Details
No ratings yet
Edunet Week 1 Submission Details
4 pages
AIML Hackathon
No ratings yet
AIML Hackathon
26 pages
AIML Internship Report
No ratings yet
AIML Internship Report
53 pages
Phase 2 File 1
No ratings yet
Phase 2 File 1
4 pages
Fatigue and Fracture of Metals and Alloys Numerical and Experimental Study
No ratings yet
Fatigue and Fracture of Metals and Alloys Numerical and Experimental Study
370 pages
Power System Stabliser: A Review
No ratings yet
Power System Stabliser: A Review
71 pages
Sundar RajI Phase 3
No ratings yet
Sundar RajI Phase 3
29 pages
Phase
No ratings yet
Phase
3 pages
Assignment 2
No ratings yet
Assignment 2
3 pages
GUHAN
No ratings yet
GUHAN
19 pages
Phase-2 Intelligent Chatbot Automated Assistance
No ratings yet
Phase-2 Intelligent Chatbot Automated Assistance
7 pages
Chatbot: Abhishek Verma (00414902018) Archit Kr. Singh (01414902018) Jatin Bagga (03814902018)
No ratings yet
Chatbot: Abhishek Verma (00414902018) Archit Kr. Singh (01414902018) Jatin Bagga (03814902018)
29 pages
Turing Machine VS Pushdown Automata
100% (2)
Turing Machine VS Pushdown Automata
23 pages
Shalini NM Record
No ratings yet
Shalini NM Record
29 pages
Scientific Method & Variables
100% (1)
Scientific Method & Variables
28 pages
P3 Mathematics MIDTERM I JOINT EXAMINATION 2013 PDF
50% (2)
P3 Mathematics MIDTERM I JOINT EXAMINATION 2013 PDF
6 pages
Rucha Bapat Master Thesis Report
No ratings yet
Rucha Bapat Master Thesis Report
91 pages
NLU Final
No ratings yet
NLU Final
23 pages
Cl10 Maths Portfolio 24-25
No ratings yet
Cl10 Maths Portfolio 24-25
6 pages
MATM111-Mathematics in The Modern World
100% (1)
MATM111-Mathematics in The Modern World
140 pages
PP Mini Project-Gp
No ratings yet
PP Mini Project-Gp
23 pages
Britto 1 15 2 15 - Merged
No ratings yet
Britto 1 15 2 15 - Merged
18 pages
Seminar
No ratings yet
Seminar
27 pages
Cse303 Elements of The Theory of Computation: Professor Anita Wasilewska
No ratings yet
Cse303 Elements of The Theory of Computation: Professor Anita Wasilewska
91 pages
Britto
No ratings yet
Britto
16 pages
Chapter 3
No ratings yet
Chapter 3
14 pages
Influence Lines - Qualitative Influence Lines Using The Müller Breslau Principle
No ratings yet
Influence Lines - Qualitative Influence Lines Using The Müller Breslau Principle
7 pages
Varshini Phase 2
No ratings yet
Varshini Phase 2
19 pages
CM2060 NLP Coursework
No ratings yet
CM2060 NLP Coursework
5 pages
CH 24 Knight 4th
No ratings yet
CH 24 Knight 4th
77 pages
Phase-2 Ibrahim
No ratings yet
Phase-2 Ibrahim
9 pages
Related Rates of Change
No ratings yet
Related Rates of Change
54 pages
Reflection and Refraction
No ratings yet
Reflection and Refraction
46 pages
Estimation of PH of Sugar Cane Juices at High Temperature
No ratings yet
Estimation of PH of Sugar Cane Juices at High Temperature
4 pages
Project Expo Summary Report Final
No ratings yet
Project Expo Summary Report Final
7 pages
Project Expo Summary Report
No ratings yet
Project Expo Summary Report
7 pages
Rohan Task Performed
No ratings yet
Rohan Task Performed
6 pages
Blofeld Sysex v1 04
No ratings yet
Blofeld Sysex v1 04
25 pages
Assignment Data Science
No ratings yet
Assignment Data Science
6 pages
Intelligent Chatbot Phase1
No ratings yet
Intelligent Chatbot Phase1
2 pages
Finance Chpter 5 Time Value of Money
No ratings yet
Finance Chpter 5 Time Value of Money
11 pages
Research Paper
No ratings yet
Research Paper
4 pages
Internship Report
No ratings yet
Internship Report
5 pages
Protean
No ratings yet
Protean
5 pages
Detailed Case Study Rewrite Fixed
No ratings yet
Detailed Case Study Rewrite Fixed
4 pages
ML Project Proposal PDF
No ratings yet
ML Project Proposal PDF
4 pages
Project Synopsis23543
No ratings yet
Project Synopsis23543
4 pages
01 Introduction
No ratings yet
01 Introduction
24 pages
Natural Language Understanding in Chatbots
No ratings yet
Natural Language Understanding in Chatbots
4 pages
Digital Signal Processing
No ratings yet
Digital Signal Processing
24 pages
48 75 Dsa Report
No ratings yet
48 75 Dsa Report
11 pages
Phase 1
No ratings yet
Phase 1
3 pages
Bulba Advanced Instructions
No ratings yet
Bulba Advanced Instructions
13 pages
1 Lesson Notes Measurement Systems
No ratings yet
1 Lesson Notes Measurement Systems
11 pages
GenAI Workshop Report
No ratings yet
GenAI Workshop Report
14 pages
Project Report On Chatbotmuneshwar Anchal
No ratings yet
Project Report On Chatbotmuneshwar Anchal
50 pages
Demo
No ratings yet
Demo
1 page
Shreyank
No ratings yet
Shreyank
6 pages
Investment & Portfolio Management - Assignment Three - Smart 3B
No ratings yet
Investment & Portfolio Management - Assignment Three - Smart 3B
10 pages
Phase-1 For DA & DS
No ratings yet
Phase-1 For DA & DS
3 pages
Results & Discussions
No ratings yet
Results & Discussions
1 page
CM2015 Midterm Apr25
No ratings yet
CM2015 Midterm Apr25
4 pages
Phase-1 For DA & DS
No ratings yet
Phase-1 For DA & DS
3 pages
STResume
No ratings yet
STResume
1 page
HAI Report 3
No ratings yet
HAI Report 3
13 pages
Electrostatics: Coulomb's Law and Electric Field (E-Field)
No ratings yet
Electrostatics: Coulomb's Law and Electric Field (E-Field)
17 pages
Phase-2 For DS
No ratings yet
Phase-2 For DS
6 pages
Kelompok 7 Chap 18
No ratings yet
Kelompok 7 Chap 18
6 pages
FYProject Template (PROPOSAL)
No ratings yet
FYProject Template (PROPOSAL)
22 pages
Hany El - Gezawy: Tips 4 P6 Exams
No ratings yet
Hany El - Gezawy: Tips 4 P6 Exams
6 pages
BAZEPOD Sluzbeni Podsjetnik Kolegija Baze Podataka
No ratings yet
BAZEPOD Sluzbeni Podsjetnik Kolegija Baze Podataka
5 pages
Developing An AI
No ratings yet
Developing An AI
10 pages
Linear Algebra - Syllabus
No ratings yet
Linear Algebra - Syllabus
4 pages
Lab 125 Darshan Patel
No ratings yet
Lab 125 Darshan Patel
4 pages
Udaylokhande
No ratings yet
Udaylokhande
8 pages
Complex Analysis 14
No ratings yet
Complex Analysis 14
2 pages
Physics IITBHU Assignment1
No ratings yet
Physics IITBHU Assignment1
2 pages
MATLAB Workshop
No ratings yet
MATLAB Workshop
2 pages
BentoML Adapter Integrations for Machine Learning Frameworks: The Complete Guide for Developers and Engineers
From Everand
BentoML Adapter Integrations for Machine Learning Frameworks: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
IT Specialist: Artificial Intelligence Exam Prep - 500 Questions for Certification Success (0225)
From Everand
IT Specialist: Artificial Intelligence Exam Prep - 500 Questions for Certification Success (0225)
Satou Takahiro
No ratings yet
XGBoost in Practice: Definitive Reference for Developers and Engineers
From Everand
XGBoost in Practice: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
Digital Modulations using Matlab
From Everand
Digital Modulations using Matlab
Mathuranathan Viswanathan
4/5 (6)
ChatGPT for Beginners Al-Powered Producivity
From Everand
ChatGPT for Beginners Al-Powered Producivity
Ary S. Jr.
No ratings yet
Mastering Algorithms for Competitive Programming: Unlock the Secrets of Expert-Level Skills
From Everand
Mastering Algorithms for Competitive Programming: Unlock the Secrets of Expert-Level Skills
Larry Jones
No ratings yet
CHATGPT DALL.E 3: Complete Guide. Third Edition
From Everand
CHATGPT DALL.E 3: Complete Guide. Third Edition
Hesham Mohamed Elsherif
No ratings yet
Wolfram Language and Computational Techniques: Definitive Reference for Developers and Engineers
From Everand
Wolfram Language and Computational Techniques: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Prompt Engineering with ChatGPT
From Everand
Prompt Engineering with ChatGPT
Nikiforos Kontopoulos
No ratings yet
Artificial Intelligence 2024 Book 2 of 2: AI, #2
From Everand
Artificial Intelligence 2024 Book 2 of 2: AI, #2
Yang Yen Thaw
No ratings yet
AI for Everyone: An Intermediate Guide to Artificial Intelligence
From Everand
AI for Everyone: An Intermediate Guide to Artificial Intelligence
Nova Clarke
No ratings yet
Detectron2 in Practice: Definitive Reference for Developers and Engineers
From Everand
Detectron2 in Practice: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Learning Dynamics NAV Patterns: Create solutions that are easy to maintain, are quick to upgrade, and follow proven concepts and design
From Everand
Learning Dynamics NAV Patterns: Create solutions that are easy to maintain, are quick to upgrade, and follow proven concepts and design
Marije Brummel
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Phase 2 File

Uploaded by

Phase 2 File

Uploaded by

Phase-2 Submission Template

Student Name: Harish Ragavendra S

Register Number: 510623243010

Institution: C.Abdul Hakeem College of Engineering & Technology

Department: Btech Ai&Ds

Date of Submission: 09.05.2025

Github Repository Link: https://github.com/madhuhansa/NLP-Intent-

In Phase-1, we proposed a basic NLP-powered chatbot for customer support. Upon

This is fundamentally a multi-class classification problem, where the chatbot must

Solving this problem has practical and wide-reaching implications. An accurate

3. Flowchart of the Project Workflow

6. Exploratory Data Analysis (EDA)

Bivariate / Multivariate Analysis:

Train-Test Split: 80-20, stratified.

Metrics: Accuracy, Precision, Recall, F1-score.

9. Visualization of Results & Model Insights

10. Tools and Technologies Used

11. Team Members and Contributions

- Mohammed Naveed - GitHub Management, Visualizations

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.