0% found this document useful (0 votes)

10 views

Logistic Regression Example (1)

The document explains how Logistic Regression is used for sentiment analysis through a step-by-step approach, starting from binary classification to making predictions. It details the process of converting text into numerical representations, assigning weights to words, calculating scores, applying the sigmoid function, and making final predictions. Additionally, it provides a Python implementation for training a Logistic Regression model on sentiment data, including data cleaning, feature extraction, and model evaluation.

Uploaded by

Asil Zulfiqar 4459-FBAS/BSCS4/F21

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views

Logistic Regression Example (1)

Uploaded by

Asil Zulfiqar 4459-FBAS/BSCS4/F21

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

How Logistic Regression Assigns Scores in Sentiment Analysis (Step-by-Step)

Alright, let’s break it down like we're explaining to a beginner. We’ll go step by step to
understand how Logistic Regression assigns scores and makes predictions.

🔹 Step 1: Understanding the Core Idea

Logistic Regression is used for binary classification, meaning it decides between two
categories (e.g., positive vs. negative sentiment).

Goal:

 Each word in a review contributes to the final score.

 The score is converted into a probability using the sigmoid function.
 If the probability is greater than 0.5 → Positive (1).
 Otherwise, Negative (0).

🔹 Step 2: Representing Text as Numbers

Before the model can work, we must convert text into numbers.

Example Reviews:

1️⃣ "I love this movie" → Positive (1)

2️⃣ "I hate this movie" → Negative (0)

Bag of Words (BoW) Representation:

I love hate this movie

Review 1 1 1 0 1 1
Review 2 1 0 1 1 1

Each review is now represented as a vector of numbers, which Logistic Regression can
process.

🔹 Step 3: Assigning Weights to Words

Logistic Regression Equation:

score=w1⋅x1+w2⋅x2+...+wn⋅xn+b\text{score} = w_1 \cdot x_1 + w_2 \cdot x_2 + ... + w_n \

cdot x_n + b
Where:

 wiw_i → Weight assigned to each word (how important it is)

 xix_i → Word presence (1 if the word appears, 0 if it doesn’t)
 bb → Bias term (a constant value)

📌 Example Weights (learned during training):

Word Weight (ww)

I 0.1
love 2.5
hate -3.0
this 0.5
movie 1.0

📌 Bias term: b=−0.5b = -0.5

🔹 Step 4: Calculating the Score

Now, let’s compute the score for each review using the weights.

Review 1: "I love this movie"

score=(0.1×1)+(2.5×1)+(−3.0×0)+(0.5×1)+(1.0×1)+(−0.5)\text{score} = (0.1 \times 1) +

(2.5 \times 1) + (-3.0 \times 0) + (0.5 \times 1) + (1.0 \times 1) + (-0.5)
=0.1+2.5+0+0.5+1.0−0.5= 0.1 + 2.5 + 0 + 0.5 + 1.0 - 0.5 =3.6= 3.6

Review 2: "I hate this movie"

score=(0.1×1)+(2.5×0)+(−3.0×1)+(0.5×1)+(1.0×1)+(−0.5)\text{score} = (0.1 \times 1) +

(2.5 \times 0) + (-3.0 \times 1) + (0.5 \times 1) + (1.0 \times 1) + (-0.5)
=0.1+0−3.0+0.5+1.0−0.5= 0.1 + 0 - 3.0 + 0.5 + 1.0 - 0.5 =−1.9= -1.9

🔹 Step 5: Applying the Sigmoid Function

The score is now converted into a probability using the sigmoid function:

P=11+e−scoreP = \frac{1}{1 + e^{-\text{score}}}

📌 Why do we use sigmoid?

 It squashes the score into a range between 0 and 1.

 If P > 0.5 → Positive sentiment (1)
 Otherwise, Negative sentiment (0)
Review 1: "I love this movie" (Score = 3.6)

P=11+e−3.6P = \frac{1}{1 + e^{-3.6}} P≈11+0.027=0.974P \approx \frac{1}{1 + 0.027} =

0.974

🔹 Probability = 97.4% → Positive (1) ✅

Review 2: "I hate this movie" (Score = -1.9)

P=11+e1.9P = \frac{1}{1 + e^{1.9}} P≈11+6.69=0.13P \approx \frac{1}{1 + 6.69} = 0.13

🔹 Probability = 13% → Negative (0) ❌

🔹 Step 6: Making the Final Prediction

Review Score Sigmoid Output Prediction
"I love this movie" 3.6 0.974 Positive (1) ✅
"I hate this movie" -1.9 0.13 Negative (0) ❌

🎯 The model successfully classified the reviews! 🎯

🔹 Step 7: Training Logistic Regression

Before making predictions, the model learns the weights from data.

How does the model learn?

1. Starts with random weights 📊

2. Calculates predictions using sigmoid
3. Compares predictions with actual labels (positive/negative)
4. Adjusts weights using an optimization method (Gradient Descent)
5. Repeats the process until predictions improve 📈

🔹 Summary of the Process 🚀

Step What Happens? Example
1️⃣ Convert Text to Represent words as "love" = 1, "hate" = 1
Numbers numbers
2️⃣ Assign Weights Learn word importance "love" → +2.5, "hate" → -3.0
Multiply words by Score = (2.5 × 1) + (-3.0 × 0) +
3️⃣ Compute Score ...
weights
Step What Happens? Example
Convert score to
4️⃣ Apply Sigmoid Sigmoid(3.6) → 0.974
probability
5️⃣ Make Prediction If P > 0.5 → Positive 0.974 → Positive (1)

🔹 Why Use Logistic Regression for Sentiment Analysis?

✅ Simple → Easy to implement
✅ Fast → Works well on small datasets
✅ Interpretable → You can see which words affect sentiment

🚀 Now you understand how Logistic Regression assigns scores step by step! 🎯

Alright! Let’s break down logistic regression for sentiment analysis step by step, assuming
you’re a complete beginner. 🚀

1. What is Sentiment Analysis?

Sentiment Analysis is a type of text classification where we determine whether a given text
expresses positive, negative, or sometimes neutral sentiment.

For example:
✅ "I love this movie!" → Positive
❌ "This product is terrible!" → Negative
😐 "The service was okay." → Neutral

2. Why Use Logistic Regression?

Logistic Regression is a simple yet powerful algorithm used for binary classification (two
categories). Since sentiment analysis is often a positive vs. negative task, logistic regression
works well.

It predicts probabilities using the sigmoid function, which ensures outputs are between 0 and
1. If the probability is above 0.5, we classify it as positive; otherwise, it’s negative.

3. Steps to Perform Sentiment Analysis Using Logistic

Regression
Let’s implement it in Python! 🐍

Step 1: Import Libraries

import numpy as np
import pandas as pd
import re # For text cleaning
import nltk # Natural Language Processing tools
from nltk.corpus import stopwords
from nltk.tokenize import word_tokenize
from sklearn.feature_extraction.text import CountVectorizer,
TfidfVectorizer
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, classification_report

Note: Run nltk.download('stopwords') and nltk.download('punkt') if you’re using

nltk for the first time.

Step 2: Load Dataset

For simplicity, let’s assume we have a dataset with two columns:

 "text" (contains reviews)

 "sentiment" (1 = positive, 0 = negative)

# Sample dataset (usually loaded from CSV)

data = pd.DataFrame({
'text': [
"I love this movie, it's amazing!",
"This product is the worst. Never buying again.",
"Absolutely fantastic experience!",
"Terrible customer service. Not recommended.",
"I'm so happy with my purchase!",
"Worst experience ever."
],
'sentiment': [1, 0, 1, 0, 1, 0] # 1 = positive, 0 = negative
})

Step 3: Clean the Text Data

Text data is messy! We need to:

 Remove special characters, numbers, and punctuation

 Convert everything to lowercase
 Remove stopwords (common words like the, is, and)
 Tokenize (split text into words)

nltk.download('stopwords')
nltk.download('punkt')
stop_words = set(stopwords.words('english'))
def clean_text(text):
text = text.lower() # Convert to lowercase
text = re.sub(r'\W', ' ', text) # Remove non-word characters
text = re.sub(r'\s+', ' ', text).strip() # Remove extra spaces
words = word_tokenize(text) # Tokenization
words = [word for word in words if word not in stop_words] # Remove
stopwords
return " ".join(words)

data['clean_text'] = data['text'].apply(clean_text)
print(data[['text', 'clean_text']])

Step 4: Convert Text to Numbers (Feature Extraction)

Since machine learning models only understand numbers, we use TF-IDF or Bag of Words
to convert text into numerical features.

vectorizer = TfidfVectorizer()
X = vectorizer.fit_transform(data['clean_text']) # Convert text into
numerical features
y = data['sentiment'] # Target labels (0 or 1)

Step 5: Train-Test Split

We split our data into training and testing sets (80%-20%).

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,

random_state=42)

Step 6: Train Logistic Regression Model

Now, we train the model using LogisticRegression.

model = LogisticRegression()
model.fit(X_train, y_train)

Step 7: Make Predictions

We test our model on unseen data.

y_pred = model.predict(X_test)

Step 8: Evaluate the Model

We check accuracy and performance.

print("Accuracy:", accuracy_score(y_test, y_pred))

print("Classification Report:\n", classification_report(y_test, y_pred))

9. Make a Custom Prediction

Let’s check the sentiment of a new review.

def predict_sentiment(review):
clean_review = clean_text(review) # Clean the text
vectorized_review = vectorizer.transform([clean_review]) # Convert to
numerical features
prediction = model.predict(vectorized_review) # Predict sentiment
return "Positive" if prediction[0] == 1 else "Negative"

print(predict_sentiment("I really love this product, it's fantastic!")) #

Positive
print(predict_sentiment("This is the worst thing ever.")) # Negative

Summary of Key Steps

1. Load dataset 📊
2. Clean the text ✨
3. Convert text to numbers 🔢
4. Split into training & testing sets 🏋️‍♂️
5. Train logistic regression model 🤖
6. Test & evaluate performance 📈
7. Predict new sentiments 🎯

Next Steps?
 Try using a larger dataset (like IMDB movie reviews).
 Experiment with different feature extraction methods (CountVectorizer, Word
Embeddings).
 Tune hyperparameters (change solver, C values in LogisticRegression()).
 Explore deep learning models (LSTMs, Transformers) for even better accuracy!

Would you like me to help with dataset sources or improvements? 🚀

Teaching English Through English Movie - Advantages and Disadvantages
No ratings yet
Teaching English Through English Movie - Advantages and Disadvantages
6 pages
Ai Project
No ratings yet
Ai Project
15 pages
Top 10 NLP Question - Answer
No ratings yet
Top 10 NLP Question - Answer
16 pages
Q 3
No ratings yet
Q 3
2 pages
Sentiment Analysis From H El Reviews: Data Mining For Business Intelligence
No ratings yet
Sentiment Analysis From H El Reviews: Data Mining For Business Intelligence
13 pages
Types of Data Represented As Strings
No ratings yet
Types of Data Represented As Strings
2 pages
document-dsbda-codes-for-mini-project
No ratings yet
document-dsbda-codes-for-mini-project
9 pages
nlp_essentials
No ratings yet
nlp_essentials
22 pages
Yousef ML Washin Classification
100% (1)
Yousef ML Washin Classification
333 pages
Group 4 MovieReview
No ratings yet
Group 4 MovieReview
10 pages
17 Practicals
No ratings yet
17 Practicals
7 pages
Sentiment Analysis With NLP Deep Learning
No ratings yet
Sentiment Analysis With NLP Deep Learning
8 pages
Programming Assignment 3: Logistic Regression Instructions
No ratings yet
Programming Assignment 3: Logistic Regression Instructions
3 pages
Lab Report - CSE 816
No ratings yet
Lab Report - CSE 816
17 pages
DFD Description
No ratings yet
DFD Description
2 pages
Neural Networks
No ratings yet
Neural Networks
8 pages
Developing-an-Advanced-Sentiment-Analysis-System-Using-Logistic-Regression-and-Vector-Space-Models
No ratings yet
Developing-an-Advanced-Sentiment-Analysis-System-Using-Logistic-Regression-and-Vector-Space-Models
10 pages
PDF To PowerPoint 642
No ratings yet
PDF To PowerPoint 642
11 pages
Sentimental Analysis
No ratings yet
Sentimental Analysis
3 pages
ML-11
No ratings yet
ML-11
13 pages
Thesis - Aru Omarali
No ratings yet
Thesis - Aru Omarali
34 pages
Data Mining Numericals
No ratings yet
Data Mining Numericals
38 pages
Logistic Regression
No ratings yet
Logistic Regression
10 pages
2023 Aug How To Prepare Data For A Neural Network A Step-by-Step Guide
No ratings yet
2023 Aug How To Prepare Data For A Neural Network A Step-by-Step Guide
7 pages
vertopal.com_C1_W1_Assignment
No ratings yet
vertopal.com_C1_W1_Assignment
16 pages
Amazon Sentiment Analysis Documentation
No ratings yet
Amazon Sentiment Analysis Documentation
4 pages
Ch03 LogisticRegression
No ratings yet
Ch03 LogisticRegression
79 pages
Ml Projrct Article 2
No ratings yet
Ml Projrct Article 2
6 pages
SMA 5
No ratings yet
SMA 5
3 pages
vertopal.com_C1_W3_Logistic_Regression
No ratings yet
vertopal.com_C1_W3_Logistic_Regression
27 pages
MP 1
No ratings yet
MP 1
14 pages
Sentiment Analysis IMDB Review - Presentation
No ratings yet
Sentiment Analysis IMDB Review - Presentation
19 pages
Module4-TextAnalytics
No ratings yet
Module4-TextAnalytics
9 pages
ISSS609 Project Proposal Group 7
No ratings yet
ISSS609 Project Proposal Group 7
8 pages
### Seminar Report
No ratings yet
### Seminar Report
12 pages
Intro to Linear and Logistic Reg
No ratings yet
Intro to Linear and Logistic Reg
5 pages
Lecture 3 Sentiment Analysis
No ratings yet
Lecture 3 Sentiment Analysis
41 pages
Web Mining Unit 2
No ratings yet
Web Mining Unit 2
12 pages
Abusive Content Detection Using Sentimental Analysis Final
No ratings yet
Abusive Content Detection Using Sentimental Analysis Final
18 pages
Amna Bagh ali
No ratings yet
Amna Bagh ali
6 pages
Building An AI Model Capable of Judging User Sentiments
No ratings yet
Building An AI Model Capable of Judging User Sentiments
2 pages
Artificial Intelligence
No ratings yet
Artificial Intelligence
9 pages
Sentiment Analysis Using Naïve Bayes Classifier
No ratings yet
Sentiment Analysis Using Naïve Bayes Classifier
23 pages
Text Classification_movie Review_news Wires
No ratings yet
Text Classification_movie Review_news Wires
5 pages
Final Project Report
No ratings yet
Final Project Report
43 pages
MADHU-IEEE Update
No ratings yet
MADHU-IEEE Update
5 pages
Week 3 - Lecture Slides - Logistic Regression
No ratings yet
Week 3 - Lecture Slides - Logistic Regression
54 pages
Template For The First Slide of PPT Presentation1
No ratings yet
Template For The First Slide of PPT Presentation1
18 pages
BagOfopinionColing10 Camera-Ready Final
No ratings yet
BagOfopinionColing10 Camera-Ready Final
9 pages
Logistic Regression Algorithm
No ratings yet
Logistic Regression Algorithm
8 pages
L13_14_15_16_17_Classification
No ratings yet
L13_14_15_16_17_Classification
123 pages
Week4
No ratings yet
Week4
45 pages
NLP Labsheet-2 Sentiment Analysis Using Naive Bayes Classifier
No ratings yet
NLP Labsheet-2 Sentiment Analysis Using Naive Bayes Classifier
15 pages
C1_W1_Assignment (2)
No ratings yet
C1_W1_Assignment (2)
14 pages
Sentiment Analysis
No ratings yet
Sentiment Analysis
14 pages
FSP Logistics Regression
No ratings yet
FSP Logistics Regression
34 pages
Logistic Regression: "And How Do You Know That These Fine Begonias Are Not of Equal Importance?"
No ratings yet
Logistic Regression: "And How Do You Know That These Fine Begonias Are Not of Equal Importance?"
21 pages
Sentiment Analysis Using Machine Learning Classifiers
No ratings yet
Sentiment Analysis Using Machine Learning Classifiers
41 pages
102679174
No ratings yet
102679174
6 pages
MCS-011: Problem Solving and Programming
From Everand
MCS-011: Problem Solving and Programming
Dr. DK Sukhani
No ratings yet
Composing Software: An Exploration of Functional Programming and Object Composition in JavaScript
From Everand
Composing Software: An Exploration of Functional Programming and Object Composition in JavaScript
Eric Elliott
No ratings yet
IntroductorySheet
No ratings yet
IntroductorySheet
4 pages
Case study 1
No ratings yet
Case study 1
1 page
AI Lec13
No ratings yet
AI Lec13
65 pages
Info Classical Encryption
No ratings yet
Info Classical Encryption
71 pages
AI Lec3
No ratings yet
AI Lec3
22 pages
AI Lec2
No ratings yet
AI Lec2
43 pages
AI Lec5
No ratings yet
AI Lec5
42 pages
Consolidated Advertisement No. 13-2023 Extended
No ratings yet
Consolidated Advertisement No. 13-2023 Extended
5 pages
Postnatal Assessment
100% (1)
Postnatal Assessment
12 pages
Course Outline
100% (1)
Course Outline
3 pages
MPU3162_FIS_(Chapter_5_-_Metaphysics)(3)
No ratings yet
MPU3162_FIS_(Chapter_5_-_Metaphysics)(3)
38 pages
Charles Darwin - Biography
No ratings yet
Charles Darwin - Biography
2 pages
Grade 9
100% (2)
Grade 9
3 pages
Greece Rise of Democracy Student Inteactive Notebook
No ratings yet
Greece Rise of Democracy Student Inteactive Notebook
6 pages
Chapter 6 Recruitment and Selection 1227419638976965 8
No ratings yet
Chapter 6 Recruitment and Selection 1227419638976965 8
59 pages
Jennifer Bjorkman Resume
No ratings yet
Jennifer Bjorkman Resume
3 pages
Resume Ma1
No ratings yet
Resume Ma1
1 page
Mu Afc Paper Example
100% (1)
Mu Afc Paper Example
7 pages
Ortfolio Assessment: Carlo Magno, PHD Lasallian Institute For Development and Educational Research
No ratings yet
Ortfolio Assessment: Carlo Magno, PHD Lasallian Institute For Development and Educational Research
26 pages
Naskah Drama Bahasa Inggris
No ratings yet
Naskah Drama Bahasa Inggris
16 pages
1.4.3.1 Social Action
No ratings yet
1.4.3.1 Social Action
17 pages
Bridge Course - First Year Syllabus
No ratings yet
Bridge Course - First Year Syllabus
2 pages
Pma156 long course complete details all concept ok
No ratings yet
Pma156 long course complete details all concept ok
3 pages
Unit 1: Preliminaries of Reading and Writing For Academic and Professional Purposes
No ratings yet
Unit 1: Preliminaries of Reading and Writing For Academic and Professional Purposes
41 pages
Is The Posthuman Educable On The Convergence of Educational Philosophy Animal Studies and Posthumanist Theory
No ratings yet
Is The Posthuman Educable On The Convergence of Educational Philosophy Animal Studies and Posthumanist Theory
15 pages
Intern PPT Pranav Sriman
No ratings yet
Intern PPT Pranav Sriman
16 pages
DCBN Inicial PDF
No ratings yet
DCBN Inicial PDF
141 pages
Retrieval Form For Tablet
No ratings yet
Retrieval Form For Tablet
1 page
Stored Procedures
No ratings yet
Stored Procedures
17 pages
DBMS Microporject
No ratings yet
DBMS Microporject
25 pages
Accurate English A Complete Course in Pronunciat Book PDF
No ratings yet
Accurate English A Complete Course in Pronunciat Book PDF
4 pages
ODI11g - Deploying and Configuring The ODI Agent As A Java EE Application
No ratings yet
ODI11g - Deploying and Configuring The ODI Agent As A Java EE Application
27 pages
Assessment of CII Best Practices Usage in The Construction Industry
No ratings yet
Assessment of CII Best Practices Usage in The Construction Industry
11 pages
Igcse Cam Math p4 分章
No ratings yet
Igcse Cam Math p4 分章
656 pages
Youth Civic Engagement in Albania
No ratings yet
Youth Civic Engagement in Albania
103 pages
Entrepreneurial Project Guideline Student
No ratings yet
Entrepreneurial Project Guideline Student
7 pages
Ats 2 Handout Jerelle A.
No ratings yet
Ats 2 Handout Jerelle A.
6 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Logistic Regression Example (1)

Uploaded by

Logistic Regression Example (1)

Uploaded by

How Logistic Regression Assigns Scores in Sentiment Analysis (Step-by-Step)

🔹 Step 1: Understanding the Core Idea

 Each word in a review contributes to the final score.

🔹 Step 2: Representing Text as Numbers

1️⃣ "I love this movie" → Positive (1)

Bag of Words (BoW) Representation:

I love hate this movie

🔹 Step 3: Assigning Weights to Words

score=w1⋅x1+w2⋅x2+...+wn⋅xn+b\text{score} = w_1 \cdot x_1 + w_2 \cdot x_2 + ... + w_n \

 wiw_i → Weight assigned to each word (how important it is)

📌 Example Weights (learned during training):

Word Weight (ww)

📌 Bias term: b=−0.5b = -0.5

🔹 Step 4: Calculating the Score

Review 1: "I love this movie"

score=(0.1×1)+(2.5×1)+(−3.0×0)+(0.5×1)+(1.0×1)+(−0.5)\text{score} = (0.1 \times 1) +

Review 2: "I hate this movie"

score=(0.1×1)+(2.5×0)+(−3.0×1)+(0.5×1)+(1.0×1)+(−0.5)\text{score} = (0.1 \times 1) +

🔹 Step 5: Applying the Sigmoid Function

P=11+e−scoreP = \frac{1}{1 + e^{-\text{score}}}

📌 Why do we use sigmoid?

 It squashes the score into a range between 0 and 1.

P=11+e−3.6P = \frac{1}{1 + e^{-3.6}} P≈11+0.027=0.974P \approx \frac{1}{1 + 0.027} =

🔹 Probability = 97.4% → Positive (1) ✅

Review 2: "I hate this movie" (Score = -1.9)

P=11+e1.9P = \frac{1}{1 + e^{1.9}} P≈11+6.69=0.13P \approx \frac{1}{1 + 6.69} = 0.13

🔹 Probability = 13% → Negative (0) ❌

🔹 Step 6: Making the Final Prediction

🎯 The model successfully classified the reviews! 🎯

🔹 Step 7: Training Logistic Regression

How does the model learn?

1. Starts with random weights 📊

🔹 Summary of the Process 🚀

🔹 Why Use Logistic Regression for Sentiment Analysis?

1. What is Sentiment Analysis?

2. Why Use Logistic Regression?

3. Steps to Perform Sentiment Analysis Using Logistic

Step 1: Import Libraries

Note: Run nltk.download('stopwords') and nltk.download('punkt') if you’re using

Step 2: Load Dataset

For simplicity, let’s assume we have a dataset with two columns:

 "text" (contains reviews)

# Sample dataset (usually loaded from CSV)

Step 3: Clean the Text Data

Text data is messy! We need to:

 Remove special characters, numbers, and punctuation

Step 4: Convert Text to Numbers (Feature Extraction)

Step 5: Train-Test Split

We split our data into training and testing sets (80%-20%).

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,

Step 6: Train Logistic Regression Model

Now, we train the model using LogisticRegression.

Step 7: Make Predictions

We test our model on unseen data.

Step 8: Evaluate the Model

We check accuracy and performance.

print("Accuracy:", accuracy_score(y_test, y_pred))

9. Make a Custom Prediction

print(predict_sentiment("I really love this product, it's fantastic!")) #

Summary of Key Steps

Would you like me to help with dataset sources or improvements? 🚀

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.