Skip to content

nehalvaghasiya/ml-nlp-projects

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 

Repository files navigation

Data Science Portfolio


Repository containing portfolio of data science projects completed by me for academic, self learning, and hobby purposes. Presented in the form of iPython Notebooks.

Contents

  • Spam detection: A classification model to detect whether a text message is spam or not (aka Ham) using simple logistic regression in PyTorch.

  • Sentiment analysis: A classification model to predict the sentiment of text (Positive or Negative) using RNN, LSTM deep learning models in PyTorch. Used dropout to get rid of Overfitting.

  • Autocorrection: A Probabilistic language model with NLP to implement an auto-correct system with the help of minimum edit distance algorithm. (for example, if you type the word 'dys', chances are very high that you meant to write 'days').

  • Autocompletion: A Probabilistic language model which assigns the probability to a sequence of words, in a way that more likely sequences receive higher scores. (for example, if you type 'hey how are', then you can find a word 'you' that has the highest probability to match with the sentence 'hey how are'.).

  • Identificarion of duplicate Questions: A model to detect duplicate questions in the websites like Quora, Stack Overflow, using deep learning techniques, including Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM).

  • Neural Machine Translation: A sequence-to-sequence (seq2seq) model to translate the given sentence from German to English. Both the input and output are sentences. In other words, these sentences are a sequence of words going in and out of a model. This is the basic idea of Sequence-to-Sequence modeling which contains 2 different recurrent neural network (RNN) models knowns as an encoder and a decoder.

  • Text Summarization: An attention model to summarize the text based on deep learning techniques.

  • Covid-19 Image Classification: An Image classsification model to classify the input image in three categories (If lungs are Normal, infected due to covid, or viral Pneumonia) using pretrained model of ResNet34.

  • Christmas Multiclass Classification: A multiclass Image classification model to classify the images in Nine categories (Christmas Cookies, Christmas Presents, Christmas Tree, Fireworks, Penguin, Reindeer, Santa, Snowman) using AlexNet pretrained model.

CO2 emission prediction CO2 emission prediction CO2 emission prediction CO2 emission prediction
Ecommerce Customers Ecommerce Customers Ecommerce Customers Ecommerce Customers
House Price Prediction House Price Prediction House Price Prediction House Price Prediction
click_on_Ad_or_not click_on_Ad_or_not click_on_Ad_or_not click_on_Ad_or_not click_on_Ad_or_not click_on_Ad_or_not click_on_Ad_or_not
Kyphosis project Kyphosis project Kyphosis project Kyphosis project Kyphosis project Kyphosis project Kyphosis project

About

Collection of machine learning and NLP projects demonstrating various models and techniques.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy