Data Scientist
Data Scientist
OBJECTIVE EXPERIENCE
+91 8087178017
Core Libraries worked with:
Email Pandas
dheerajwarudkar18@gmail.com NumPy
Seaborn, Matplotlib
Scikit-learn
EDUCATION SciPy
Beautiful Soup 4
MTech: Surface Science and Engineering,
NIT Jamshedpur
BE : Mechanical engineer Machine Learning Algorithms implemented:
College : Sinhgad Academy of Engineering, Linear Regression Decision Tree
Pune Logistic Regression Random Forest
University : Savitribai Phule Pune University K-nearest neighbors Support Vector Machine
Branch : Mechanical engineering K-means clustering XGBoost
Naïve Bayes
Highly experienced Data Scientist with overall 3+ years experience in Data Extraction, Data Modeling, Data
Wrangling, Statistical Modeling, Data Mining, Machine Learning, and Data Visualization.
Proficient in Machine Learning algorithms and Predictive Modeling including Regression Models, Decision Tree,
Random Forest, Sentiment Analysis, Naïve Bayes Classifier, SVM, and Ensemble Models.
Performed univariate and multivariate analysis of the data to identify any underlying pattern in the data and
associations between the variables.
Performed data imputation using the Scikit-learn package in Python.
Worked on data cleaning and ensured data quality, consistency, and integrity using Pandas and NumPy.
Developed and implemented predictive models using machine learning algorithms such as linear regression,
classification, multivariate regression, Naive Bayes, Random Forest, K-means clustering, KNN, PCA, and
regularization for data analysis.
Analyze and Prepare data, and identify the patterns in the dataset by applying historical models. Collaborating
with Senior Data Scientists for understanding the data.
Performed data manipulation, data preparation, normalization, and predictive modeling. Improve efficiency and
accuracy by evaluating models.
Familiar with various ML algorithms such as Linear Regression, Logistic Regression, KNN, K-Means, Naïve Bayes,
Decision Tree, Random Forest, Support Vector Machine, XG Boost, and concepts like Principal Component
Analysis, Mean, Median, and Mode.
Technical Skills
Projects
Project sequence- 1
Responsibilities :
Experience in developing entire frontend and backend modules using Python on Django Web Framework.
Experience in working at various phases of project such as analysis, design, development, and testing.
Using Django Framework model, implemented MVC architecture and developed web applications with superb
interface.
Created user interface of website using Python, HTML5, CSS, JSON and JQuery. Used CSS bootstrap framework
for developing web application.
Developed the business logic in views for the URLs created and linked the webpages to functions in views to
show the output to the end-user or to store information from the website into the database
Worked on Django ORM API to create and insert data into the tables and access the database.
Extensive experience in using the python packages such as NumPy, SciPy, Pandas, Beautiful Soap, Pickle and
OS.
Involved in Preparing Low level Design of Application, To take part in software and architectural development
activities
Collected historical data and third party data from different data source,Improved Operation activities. Used
Linear & Logistic Regression
Understand and Analyse Customer requirements and Business logic ,Perform data cleansing, data imputation
and data preparation using Scikit Learn and Numpy.
Project-Sequence 2
Involved in requirement analysis, design, estimation and testing of the assigned tasks in open stack WITH BA
Interpreting data, analysing results using statistical techniques
Acquiring data from primary or secondary data sources and maintaining databases
Work with stakeholders to determine how to use business data for valuable business solutions
Performed univariate and multivariate analysis of the data to identify any underlying pattern in the data and
associations between the variables.
Performed data imputation using the Scikit-learn package in Python.
Worked on data cleaning and ensured data quality, consistency, and integrity using Pandas and NumPy.
Use predictive models to improve customer experience, ad targeting, revenue generation, and more
Coordinate with various technical/functional teams to implement models and monitor results
Project-Sequence 3
Task Handled:
Implemented Data Exploration to analyze patterns and to select features using Python and SciPy.
Built Factor Analysis and Cluster Analysis Models using Python and SciPy to classify customers into different
Target Groups.
Designed an A/B Experiment for testing the business performance of the new recommendation system.
Evaluated business requirements and prepared detailed specifications that follow project guidelines required to
develop written programs.
Participated in Data Acquisition with Data Engineer team to extract historical and real-time data by using Hadoop-
MapReduce and HDFS.
Performed Data Enrichment jobs to deal missing value, to normalize data, and to select features.
Participated in all phases of Data Mining; Data Collection, Data Cleaning, Developing Models, Validation,
Visualization and performed Gap Analysis.
Extracted data from HDFS and prepared data for Exploratory Analysis using data Munging.
Built models using Statistical techniques like Bayesian HMM and Machine Learning Classification Models like XG
Boost, SVM, Random Forest. etc.
Used Pandas, Numpy, Seaborn, Scipy, Matplotlib, Scikit-learn, NLTK in Python for developing various Machine
Learning Algorithms.