Sarthak 091 Report


AN INTERNSHIP REPORT

ON

DATA ANALYTICS and MACHINE LEARNING

Submitted by

SARTHAK DESAI (200050131091)


Student of

Bachelor of Engineering
in

Computer Science & Engineering Department

Faculty Guide

MS.SONIA PANESAR

Head of the Department

Dr.Nitesh Sureja

Academic year

(2023-2024)

Summer Internship (3170001)
CONFIRMATION LETTER

CERTIFICATE OF COMPLETION

DECLARATION

I hereby declare that the internship report for the SUMMER INTERNSHIP
entitled Data Analytics and Machine Learning, submitted in partial
fulfillment for the degree of Bachelor of Engineering in CSE to Gujarat
Technological University, Ahmedabad, is a bonafide record of original
project work carried out by me at INFOLABZ IT SERVICES PVT. LTD. under
the supervision of Ms. Sonia Panesar, and that no part of this report has
been directly copied from any student's report or taken from any other
source without providing due reference.

SARTHAK DESAI
Student, Babaria Institute of Technology

ACKNOWLEDGEMENT

I would like to express my deepest gratitude to all those who made it possible for me
to complete this internship. I am especially grateful to our Assistant Professor,
Ms. Sonia Panesar, whose stimulating suggestions and encouragement helped me to
coordinate the internship, especially in drafting this report.

Furthermore, I would like to acknowledge with much appreciation the crucial role of
the Head of Department, Dr. Nitesh Sureja, who gave permission to use all required
equipment and the necessary material to complete the task. Last but not least, many
thanks go to my teachers, friends, and family, who invested their full effort in
guiding me towards the goal.

Abstract

Data analytics is one of the most important and helpful factors for business
growth. We can analyze any data using Python libraries such as NumPy and
pandas. Analyzed data can be visualized with Matplotlib. Data analysis is
useful for decision making and for machine learning model development. We
will develop machine learning models from data to predict important
parameters.

COMPANY PROFILE

Established in 2016 in incorporation with our parent IT company, INFOLABZ IT
SERVICES PVT. LTD. has managed to make its own position in the IT sector. We are involved
in Web Development, App Development, Progressive Web Application Development, IoT
solutions, Graphics & Designing, Digital Marketing, Domain & Hosting services, SMS
services, etc.

In the span of seven years, we have managed to deliver all projects on time with utmost
accuracy to our clients across the globe. We have dedicated teams of experienced,
hardworking developers. Our developers, who are always willing to take on new challenges
and look forward to learning new things, are the heart of this company.

Our objective is to sustain exponential growth in the IT industry. Our mission is to deliver
the best with top-notch quality every quarter, and our vision is to develop a product with a
one-of-a-kind concept that could be used by millions of people.

TABLE OF CONTENT

WEEK / DAY NO    CONTENT

Confirmation Letter

Completion Certificate

Declaration

Acknowledgement

Abstract

Company Profile

WEEK 1, 27 JULY 2023

- Basics of Data Analytics, Type of Data

- Dictionary

- Concept of API

- Requests Package

WEEK 1, 28 JULY 2023

- API Handling: Covid API, ISRO API, Bitcoin API, Mutual Fund API

- API Search

WEEK 1, 31 JULY 2023

- Data Visualization using MatPlotLib

- Bar graph, Pie Chart, Multiple Bar Graph, Scatter Plot

WEEK 1, 01 AUGUST 2023

- Data Visualization

- API Data Visualization

WEEK 1, 02 AUGUST 2023

- Assignment Task

- News API

- API Data Visualization

- Dynamic API: PINCODE

WEEK 2, 03 AUGUST 2023

- Pandas: Data Frame

- Pandas: Excel, API to CSV

- XLRD: IPL Data Analytics

WEEK 2, 04 AUGUST 2023

- Introduction to Machine Learning

- Linear Model: Mathematics

- Linear Model Implementation

WEEK 2, 07 AUGUST 2023

- Multiple Linear Model: Mathematics

- Multiple Linear Model Implementation

WEEK 2, 08 AUGUST 2023

- Polynomial Linear Regression Model Implementation

- Image Data

WEEK 2, 09 AUGUST 2023

- Assignment Task

- Convolutional Neural Network Project

WEEK 2, 10 AUGUST 2023

- Conclusion

WEEK-1

DATE: 27 JULY 2023

❖ Basics of Data Analytics, Type of Data:

• Data analytics involves examining, cleaning, and interpreting data to discover
meaningful patterns and insights.
• There are various types of data, including structured data (organized in tables),
unstructured data (lacking a fixed structure, like text), and semi-structured data
(partially organized, such as JSON). Data can also be quantitative (numerical, like
sales figures) or qualitative (descriptive, like customer reviews).
• Time series data involves timestamps, while big data refers to exceptionally large
datasets that are challenging for traditional tools. Streaming data is real-time
information that is continuously generated, like social media posts.
• Each type of data necessitates specific analytic techniques tailored to its nature,
enabling businesses to make informed decisions based on valuable insights.

❖ Dictionary:

• A dictionary is a data structure in computer science and programming that stores
a collection of key-value pairs.
• Each key in a dictionary is unique and is associated with exactly one value. This
association allows for efficient retrieval of values based on their
corresponding keys.
• In Python, dictionaries are denoted by curly braces { }. Each key is separated from
its value by a colon, and the key-value pairs are separated by commas.
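
The properties listed above can be sketched in Python as follows. The record and its
field names are made up for illustration and are not data from the internship.

```python
# Illustrative record; the keys and values are assumptions for this sketch.
student = {"name": "Asha", "branch": "CSE", "semester": 7}

# Retrieve a value by its key.
branch = student["branch"]

# get() returns a default instead of raising KeyError when a key is missing.
city = student.get("city", "unknown")

# Assigning to an existing key replaces its value, so keys stay unique.
student["semester"] = 8

# Iterate over the key-value pairs.
for key, value in student.items():
    print(f"{key}: {value}")
```

Using get() with a default is a common choice when handling API responses, where a
field may or may not be present.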

❖ Concept of API:

• An API (Application Programming Interface) is a set of rules that allows one
software application to interact with another.
• It defines the methods and data formats applications can use to request and
exchange information, enabling seamless communication between different
systems.

❖ Requests Package:

• The Requests package is a popular Python library used for making HTTP requests.
It simplifies the process of sending HTTP requests and handling responses,
providing a high-level interface for interacting with web services and APIs.
• With the Requests package, developers can send various types of requests, such as
GET, POST, PUT, and DELETE, and customize headers, parameters, and
authentication methods easily.
• It also supports handling JSON data, file uploads, and cookies, making it a versatile
tool for web development tasks.
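
A minimal sketch of how a GET request with parameters and headers might be built with
Requests is shown below. The URL, the `pincode` parameter, and the helper name
`fetch_json` are hypothetical and chosen only for illustration. The request is prepared
but not sent, so the final URL can be inspected without network access.

```python
import requests

# Hypothetical endpoint, used only for illustration.
URL = "https://api.example.com/v1/lookup"

# Build the request without sending it, so the final URL can be inspected.
request = requests.Request(
    "GET",
    URL,
    params={"pincode": "390001"},           # hypothetical query parameter
    headers={"Accept": "application/json"},
)
prepared = request.prepare()
print(prepared.method, prepared.url)


def fetch_json(url, params=None, timeout=10):
    """Send a GET request and return the decoded JSON body."""
    response = requests.get(url, params=params, timeout=timeout)
    response.raise_for_status()  # raise an exception on 4xx/5xx responses
    return response.json()
```

Passing a timeout and calling raise_for_status() are defensive choices: without them a
slow or failing server can hang the script or return an error page that is then parsed
as if it were data.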

DATE: 28 JULY 2023

❖ API Handling:

Handling APIs involves making requests to these interfaces, receiving responses, and
processing the data exchanged.
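
The processing step can be sketched as follows. The response body here is a made-up
JSON string standing in for what an API might return; its field names are assumptions
and do not come from any of the APIs listed below.

```python
import json

# Hypothetical response body, standing in for the text an API might return.
raw_body = (
    '{"status": "ok", "data": ['
    '{"name": "A", "price": 10.5}, {"name": "B", "price": 7.0}]}'
)

payload = json.loads(raw_body)  # JSON text -> Python dictionary

# Check the status field before trusting the rest of the payload.
if payload.get("status") != "ok":
    raise ValueError("API reported a failure")

# Pull the fields of interest out of each record.
prices = {item["name"]: item["price"] for item in payload["data"]}
total = sum(prices.values())
print(prices, total)
```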

• COVID API:

• ISRO API:

• Bitcoin API:

• Mutual Fund API:

❖ API Search:

• API search involves using search engines or directories to find a specific API based
on functionality and requirements, enabling developers to integrate external services
into their applications efficiently.
• These searches often involve keywords, tags, or categories to narrow down the
options and find the most appropriate API for integration.

DATE: 31 JULY 2023

❖ Data Visualization using MatPlotLib:

• It refers to the process of creating visual representations of data, such as charts,
graphs, and plots, using the MatPlotLib library in Python.
• It allows developers to display data trends, patterns, and relationships effectively.
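
A minimal sketch of two of the chart types covered on this day is shown below. The
categories and values are made up for illustration, and the non-interactive "Agg"
backend is an assumption so that the script runs without a display.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend, so no display is needed
import matplotlib.pyplot as plt

# Made-up values for illustration only.
categories = ["A", "B", "C"]
values = [12, 7, 15]

fig, (ax_bar, ax_pie) = plt.subplots(1, 2, figsize=(8, 4))

# Bar graph: one bar per category.
bars = ax_bar.bar(categories, values, color="steelblue")
ax_bar.set_title("Bar graph")
ax_bar.set_xlabel("Category")
ax_bar.set_ylabel("Value")

# Pie chart: each slice is labeled with its share of the total.
ax_pie.pie(values, labels=categories, autopct="%1.1f%%")
ax_pie.set_title("Pie chart")

fig.tight_layout()
# Call fig.savefig("charts.png") to write the figure to a file,
# or plt.show() in an interactive session.
```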

• Bar graph:

• Pie Chart:

• Multiple Bar Graph:

• Scatter Plot:

DATE: 01 AUGUST 2023

❖ Data Visualization:

• Data visualization is the graphical representation of information and data.
• It uses visual elements like charts, graphs, and maps to help viewers understand
the patterns, trends, and insights within the data.
• Visualization simplifies complex data sets, making them easier to interpret and
analyze, aiding in better decision-making and communication of findings.

❖ API Data Visualization:

• API data visualization involves using APIs (Application Programming Interfaces)
to fetch data from various sources and then using visualization tools and libraries
to represent that data graphically, making complex information more accessible
and understandable.
• It allows for real-time updates and interactive charts, enhancing the way
data-driven insights are presented and understood.

DATE: 02 AUGUST 2023

❖ Assignment Task:

• News API:

• API Data Visualization:

• Dynamic API: PINCODE

WEEK-2

DATE: 03 AUGUST 2023

❖ Pandas: Data Frame

• A pandas DataFrame is a two-dimensional, size-mutable, and potentially
heterogeneous tabular data structure with labeled axes (rows and columns).
• It is a primary data structure in the pandas library for data manipulation and
analysis in Python, offering functionalities similar to a spreadsheet or SQL table.
• DataFrames allow easy indexing, filtering, grouping, and statistical operations on
data, making them invaluable for data analysis and manipulation tasks.
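
The filtering, grouping, and statistical operations mentioned above can be sketched as
follows. The column names and numbers are made up for illustration and are not taken
from the IPL data used in the internship.

```python
import pandas as pd

# Made-up match data for illustration only.
df = pd.DataFrame(
    {
        "team": ["A", "B", "A", "C"],
        "runs": [160, 145, 182, 150],
        "wickets": [6, 8, 4, 7],
    }
)

high_scores = df[df["runs"] > 150]               # filtering rows by a condition
runs_by_team = df.groupby("team")["runs"].sum()  # grouping and aggregating
mean_wickets = df["wickets"].mean()              # a statistical operation

print(high_scores)
print(runs_by_team)
print(mean_wickets)
```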

❖ Pandas: Excel, API to CSV

❖ XLRD: IPL Data Analytics

DATE: 04 AUGUST 2023
❖ Introduction to Machine Learning:

• Machine Learning (ML) is the study of algorithms and statistical models that
enable computer systems to improve their performance on a specific task over time
without being explicitly programmed.
• ML algorithms learn from data patterns and make predictions or decisions based
on that learning.
• It is widely used in applications such as image recognition, language translation,
and recommendation systems, transforming data into actionable insights and
automating complex tasks.
• ML techniques include supervised learning (predictive modeling), unsupervised
learning (pattern recognition), and reinforcement learning (decision making
through trial and error).

❖ Linear Model: Mathematics

• In mathematics, a linear model represents a relationship between variables using a
linear equation.
• It assumes a straight-line relationship between the input variables (independent
variables) and the output variable (dependent variable).
• Mathematically, a linear model can be expressed as:

Y = aX + b
Where:
• (Y) is the output variable.
• (X) is the input variable.
• (a) is the slope, the coefficient that represents the steepness of the
relationship.
• (b) is the y-intercept, the value of (Y) when (X) is 0.

• Linear models are widely used in various fields for tasks like regression analysis,
trend prediction, and correlation assessment due to their simplicity and
interpretability.
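
As a sketch of how (a) and (b) might be estimated from data, the example below assumes
ordinary least squares as the fitting criterion; the report does not state which method
was used. The helper name `fit_line` and the sample points are made up for illustration.

```python
def fit_line(xs, ys):
    """Return (a, b) that minimize the squared error of Y = aX + b."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope: covariance of X and Y divided by the variance of X.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    a = sxy / sxx
    # Intercept: the fitted line passes through the point of means.
    b = mean_y - a * mean_x
    return a, b


# Made-up points that lie exactly on Y = 2X + 1.
xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]
a, b = fit_line(xs, ys)
print(a, b)
```

With noisy real data the points will not lie exactly on a line, and the same formulas
give the line with the smallest total squared vertical distance to the points.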

❖ Linear Model Implementation

DATE: 07 AUGUST 2023

❖ Multiple Linear Model: Mathematics:

• In mathematics, a Multiple Linear Model represents the relationship between a
dependent variable (Y) and multiple independent variables (X₁, X₂, ..., Xₙ) using a
linear equation. The model assumes that the relationship between the variables is
linear.

• Mathematically, a multiple linear model can be expressed as:

Y = a₁X₁ + a₂X₂ + ... + aₙXₙ + b

Where:
• (Y) is the dependent variable.
• (X₁, X₂, ..., Xₙ) are the independent variables.
• (a₁, a₂, ..., aₙ) are the coefficients representing the impact of each
independent variable on the dependent variable.
• (b) is the y-intercept, the value of (Y) when all independent
variables are 0.

• Multiple linear models are used when there are multiple predictors influencing
the outcome, and they are essential in various fields, including statistics,
economics, and engineering, for analyzing complex relationships between
variables.
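
A minimal sketch of fitting the coefficients (a₁, a₂) and the intercept (b) with NumPy
is shown below. It assumes a least-squares fit, which the report does not state
explicitly, and the data are generated from a made-up equation for illustration.

```python
import numpy as np

# Made-up data generated exactly from Y = 2*X1 + 3*X2 + 1.
X = np.array([[1.0, 2.0], [2.0, 1.0], [3.0, 4.0], [4.0, 3.0], [5.0, 5.0]])
y = 2.0 * X[:, 0] + 3.0 * X[:, 1] + 1.0

# Append a column of ones so the intercept b is fitted with the coefficients.
design = np.column_stack([X, np.ones(len(X))])

# Solve the least-squares problem design @ [a1, a2, b] ≈ y.
solution, *_ = np.linalg.lstsq(design, y, rcond=None)
a1, a2, b = solution
print(a1, a2, b)
```

Adding the column of ones is a design choice that lets a single solver call return the
intercept together with the other coefficients.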

❖ Multiple Linear Model Implementation:

DATE: 08 AUGUST 2023

❖ Polynomial Linear Regression Model Implementation:

• Implementing a Polynomial Linear Regression Model involves extending the
simple linear regression model to handle more complex relationships between
variables.
• In this case, the relationship between the independent variable (X) and the
dependent variable (Y) is modeled as an (n)th degree polynomial equation:

Y = aₙXⁿ + aₙ₋₁Xⁿ⁻¹ + ... + a₂X² + a₁X + a₀

Here:
• (Y) is the dependent variable.
• (X) is the independent variable.
• (aₙ, aₙ₋₁, ..., a₂, a₁, a₀) are coefficients to be determined through the
regression process.
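
As a sketch of how the coefficients might be determined, the example below fits a
degree-2 polynomial with NumPy's polyfit, which performs a least-squares fit. The choice
of polyfit and the sample data are assumptions for illustration; the report does not
state which library routine was used.

```python
import numpy as np

# Made-up points that lie exactly on Y = X^2 - 2X + 3.
x = np.array([-2.0, -1.0, 0.0, 1.0, 2.0, 3.0])
y = 1.0 * x**2 - 2.0 * x + 3.0

# polyfit returns the coefficients with the highest degree first: [a2, a1, a0].
coeffs = np.polyfit(x, y, deg=2)

# poly1d wraps the coefficients in a callable polynomial for prediction.
predict = np.poly1d(coeffs)
print(coeffs, predict(4.0))
```

The model is still linear in its coefficients, which is why it can be fitted with the
same least-squares machinery as the linear models above.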

❖ Image Data:

DATE: 09 AUGUST 2023

❖ Assignment Task:

❖ Convolutional Neural Network Project:

DATE: 10 AUGUST 2023

❖ Conclusion:

In conclusion, my two-week internship at INFOLABZ IT SERVICES PVT. LTD. has been
an immensely valuable journey. I am profoundly thankful to my supervisors for their
guidance and unwavering support throughout this period. Their mentorship has been
instrumental in my growth. Additionally, I extend my gratitude to my college professors,
whose teaching laid the foundation for my success in this internship. This experience has not
only expanded my knowledge but also deepened my appreciation for the practical
applications of data analytics and machine learning in the professional world.
