0% found this document useful (0 votes)
3 views9 pages

Complete Excel Sheet

The document presents a dataset containing information on individuals' height, weight, gender, smoking status, age, and claim amounts. It also outlines various data analysis techniques, including descriptive and inferential analytics, as well as linear regression methods. Additionally, it provides details on Excel functions for data manipulation and analysis, such as COUNT, IF, and VLOOKUP.

Uploaded by

Aakansha Yadav
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as XLSX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views9 pages

Complete Excel Sheet

The document presents a dataset containing information on individuals' height, weight, gender, smoking status, age, and claim amounts. It also outlines various data analysis techniques, including descriptive and inferential analytics, as well as linear regression methods. Additionally, it provides details on Excel functions for data manipulation and analysis, such as COUNT, IF, and VLOOKUP.

Uploaded by

Aakansha Yadav
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as XLSX, PDF, TXT or read online on Scribd
You are on page 1/ 9

id Height Weight DependantGender Smoking Age Claim

1 173 76 2M YES 35 40000


2 170 76 2M YES 50 70000
3 150 67 3M YES 30 35000
4 150 60 2M YES 45 65000
5 167 65 2M NO 56 70000
6 160 62 2F NO 79 80000
7 180 76 2F NO 30 35000
8 190 82 1F NO 67 70000
9 170 72 0F NO 55 40000
10 155 67 3F YES 35 30000
11 167 68 3F YES 50 60000
12 160 64 1F YES 55 62000
13 180 80 1M NO 80 80000
14 190 85 1F NO 32 15000
15 170 74 1M NO 66 40000
16 172 77 2M YES 52 71000
17 150 65 3M YES 35 36000
18 150 60 2M YES 45 65000
19 180 80 1M NO 80 80000
20 185 83 1F NO 32 15000
21 170 74 1M NO 66 40000
22 157 68 3F YES 35 31000
23 165 66 3F YES 50 58000
24 162 65 1F YES 55 63000
25 165 63 2M NO 56 68000
26 162 63 2F NO 79 81000
27 178 74 2F NO 30 33000
1. Text to column conversion : Data > Text to columns> delimiton> Space.
2. For Table Headers : Press ctrl+T Data Types
3. Data Analysis - Excel, VBA, BI tools Time series
Data Analytics - EDA, Stats, Visualtion , ML, DL Cross sectional
a. Descriptive analytics Panel data
b. Inferential analytics
c. Predictive analytics ** A particular column also called Feature
d. Prescriptive analytics
4. Finding mean, median, mode ,Standard devi. etc,
step 1. go to data analytics
2 select descriptive statistics
3 give range
(in next sheet)
5. Univariate Analysis Bivariate Analysis
Numerical coulmn Numerical-Numerial Numerocal -Categorical Categorical-Categorical
Histogram Scatterplot Column Plot clustered column chart usingCross-tab
Box Plot Line Plot Categorical Plot

6. Coorelation chart : Go to Data Analysis > Corelation


mn also called Feature

al-Categorical Multivariate Analysis


column chart usingCross-tab Numerical Values Categorical-Categorical-Numerical Numerical-Numericsal-Categ
Correlaiton chart- Heat Map Cross-tab like aggression table Clustered Columns
with clusteres column
Numerical-Numericsal-Categorical
Clustered Columns
Claim

Mean 53074.07
Standard Error 3930.278
Median 60000
Mode 40000
Standard Deviation 20422.32
Sample Variance 4.17E+08
Kurtosis -1.166535
Skewness -0.281556
Range 66000
Minimum 15000
Maximum 81000
Sum 1433000
Count 27
** LINEAR REGRESSION

FORMULA TO CHECK BEST FIT LINE : R^2 = 1- SSR/SST SST = Totalsum squared errors
or R^2 = SST-SSR/SST SST = Totalsum squared errors

R^2 : It is the degree or percentage of variance explained in my data set explained by my best fit line.
R^2 value depends on value of annova and it must be <0.05
y

SST y^

y^ - y = SSE
y-y SSR

R^2 = SSE/SST [Explaned varience / Total variance]


FUNCTIONS AND THEIR FUNTCTION

1. COUNT : FOR COUNTING HOW MUCH DATA (COLUMNS) A ROW IS CONTAINING .


2. Stdp ( standard deviation population) & Stds ( standard deviation sample)
3. CELL REFERENCING : :
A. Relative
B. Absolute ( e.g. : $A$1 for specific row and column
C. Mixed (e.g. : $A1 or A$1 for specific row or column
4. IF function : from intro table IF(E2="M",1,0)
5. IF & AND : e.g. for male and smoker : IF(AND(E2="M",F2="YES"),1,0)
6. IF & OR : IF(OR(E2="M", F2="YES"),1,0)
7. COUNTIF : to count how many yes or not in row etc.
8. COUNTBLANK : to count blank cells.
9. COUNTIFS : to count multiple countif…..e.g. yes ,no and male & female at one time .
10. DATE Function : for date format
EOMONTH : for finding before/after months from a specific date
EDATE: eomonth gave end of month date (30,31) but edate shows exactly before/after month date.
11. LEN : to deletermine length of any word.
12. CONCATENATE : to join the two strings or names etc.
13. LEFT : to give the left string or name
RIGHT: to give the right string or name
14. TRIM: removes space from the string or name. left trim removes space in left end and vice versa.
15. INDEX : function returns the value of cell within a given range based on row and column numbers
16. MATCH : returns the position of a value in a given range.
17. VLOOKUP AND Hlookup
18. FLOOR : rounds given number down to the nearesr multiple of a specified "significance value".
Count (range)

r month date.

mn numbers

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy