All Codes

This document provides an introduction to working with vectors and data frames in R. It shows how to: 1) Create vectors using c() and assign them to variables using = or <-. Vectors can contain numeric, character, or other data types. 2) Access elements of a vector using indexing with []. Elements can be accessed by position, sequence, or specific indices. 3) Perform operations on vectors of the same length like addition or sorting. 4) Combine vectors into a data frame using data.frame() and access values using [row,column] indexing. 5) Use functions like head(), tail(), str(), and summary() to explore the structure and contents of data

Uploaded by

Armaan Choudhary

DAY1 - R

# The # symbol is used for commenting

# Variables in R - There are two ways to assign a value to
# a variable

# Option 1: By using the = sign
a = 10

# Option 2: By using <-
a <- 10

# Rule for naming a variable - a variable name cannot start
# with a special character

# Kinds of values you can assign to a variable
# I can assign - Number, String
# Anything written inside " " will be taken as a string
a = "10" # This is the string "10"
b = 20 # This is the number 20

# Vectors - 1D arrangement of data which should contain
# data of the same data type. I can build a data frame from
# vectors

# I will create a numerical vector
age = 38 # Single element vector
age = c(38,35,40,42)

# If you want to assign a sequence of numbers to a variable
z = 10:50

# If you want to assign a sequence of numbers with a specific interval
z = seq(10,50, by = 5)
z = seq(10,50,5)

# How to create a vector of strings
name = c("Amit", 'Sunil', 'Raj', "Mohan")

# A few key commands on vectors
# The length() command gives the number of elements in a vector
length(name)

# nchar() is used to find the number of characters in each element
nchar(name)

# Math operators on vectors
# If the vector lengths are the same, the math operation works index by index
a = c(10,20,30,40)
b = c(50,60,70,80)
a+b

# If the vector lengths are different - recycling process
# (the shorter vector is reused from its start; R gives a warning when the
# longer length is not a multiple of the shorter one)
a = c(10,20,30,40,20)
b = c(50,60, 70)
a+b

# Accessing the elements of a vector using square bracket indexing
age = c(38,35,40,42)
name = c("Amit", 'Sunil', 'Raj', "Mohan")

# If you want to access a single element of the vector
age[2]

# You can access the elements based on a sequence of index values
age[2:4]

# You want to access specific index values
age[c(2,4)]

# If you want to access elements by excluding a few elements
age = c(38,35,40,42)
age[c(-1,-4)] # It will exclude the 1st element and the 4th element

# Create another vector as location
location = c('Mumbai', 'Bangalore', 'Kochi', 'Delhi')

# Sorting the vector - sort() command
# By default the arrangement is low-high
# sort() returns a sorted copy; age itself is left unchanged here so that
# it stays aligned with name and location in the data frame below
sort(age)
sort(location)

# If you want to arrange it from highest to lowest
sort(age, decreasing = TRUE)
sort(location, decreasing = TRUE)

################### Data frame ###########################
# Let's convert the name, age and location vectors to a data frame
# The command is data.frame(v1, v2, v3, ...)
df = data.frame(age, name, location)

# Accessing the values from a data frame by using square brackets
# I want to access the 2nd row, 3rd column value
df[2,3]

# I want to access the 2nd and 4th rows and the 3rd column
df[c(2,4), 3]

# I want to access all rows from the 2nd row onwards and
# the 2nd & 3rd columns
df[c(2:4),c(2,3)]

# Some key commands in data frame
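Before moving on to the data frame helper commands, here is a small base R sketch of the indexing and recycling rules covered so far. The vectors x and y and the data frame people are illustrative stand-ins, and suppressWarnings() is only there to keep the expected length-mismatch warning quiet.

```r
# Illustrative toy vectors (names chosen for this sketch only)
x <- c(10, 20, 30, 40, 20)   # length 5
y <- c(50, 60, 70)           # length 3

# Recycling: y is reused from its start, i.e. 50, 60, 70, 50, 60
# R normally warns here because 5 is not a multiple of 3
recycled_sum <- suppressWarnings(x + y)
print(recycled_sum)          # 60 80 100 90 80

# Negative indices drop elements instead of selecting them
print(x[c(-1, -4)])          # 20 30 20

# Data frame indexing is always [row, column]
people <- data.frame(
  age = c(38, 35, 40, 42),
  name = c("Amit", "Sunil", "Raj", "Mohan"),
  location = c("Mumbai", "Bangalore", "Kochi", "Delhi"),
  stringsAsFactors = FALSE
)
print(people[2, 3])          # "Bangalore"
print(people[c(2, 4), 3])    # "Bangalore" "Delhi"
```

Leaving a row or column position empty, as in people[2, ], selects everything along that dimension.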
# View() is used to view your data
View(df)

# head() is used to see the top rows of my data.
# By default head() will give you the first 6 rows
head(df,2)

# tail() is used to see the bottom rows of your data
tail(df,2)

# nrow() is used to find the number of rows
nrow(df)

# ncol() is used to find the number of columns
ncol(df)

# names() is used to print the column names
names(df)

# str() is used to find the structure and data types of your data
str(df)

# Data types in R
# R can have the following data types
# 1) int
# 2) num
# 3) char - any column containing text
# 4) factor - categorical column - which divides the data into
# non-overlapping categories
# 5) date and time
# 6) geo data - state, pin code, district, countries etc

# summary() - this gives a statistical summary of your data
summary(df)

# Importing the data set from local drive to R
# Importing the excel file
# install the package - readxl
install.packages('readxl', dependencies = TRUE)
library(readxl)

DAY2-R

############## Importing data set in R ###########################
# We will import an excel file
# we need library - readxl
library(readxl)

# Importing the sample super store data set
orders = read_excel("C:/Training data/Amit/Data Set/Sample-Superstore-Subset-Excel.xlsx")

# Importing the returns
returns = read_excel("C:/Training data/Amit/Data Set/Sample-Superstore-Subset-Excel.xlsx", sheet = 2)

# Importing the 3rd sheet - users
users = read_excel("C:/Training data/Amit/Data Set/Sample-Superstore-Subset-Excel.xlsx", sheet = 3)

#### Explore the data using the DPLYR package
library(dplyr)

# DPLYR has 6 key functions - select, filter, group_by, summarise,
# arrange and mutate
# We can use any one of these functions or a combination of them
# by using the pipe operator %>%

# select() - this helps to select the columns of your data set
# PS - Create a new data frame which contains only region, sales & profit
df_new = orders %>% select(Region, Sales, Profit)

# If you want to select a column whose name contains 2 words, wrap the name in backticks
df_new1 = orders %>% select(`Order Priority`)

# filter() - It will help you filter your data based on a specific
# condition
# PS - Filter the data for the South region
# NOTE: we need to use == for comparing the values
df_south = orders %>% filter(Region == "South")

# PS - Find the number of rows for Central with sales more than 2000 USD
# NOTE: I can use multiple filter conditions by using the AND / OR operators
# In R - & is the AND operator, | is the OR operator
df1 = orders %>% filter(Region == 'Central' & Sales > 2000) %>% nrow()
View(df1)

# PS - Find the number of rows where
# product category is Technology or Furniture
# and sales is more than 2000
df2 = orders %>%
  filter((`Product Category` == 'Technology' | `Product Category` == "Furniture") & Sales > 2000) %>%
  nrow()
View(df2)

# group_by() - This function will help you to group the data on a
# specific column
# NOTE: A group_by() call is normally followed by a summarise() call
# summarise() - Will help you to aggregate the data by using math
# operators like count (n()), sum(), mean(), min(), max() etc

# PS - Find the total sales for each region
df3 = orders %>% group_by(Region) %>% summarise(sum(Sales))
View(df3)

# PS - Find the average profit for each product category?
df4 = orders %>% group_by(`Product Category`) %>%
  summarise(Avg_profit = mean(Profit))
View(df4)
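The Superstore Excel file is not bundled with these notes, so the filter and group_by + summarise logic can be tried on a small stand-in table instead. The sketch below is an assumption-laden illustration: toy_orders and its values are invented, the column is called Category (no space) for brevity, and base R functions (subset(), nrow(), aggregate()) are swapped in for the dplyr verbs so that it runs without installing any package.

```r
# Invented stand-in for the orders table (values are illustrative only)
toy_orders <- data.frame(
  Region = c("South", "Central", "Central", "West", "South", "Central"),
  Category = c("Technology", "Furniture", "Technology",
               "Office Supplies", "Furniture", "Technology"),
  Sales = c(2500, 1800, 3200, 400, 2100, 900),
  Profit = c(300, -50, 640, 35, 120, 90),
  stringsAsFactors = FALSE
)

# Same idea as: filter(Region == 'Central' & Sales > 2000) %>% nrow()
central_big <- nrow(subset(toy_orders, Region == "Central" & Sales > 2000))
print(central_big)    # 1

# Same idea as the Technology-or-Furniture filter: the OR part needs its
# own parentheses so that it is evaluated before the AND
tech_or_furn <- nrow(subset(
  toy_orders,
  (Category == "Technology" | Category == "Furniture") & Sales > 2000
))
print(tech_or_furn)   # 3

# Same idea as: group_by(Region) %>% summarise(sum(Sales))
total_by_region <- aggregate(Sales ~ Region, data = toy_orders, FUN = sum)
print(total_by_region)
```

Dropping the inner parentheses in the second filter changes the result, because & binds more tightly than | in R.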
# NOTE: We can group_by on multiple columns
# PS - Find the average sales for each product category in
# each region
df5 = orders %>% group_by(Region,`Product Category`) %>%
  summarise(Avg_sales = mean(Sales))
View(df5)

# NOTE: We can summarise more than one column
# PS - Find the total sales and average profit for each region
df6 = orders %>% group_by(Region) %>%
  summarise(Total_sales = sum(Sales), Avg_profit = mean(Profit))
View(df6)

# PS - For technology product category
# Find the average profit and average sales for each region
# Option 1: filter first, then group
df7 = orders %>% filter(`Product Category` == "Technology") %>%
  group_by(Region) %>% summarise(Avg_profit = mean(Profit),
                                 Avg_sales = mean(Sales))
View(df7)

# Option 2: group first, then filter
df8 = orders %>% group_by(Region, `Product Category`) %>%
  summarise(Avg_profit = mean(Profit), Avg_sales = mean(Sales)) %>%
  filter(`Product Category` == 'Technology')
View(df8)

# arrange() - This will help you to sort the data
# By default the arrangement is in ascending order. If you want to
# arrange the data in descending order you must put a "-" sign
# in front of the column name

# PS - Find the product name with highest average sales
df9 = orders %>% group_by(`Product Name`) %>%
  summarise(Avg_sales = mean(Sales)) %>% arrange(-Avg_sales) %>%
  head(1)
View(df9)

# PS - Find the customer based on customer ID with highest total sales
df10 = orders %>% group_by(`Customer ID`, `Customer Name`) %>%
  summarise(Total_sales = sum(Sales)) %>% arrange(-Total_sales) %>%
  head(1)
View(df10)

# mutate() - This command helps to create a new column with the help
# of current columns
# PS - Find the product name with highest shipping cost to sales ratio
df11 = orders %>% group_by(`Product Name`) %>%
  summarise(Total_shipping_cost = sum(`Shipping Cost`),
            Total_sales = sum(Sales)) %>%
  mutate(Ratio = Total_shipping_cost/Total_sales) %>%
  arrange(-Ratio) %>% head(1)
View(df11)

# Merge / Join - To merge two tables we can perform
# Inner Join - Will give only the common rows between the data sets
# Left Join - Will give ALL the rows of your left table
# and matching values from the right table
# Right Join - ALL the rows of the right table
# and matching values from the left table

# Syntax for the merge command:
# merge(x = leftTableName, y = rightTableName,
#       by.x = common column name from left table,
#       by.y = common column name from right table)
# NOTE: By default it will give you an inner join

# PS - Find the total sales for each manager?
df12 = merge(x = orders, y = users, by.x = 'Region',
             by.y = 'Region')
View(df12)
df13 = df12 %>% group_by(Manager) %>% summarise(Total_sales = sum(Sales))
View(df13)

# How to convert the inner join to a left join or a right join
left = merge(x = orders, y = users, by.x = 'Region',
             by.y = 'Region', all.x = TRUE)
Right = merge(x = orders, y = users, by.x = 'Region',
              by.y = 'Region', all.y = TRUE)

# Extracting information like month, day, year etc from a
# date column
# PS - Which month had the highest sales?
orders$Month = format(orders$`Order Date`, "%B")
df14 = orders %>% group_by(Month) %>% summarise(Total_sales = sum(Sales))
View(df14)

DAY3 – R
# In this session - 1) We will see the basic plotting using ggplot2
# 2) How to build basic predictive models

# Plotting with GGPLOT2
library(readxl)
library(ggplot2)
library(dplyr)

orders = read_excel("C:/Training data/Amit/Data Set/Sample-Superstore-Subset-Excel.xlsx")

# PS - Plot a bar chart to present total sales for each region
df1 = orders %>% group_by(Region) %>% summarise(Total_sales = sum(Sales))
View(df1)

# Plotting a bar chart using ggplot
# NOTE: Define your x axis and y axis inside the command aes()
# NOTE: In geom_bar() you need to mention an additional argument -
# stat = "identity"

# Step 1: ggplot() will help you to define your x-axis and y-axis
# Step 2: You can use the reorder() command to arrange the bars of your chart.
# Step 3: We will use geom_bar() to draw our bar chart - use stat = "identity"
# only in a BAR CHART
# Step 4: labs() to label the x-axis, the y-axis and the title of the chart
# Step 5: geom_text() to put data labels on your chart, with vjust and hjust to
# adjust the data labels
# Step 6: theme() to remove any text from your axis by using axis.text.y
# and so on

plot1 = ggplot(df1, aes(x = reorder(Region, -Total_sales), y = Total_sales)) +
  geom_bar(stat = 'identity', width = 0.4) +
  labs(x = "Region", y = "Total Sales", title = "Region wise sales") +
  geom_text(label = df1$Total_sales, vjust = -0.25) +
  theme(axis.text.y = element_blank())

plot1

# We can plot a scatter plot between sales and profit using geom_point()
plot2 = ggplot(orders, aes(x = Sales, y = Profit)) + geom_point()
plot2

Data Modelling – R

# Predictive data modelling
# Based on the data from your past, you need to predict the values.

# Based on the mtcars data you need to find what will be the mileage of a car
# with:
# 5 cyl, 120 - displ, 98 hp, 5 gears, mpg = ???

View(mtcars)

# Step 1: Pick the variables which are required in your model

library(dplyr)

df = mtcars %>% select(cyl, disp, hp, gear, mpg)

View(df)

# Step 2: Split our data into training (80%) and test (20%) data.
# This split is a random split

index = sample(1:32, 0.8*32)

index

# filter the training rows and test rows

train = df[index, ]

test = df[-index, ]

# Step 3: Building the model on training data - Linear regression model

model = lm(mpg ~ cyl + hp + gear + disp , data = train)

# View the model

summary(model)

# Thus the equation will be :
# (these coefficients come from one particular random split; without a fixed
# seed your own numbers will differ)

# mpg = -0.82 * cyl - 0.04 * hp + 1.97 * gear - 0.007 * disp + 25.705

# Step 4: We will predict the test values using this model

View(test)

test$pred_milage = predict(model, test)

# Step 5: Find the error by using RMSE - Root mean square error

test$error_squre = (test$mpg - test$pred_milage)^2

# Root of the mean of the squared errors

RMSE = sqrt(mean(test$error_squre))

RMSE

# Step 6: If you are OK with the RMSE and the adjusted R-squared value, it's time now
# to predict the value for 5 cyl, 120 - displ, 98 hp, 5 gears, mpg = ???

val_data = data.frame(cyl = 5, disp = 120, hp = 98, gear = 5)

View(val_data)

predict(model, val_data)

# Upper limit (26.78 is the mpg value that predict() returned above in this session)

26.78 + RMSE

# Lower limit

26.78 - RMSE
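Steps 2 to 6 of the modelling walk-through can be run end to end on mtcars, which ships with R. The sketch below is a minimal version under stated assumptions: the seed value 42, the object names (cars, train_rows, rmse, new_pred, by_hand) and the use of floor() are choices made for this illustration and are not taken from the original session, so the fitted coefficients will not match the equation in the notes.

```r
# mtcars is a built-in data set, so no import is needed
cars <- mtcars[, c("cyl", "disp", "hp", "gear", "mpg")]

set.seed(42)  # assumption: a fixed seed makes the random split repeatable
n <- nrow(cars)
train_rows <- sample(seq_len(n), floor(0.8 * n))  # 25 of the 32 rows
train <- cars[train_rows, ]
test <- cars[-train_rows, ]

# Step 3: linear regression on the training rows
model <- lm(mpg ~ cyl + hp + gear + disp, data = train)

# Steps 4 and 5: predict the held-out rows and compute the RMSE
pred <- predict(model, test)
rmse <- sqrt(mean((test$mpg - pred)^2))

# Step 6: predict the new car and bracket it by plus/minus one RMSE
new_car <- data.frame(cyl = 5, disp = 120, hp = 98, gear = 5)
new_pred <- unname(predict(model, new_car))
print(c(lower = new_pred - rmse, estimate = new_pred, upper = new_pred + rmse))

# Plugging the same car into the rounded equation printed in the notes
by_hand <- -0.82 * 5 - 0.04 * 98 + 1.97 * 5 - 0.007 * 120 + 25.705
print(by_hand)  # 26.695
```

The by-hand value of 26.695 is close to the 26.78 used in the notes; the small gap is presumably because the printed coefficients are rounded. The plus/minus RMSE band is a rough rule of thumb rather than a formal interval; predict() on an lm model also accepts interval = "prediction" for that.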

Python – Day3
Day4 – Python
Day5 – SQL

# How to create a database - We can use the CREATE DATABASE command to create
# a database. The syntax is - CREATE DATABASE Databasename;
# NOTE: The default delimiter is ;

# Creating a database with the name SDA
create database SDA;

# To see which databases are available on your server:
# You can use the command show databases;
show databases;

# If you want to access a table from a specific database, you can use
# databasename.tablename
select * from sakila.actor;

# OPTION 2: You can also set your working database to the database from
# which you want to access the table by using the USE command
use sakila;
select * from actor;

# How to drop / delete a database: You can use the DROP command to delete a
# database
drop database SDA;

# I will create the database SDA again and then I will create the table batch3
create database SDA;

# While creating the table you need to specify the following:
# 1) Column name
# 2) Column data type
# 3) Column constraints
# Syntax - create table table_name(column1 datatype constraint, column2 datatype...)

# Let's create a table batch3 in the SDA database
use SDA;

create table batch3(
ID int not null,
Name varchar (20) not null,
Age int not null,
Marks decimal (3,2) not null,
primary key (ID)
);

# In the decimal data type, please give (the total number of digits including the
# decimal digits, the number of decimal digits)

# Inserting records into the table - We can use the insert into command for this
# Syntax: insert into table_name (column1, column2, column3)
# values (value1, value2, value3), (value1, value2, value3);
insert into batch3(ID, Name, Age, Marks)
Values (100, 'Amit', 38, 7.35);

# If you want to insert multiple rows
insert into batch3(ID, Name, Age, Marks)
Values (101, 'Amit', 38, 7.35), (102, 'Raj', 40, 8.35), (103, 'Sunil', 44, 8.35);

# In SQL we will be working with the following key commands:
# 1) Select command - Which will help you to select the fields (columns)
# 2) Where command - Which will help you to filter your data
# 3) Order command - Which will help you to arrange your data
# 4) Group by command - Which will help to group the data at multiple levels

# Select query - This is used to select the fields (columns)
select Sales, Profit from orders;

# If you want to select all the columns
select * from orders;

# We can also select columns with an alias name
select `Product Category` AS PC from orders;

# Where command - This will help you to import the data with a filter condition
# PS1 - Import all the data from the South region
select * from orders
where Region = 'South';

# PS2: Import Product category, Customer Segment, Sales, Profit for sales > 1000
select `Product Category`, `Customer Segment`, Sales, Profit from orders
where Sales > 1000;

# PS3: Import all data where sales > 1000 and Profit is > 500
select * from orders
where Sales > 1000 AND Profit>500;

# PS4: Import all the data where region is South or West
# Solution 1: Using OR with the where command
select * from orders
where Region = 'South' OR Region = 'West';
# Solution 2: Using the IN operator with the where command
select * from orders
where Region IN ('South', 'West');

# Where with the LIKE command - This is a kind of wildcard matching
# PS5: Select all data where the customer name starts with "Aa"
# LIKE command -
# LIKE 'x%' - anything which starts with x
# LIKE '%x%' - anything which contains x at any position
# LIKE '_XY%' - anything which has XY as its 2nd and 3rd characters
select * from orders
where `Customer Name` LIKE 'Aa%';

# PS6: Find all the data which have sales starting with 12.
select * from orders
where Sales like '12%';

# LIMIT command - You can limit the number of records which need to be displayed
select * from orders
where Sales like '12%'
limit 5;

Day6 - SQL

use sda;

# Quick recap - What will be the command to select order ID, profit, sales from the orders
# table under the SDA database
select `Order ID`, Profit, Sales from orders;

# Quick recap 2 - Command for selecting all the columns for the rows where sales is more
# than 1500 $
select * from orders
where Sales > 1500;

# BETWEEN command - This will help you to find the values between two numbers.
# Find all the rows of all columns where sales is between 1000 and 2000
# OPTION 1: I can use the AND operator
select * from orders
where Sales > 1000 AND Sales < 2000;
# OPTION 2: Using the BETWEEN operator
# NOTE: BETWEEN includes both end values, so it matches Sales >= 1000 AND Sales <= 2000
select * from orders
where Sales between 1000 AND 2000;

# We can also use a NOT operator with BETWEEN:
select * from orders
where Sales NOT BETWEEN 1000 and 2000;

# ORDER BY - This is used to sort the data in ascending or descending order based on
# one or more columns
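For readers coming from the R days of these notes, two of the SQL rules above can be checked quickly in base R: BETWEEN keeps both end values, and a sort on several columns can mix ascending and descending keys. This is an R analogue on an invented toy table (toy, sales_values and the other names are assumptions for illustration), not MySQL output.

```r
# Invented values, chosen to sit exactly on the BETWEEN boundaries
sales_values <- c(1000, 1500, 2000, 2500)

# WHERE Sales BETWEEN 1000 AND 2000   (both ends included)
between_rows <- sales_values[sales_values >= 1000 & sales_values <= 2000]
# WHERE Sales > 1000 AND Sales < 2000 (both ends excluded)
strict_rows <- sales_values[sales_values > 1000 & sales_values < 2000]
print(between_rows)  # 1000 1500 2000
print(strict_rows)   # 1500

toy <- data.frame(
  Region = c("West", "South", "West", "South"),
  Sales = c(300, 150, 900, 700),
  stringsAsFactors = FALSE
)

# ORDER BY Sales: ascending is the default, as in SQL
by_sales <- toy[order(toy$Sales), ]

# ORDER BY Region, Sales DESC: first key ascending, second key descending
by_region_sales <- toy[order(toy$Region, -toy$Sales), ]
print(by_region_sales)
```

The two WHERE forms only agree when no row sits exactly on 1000 or 2000.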
# PS: Select Region, Product category, Sales and Profit and order the table in
# 1) Lowest to highest of Sales
# 2) Highest to lowest of Profit

# Solution for 1) Arranging the lowest to highest Sales
select Region, `Product Category`, Sales, Profit from orders
order by Sales;

# Solution for 2) Arranging the highest to lowest of Profit
select Region, `Product Category`, Sales, Profit from orders
order by Profit desc;

# Arrange the same data (used in the above condition) in ascending order of Region and
# descending order of Sales
select Region, `Product Category`, Sales, Profit from orders
order by Region, Sales desc
limit 9105;

# PS: Find the Product name with highest sales from South Region
select `Product Name` from orders
where Region = 'South'
order by Sales desc
limit 1;

# GROUP BY - This is used in collaboration with the select statement to arrange data into
# groups. The GROUP BY clause follows the WHERE clause in the select statement and
# precedes the ORDER BY clause. PLEASE NOTE THAT A GROUP BY COMMAND MUST HAVE A MATH AGGREGATION
# In SQL we have to define the math aggregation within the select command

# PS: Find the total sales for each region
Select Region, sum(Sales) from orders
group by Region;

# If you want to rename the column sum(Sales) as Total Sales
Select Region, sum(Sales) as 'Total Sales' from orders
group by Region;

# If you want to round the values of sum(Sales) to 2 decimal places
Select Region, round(sum(Sales),2) as 'Total Sales' from orders
group by Region;

# PS: Find the average profit from each product category of South Region?
select `Product Category`, round(avg(Profit),2) as 'Avg Profit' from orders
where Region = 'South'
Group by `Product Category`;

# Find the Customer with highest number of orders?
select `Customer ID`, count(distinct `Order ID`) as Total_orders from orders
group by `Customer ID`
order by Total_orders desc
limit 1;

# PS: Find the total number of orders placed from California

select `State or Province`, count(distinct `Order ID`) as Total from orders
where `State or Province` = 'California'
group by `State or Province`;

# PS: Find the product Name with the highest total Sales to Profit Ratio?
# Basic calculated fields can be declared in the select statement itself
# (total sales divided by total profit; sum(Sales / Profit) would instead add up
# the per-row ratios)

select `Product Name`, sum(Sales) / sum(Profit) as Ratio from orders
group by `Product Name`
order by Ratio DESC
limit 1;

# PS: Find the product sub-categories whose total sales
# is more than 10000 for the South Region

# NOTE: When you have to apply multiple filters, you can use the WHERE command if the filter
# is applied before the group by command and the HAVING command if the filter is applied
# after group by.

select `Product Sub-Category`, round(sum(Sales),2) as Total from orders
where Region = 'South'
group by `Product Sub-Category`
Having Total>10000;

# PS: Find the customer list from the corporate segment of California who have placed more
# than 5 orders?

select `Customer ID`, `Customer Name`, count(distinct `Order ID`) as Total from orders
where `Customer Segment` = 'Corporate' AND `State or Province` = 'California'
group by `Customer ID`, `Customer Name`
having Total > 5;

# PS: Find the rows where the sales is greater than the overall average sales
# NOTE: We use sub-queries where the output of one query is given as an input to another
# query. As we cannot store the query under a variable name, we need to write a
# sub-query within the main query

select round(avg(Sales),0) from orders; # Q1

select Sales from orders
where Sales > (select round(avg(Sales),0) from orders);

# PS: Find the list of customers where the total revenue given by the customer for
# the South region is less than the average sales of the South region?
# Solution:

select `Customer ID`, `Customer Name`, sum(Sales) as Total from orders
where Region = 'South'
group by `Customer ID`, `Customer Name`
having Total < (select round(avg(Sales),0) from orders
where Region = 'South');

################################ JOINS IN SQL ##################################
# We have INNER JOIN, LEFT JOIN, RIGHT JOIN in SQL
# Let's first see how to perform an inner join

select orders.`Order ID`, returns.Status
from orders
inner join returns on orders.`Order ID` = returns.`Order ID`;

# Find the number of orders returned from each region

select orders.Region, count(distinct orders.`Order ID`)
from orders
inner join returns on orders.`Order ID` = returns.`Order ID`
group by orders.Region;

# PS: Find the list of order IDs which were not returned?

select orders.`Order ID`, returns.Status
From orders
left join returns ON orders.`Order ID` = returns.`Order ID`
where returns.Status IS NULL;
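The order of operations behind WHERE, GROUP BY and HAVING, and the left join used to find orders that were never returned, can be traced step by step in base R, the other language of these notes. This is only an analogue on invented tables (toy_orders, toy_returns and their columns are assumptions for illustration); the SQL statements in the notes are still the ones to run against the database.

```r
# Invented stand-ins for the orders and returns tables
toy_orders <- data.frame(
  order_id = c(1, 2, 3, 4, 5),
  Region = c("South", "South", "South", "West", "South"),
  SubCategory = c("Chairs", "Chairs", "Paper", "Chairs", "Paper"),
  Sales = c(6000, 5000, 800, 9000, 700),
  stringsAsFactors = FALSE
)
toy_returns <- data.frame(
  order_id = c(2, 4),
  Status = "Returned",
  stringsAsFactors = FALSE
)

# WHERE Region = 'South': a row filter applied before grouping
south <- subset(toy_orders, Region == "South")
# GROUP BY SubCategory with sum(Sales)
totals <- aggregate(Sales ~ SubCategory, data = south, FUN = sum)
# HAVING Total > 10000: a group filter applied after aggregation
big <- subset(totals, Sales > 10000)
print(big)

# LEFT JOIN ... WHERE returns.Status IS NULL: keep every order, then
# keep only the ones with no matching row in the returns table
joined <- merge(toy_orders, toy_returns, by = "order_id", all.x = TRUE)
not_returned <- joined$order_id[is.na(joined$Status)]
print(not_returned)  # 1 3 5
```

The West order for Chairs (9000) never reaches the aggregation because the row filter removes it first, which is the distinction the WHERE versus HAVING note is making.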
