0% found this document useful (0 votes)

14 views

Uni T - 2 - R Programming

Uploaded by

anju.k10301

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views

Uni T - 2 - R Programming

Uploaded by

anju.k10301

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Uni t -2 - R Programming

2.1. Summary Function in R

• The summary() function in R can be used to quickly summarize the values in a vector, data
frame, regression model.

• This syntax uses the following basic syntax:

summary(data)

• The summary() function automatically calculates the following summary statistics for the
vector:

• Min: The minimum value

• 1st Qu: The value of the 1st quartile (25th percentile)

• Median: The median value

• 3rd Qu: The value of the 3rd quartile (75th percentile)

• Max: The maximum value

Using summary() with Vector

• The following code shows how to use the summary() function to summarize the values in a
vector

Using summary() with Data Frame

• The following code shows how to use the summary() function to summarize every column in
a data frame:
Using summary() with Specific Data Frame Columns
• The following code shows how to use the summary() function to summarize specific columns
in a data frame:

2.2. Logistic Regression

• Logistic regression in R Programming is a classification algorithm used to find the probability
of event success and event failure.

• Logistic regression is a statistical method used to predict the outcome of a categorical

dependent variable (target variable) based on one or more predictor variables. It is a type of
regression analysis used for predicting the outcome of a binary dependent variable (i.e., a
variable that has only two possible outcomes, such as 0/1, yes/no, etc.).

• Logistic regression in R can be performed using the glm() function, which fits generalized
linear models.
Logistic regression is commonly used in various fields, including:

• Medical Research: Predicting disease outcomes, treatment responses, or patient survival.

• Marketing: Predicting customer satisfaction, response to marketing campaigns, or credit risk.

• Finance: Predicting credit risk, loan defaults, or stock prices.

• Social sciences: Predicting voter behavior, crime rates, or social network outcomes.

• The logistic regression model can be represented mathematically as:

p = 1 / (1 + e^(-z))

• There is the following syntax of the glm() function.

glm(formula, data, family)

• Formula - It is a symbol which represents the relationship b/w the variables.

• Data - It is the dataset giving the values of the variables.

• Family - An R object which specifies the details of the model, and its value is binomial for
logistic regression.

2.3. Confusion Matrix

• The Confusion Matrix is a type of matrix that is used to visualize the predicted values against
the actual Values.

• The row headers in the confusion matrix represent predicted values and column headers are
used to represent actual values.

• A confusion matrix is a table used to evaluate the performance of a classification model in R

programming. It summarizes the predictions against the actual outcomes.

• The Confusion matrix contains four cells as shown in the below image.
• True Positive – Indicates how many positive values are predicted as positive only by the
model.

• False Positive – Indicates how many negative values are predicted as positive values by the
model.

• False Negative – Indicates how many positive values are predicted as negative values by the
model.

• True Negative – Indicates how many negative values are predicted as negative only by the
model.

ConfusionMatrix() function

• In R Programming the Confusion Matrix can be visualized using confusionMatrix() function

which is present in the caret package.

Syntax: confusionMatrix(data, reference, positive = NULL, dnn = c(“Prediction”, “Reference”))

where

• Data - a factor of predicted classes.

• reference - a factor of classes to be used as the true results.

• positive(optional) - an optional character string for the factor level.

• dnn(optional) - a character vector of dimnames for the table.

Factor()

• Factors are data structures that are implemented to categorize the data or represent
categorical data and store it on multiple levels.

Example 1

1 - Example 2

2.4. Calculate Sensitivity, Specificity in CARET

• The ‘caret’ package is stands for Classification and Regression Training.

• Sensitivity and specificity are performance metrics used to evaluate the accuracy of a
classification model in R programming.

• The goal of a classification model is to learn the relationship between the input features and
the target class label, so that it can make accurate predictions on new, unseen data.

Here's how to calculate them:

Sensitivity (True Positive Rate):

• Definition: Proportion of true positives (correctly predicted instances) among all actual
positive instances.

• Formula: sensitivity = TP / (TP + FN)

• R code: sensitivity = sum(true positives) / (sum(true positives) + sum(false negatives))

Specificity (True Negative Rate):

• Definition: Proportion of true negatives (correctly predicted instances) among all actual
negative instances.

• Formula: specificity = TN / (TN + FP)

• R code: specificity = sum(true negatives) / (sum(true negatives) + sum(false positives))

Where:

• TP = True Positives (correctly predicted positive instances)

• TN = True Negatives (correctly predicted negative instances)

• FP = False Positives (incorrectly predicted positive instances)

• FN = False Negatives (incorrectly predicted negative instances)

•

2.4. ROC Curve

• A Receiver Operating Characteristic (ROC) curve is a graphical representation of the

performance of a binary classification model. It plots the True Positive Rate (TPR) against the
False Positive Rate (FPR) at different threshold settings.

Here's a breakdown of the ROC curve:

• True Positive Rate (TPR): The proportion of actual positive instances correctly identified by
the model.

• False Positive Rate (FPR): The proportion of actual negative instances incorrectly identified
as positive by the model.

• Threshold: The cutoff value used to determine whether a prediction is positive or negative.

ROC curves are useful for:

• Evaluating model performance

• Comparing different models

• Selecting optimal threshold values

• Identifying bias in models

Common ROC curve metrics include:

• AUC (Area Under the Curve)

• Accuracy

• Precision

• Recall (Sensitivity)

• Specificity

• F1-score

2.5. Recitation

• Reiteration: Repeating a process or operation, often using loops (e.g., for, while, repeat).

• Recursion: A function calling itself repeatedly until a base case is reached.

Data Types and Structures in R

• Basic Data Types:

- Numeric: Represents real numbers, e.g., 2.5, 10

- Integer: Whole numbers, e.g., 2L, 10L (where L indicates an integer)

- Character: Text or string data, e.g., "hello", "R programming"

- Logical: Boolean values, TRUE and FALSE

- Factor: Categorical variables, e.g., factor(c("yes", "no", "yes"))

• Data Structures:

- Vector: A sequence of elements of the same data type, created using c()

- Matrix: A two-dimensional array of elements of the same data type, created

using matrix()
- Array: A multi-dimensional collection of elements

- Data Frame: A table where columns can have different data types, created using
data.frame()

- List: A collection that can contain elements of different types

2. Basic Operators

• Arithmetic Operators: +, -, *, /, %% (modulus), %/% (integer division)

• Relational Operators: == (equals), != (not equal), <, >, <=, >=

• Logical Operators: & (and), | (or), ! (not)

• Assignment Operators: <-, = for assigning values to variables; <<- for global assignment
within functions

my_function <- function(arg1, arg2) {

3. Control Structures

• Conditional Statements:

- if (condition) { ... }: Executes code if the condition is TRUE

- ifelse(condition, true_value, false_value): Vectorized if-else statement

• Loops:

- for (variable in sequence) { ... }: Repeats code for each element in the sequence

- while (condition) { ... }: Repeats code while the condition is TRUE

- repeat { ...; break }: Repeats code until a break condition is met

4. Functions

• Defining Functions: Functions are defined using function(), allowing code to be reused:

Copy code

my_function <- function(arg1, arg2) {

# Code block
return(result)

• Scope of Variables: Variables defined within functions are local by default, unless assigned
globally using <<-

5. Data Manipulation

• Data Frames:

- Access columns with $ (e.g., df$column_name)

- Subset data frames with df[row, column] indexing

• dplyr Package: Provides easy-to-use functions for data manipulation:

- filter(): Select rows based on conditions

- select(): Choose specific columns

- mutate(): Add new columns or modify existing ones

- summarize(): Calculate summary statistics

- group_by(): Group data for grouped calculations

(eBook PDF) Biocalculus: Calculus, Probability, and Statistics for the Life Sciencesinstant download
100% (4)
(eBook PDF) Biocalculus: Calculus, Probability, and Statistics for the Life Sciencesinstant download
53 pages
CSEC Maths 2022 January Past Paper Solutions
100% (1)
CSEC Maths 2022 January Past Paper Solutions
40 pages
ESE500 HW3 Solutions
No ratings yet
ESE500 HW3 Solutions
7 pages
BS 3424 - 5 PDF
100% (1)
BS 3424 - 5 PDF
12 pages
Unit 2
No ratings yet
Unit 2
32 pages
r 2m
No ratings yet
r 2m
34 pages
R Programming 101 Part 1
No ratings yet
R Programming 101 Part 1
53 pages
R Programming
No ratings yet
R Programming
50 pages
Week2 Slides
No ratings yet
Week2 Slides
76 pages
saurabh
No ratings yet
saurabh
22 pages
2 Functions
No ratings yet
2 Functions
49 pages
data analysis in r
No ratings yet
data analysis in r
10 pages
Document (1)
No ratings yet
Document (1)
32 pages
R study material I
No ratings yet
R study material I
8 pages
lec_09
No ratings yet
lec_09
16 pages
Functions and Packages
No ratings yet
Functions and Packages
7 pages
Lec 4
No ratings yet
Lec 4
18 pages
R programing
No ratings yet
R programing
12 pages
Introdution to R - Network Analysis_ Practical 1 - Sacha Epskamp - University of Amsterdam, 2013
No ratings yet
Introdution to R - Network Analysis_ Practical 1 - Sacha Epskamp - University of Amsterdam, 2013
34 pages
Data_analysis_with_R _24
No ratings yet
Data_analysis_with_R _24
47 pages
R
No ratings yet
R
13 pages
STTN 225 R Summary
No ratings yet
STTN 225 R Summary
18 pages
First Course On R
No ratings yet
First Course On R
26 pages
Basics: TH TH TH TH TH TH TH
No ratings yet
Basics: TH TH TH TH TH TH TH
3 pages
02 Functions in R
No ratings yet
02 Functions in R
24 pages
Statistics Using R Language
No ratings yet
Statistics Using R Language
5 pages
Lecture 1
No ratings yet
Lecture 1
35 pages
Basics of R Programming - Part 2
No ratings yet
Basics of R Programming - Part 2
7 pages
R WorkSamples
No ratings yet
R WorkSamples
44 pages
Boulder Handout 2019
No ratings yet
Boulder Handout 2019
187 pages
Data Analysis Using R and Vectors
No ratings yet
Data Analysis Using R and Vectors
35 pages
Advance Stats
No ratings yet
Advance Stats
233 pages
R Lab File Deepak
No ratings yet
R Lab File Deepak
27 pages
UNIT 2
No ratings yet
UNIT 2
101 pages
Sta238 Wks - Week1+2
No ratings yet
Sta238 Wks - Week1+2
35 pages
MIT 302 - Statistical Computing II - Tutorial 01
No ratings yet
MIT 302 - Statistical Computing II - Tutorial 01
6 pages
N2 Data in R
No ratings yet
N2 Data in R
7 pages
Unit 2 R
No ratings yet
Unit 2 R
16 pages
Essential R
No ratings yet
Essential R
261 pages
r programming built in functions
No ratings yet
r programming built in functions
8 pages
ProgrammingForDS14_Rbasics
No ratings yet
ProgrammingForDS14_Rbasics
32 pages
Capital Gains
No ratings yet
Capital Gains
8 pages
Introduction To R Installation: Data Types Value Examples
No ratings yet
Introduction To R Installation: Data Types Value Examples
9 pages
R Programming Slides
No ratings yet
R Programming Slides
73 pages
Introduction to R for Business Analytics(1)
No ratings yet
Introduction to R for Business Analytics(1)
7 pages
R Programming Notes
No ratings yet
R Programming Notes
23 pages
R PROGRAMMING
No ratings yet
R PROGRAMMING
13 pages
Da Session 4
No ratings yet
Da Session 4
75 pages
DA_Lab_Week-2
No ratings yet
DA_Lab_Week-2
22 pages
Useful R Functions-1
No ratings yet
Useful R Functions-1
4 pages
RBasics Handout
No ratings yet
RBasics Handout
6 pages
Introduction To R PDF
No ratings yet
Introduction To R PDF
56 pages
Data in R
No ratings yet
Data in R
7 pages
CRM Cheat Sheet
No ratings yet
CRM Cheat Sheet
7 pages
R 5 Marks
No ratings yet
R 5 Marks
11 pages
Glocal University: Practical File of R Programming
100% (1)
Glocal University: Practical File of R Programming
32 pages
Writing Simple Functions in R Bootstrapping
No ratings yet
Writing Simple Functions in R Bootstrapping
17 pages
MD115 Wk01
No ratings yet
MD115 Wk01
67 pages
R Tutorial
No ratings yet
R Tutorial
15 pages
Data Analysis Using R - 5
No ratings yet
Data Analysis Using R - 5
9 pages
Bdo Co1 Session 4
No ratings yet
Bdo Co1 Session 4
43 pages
An Introduction To R: Biostatistics 615/815
No ratings yet
An Introduction To R: Biostatistics 615/815
59 pages
R Workshop Material 18-19, Oct-2023
No ratings yet
R Workshop Material 18-19, Oct-2023
67 pages
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
Detailed Lesson Plan in English 9 I. Objectives: Teacher"s Activity Students' Activity
No ratings yet
Detailed Lesson Plan in English 9 I. Objectives: Teacher"s Activity Students' Activity
5 pages
Quadratic Equations
No ratings yet
Quadratic Equations
12 pages
Answers For Odd-Numbered Problems
No ratings yet
Answers For Odd-Numbered Problems
7 pages
To Read Carefully: Xavier Jouve, PHD, Fcs
No ratings yet
To Read Carefully: Xavier Jouve, PHD, Fcs
8 pages
Mechanics of Solids-I: Module-3
No ratings yet
Mechanics of Solids-I: Module-3
29 pages
Question Bank M Tech 2ND Sem Batch 2018
No ratings yet
Question Bank M Tech 2ND Sem Batch 2018
31 pages
15me03 Thermodynamics Problems June2017
100% (1)
15me03 Thermodynamics Problems June2017
19 pages
Lecture - 4 - Modeling of DC Machines
No ratings yet
Lecture - 4 - Modeling of DC Machines
23 pages
Tree Concepts & Definitions Graph
No ratings yet
Tree Concepts & Definitions Graph
31 pages
DIAGNOSTIC TEST-math7
No ratings yet
DIAGNOSTIC TEST-math7
4 pages
16 11 24 - JR.C 120 - Jee Main - WTM 21 - Q.Paper
No ratings yet
16 11 24 - JR.C 120 - Jee Main - WTM 21 - Q.Paper
18 pages
Excel CheatSheet The Microsoft Excel Formulas Cheat Sheet
No ratings yet
Excel CheatSheet The Microsoft Excel Formulas Cheat Sheet
5 pages
Practice Sheet System of Particles and Centre of Mass Anil Sir Vinay
No ratings yet
Practice Sheet System of Particles and Centre of Mass Anil Sir Vinay
7 pages
Point Swizzerling-An Approach by Example
No ratings yet
Point Swizzerling-An Approach by Example
13 pages
Determination of Conductivity of Water
No ratings yet
Determination of Conductivity of Water
5 pages
Cambridge International Advanced Subsidiary and Advanced Level
No ratings yet
Cambridge International Advanced Subsidiary and Advanced Level
12 pages
3rd Sem Marksheet
No ratings yet
3rd Sem Marksheet
2 pages
Weber 2017 LogFunctions
No ratings yet
Weber 2017 LogFunctions
8 pages
Levene's Test
No ratings yet
Levene's Test
2 pages
Heavenly Chocolates Web Site Transactions Case Study
No ratings yet
Heavenly Chocolates Web Site Transactions Case Study
12 pages
Lecture 2: Active and Passive Circuit Elements (Resistors Only)
No ratings yet
Lecture 2: Active and Passive Circuit Elements (Resistors Only)
7 pages
Chapter 2 Force and Motion TEACHER's GUIDE
No ratings yet
Chapter 2 Force and Motion TEACHER's GUIDE
44 pages
MATHAROO Worksheet LP - 40 22: Student Name: - Grade: - Date
No ratings yet
MATHAROO Worksheet LP - 40 22: Student Name: - Grade: - Date
5 pages
Automation and Robotics: Prepared by G.Harsha Vardhini
No ratings yet
Automation and Robotics: Prepared by G.Harsha Vardhini
26 pages
DL Question Bank Answers
No ratings yet
DL Question Bank Answers
55 pages
Volume 1-Conference ICCS-X
No ratings yet
Volume 1-Conference ICCS-X
589 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.