0% found this document useful (0 votes)

234 views

SAS R::: Cheat Sheet

This guide introduces SAS users to R by providing examples that make use of the tidyverse collection of packages. It covers importing and manipulating data frames, conditional filtering, combining datasets, counting and summarizing data, sorting, and dealing with strings. Examples are provided for common tasks like creating new variables, conditional editing, plotting, and more. Keyboard shortcuts for some R operations like assignment and piping are also included.

Uploaded by

Vijay Puram

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

234 views

SAS R::: Cheat Sheet

Uploaded by

Vijay Puram

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

SAS <-> R :: CHEAT SHEET

Introduction New variables, conditional editing Some plotting in R

This guide aims to familiarise SAS users with R. data new_data; new_data <- old_data %>% ggplot( my_data , aes( year , sales ) ) +
R examples make use of tidyverse collection of packages. set old_data; mutate(total_income = wages + benefits) geom_point( ) + geom_line( )
total_income = wages + benefits ;
Install tidyverse: install.packages("tidyverse")
run;
Attach tidyverse packages for use: library(tidyverse)
R data here in ‘data frames’, and occasionally vectors (via c( ) ) data new_data; new_data <- old_data %>%
Other R structures (lists, matrices…) are not explored here. set old_data; mutate(full_time = if_else(hours > 30 , "Y" , "N"))
if hours > 30 then full_time = "Y";
Keyboard shortcuts: <- Alt + - %>% Ctrl + Shift + m else full_time = "N";
run;
Datasets; drop, keep & rename variables data new_data; new_data <- old_data %>%
ggplot( my_data , aes( year , sales ) ) +
geom_point( ) + geom_line( ) + ylim(0, 40) +
set old_data; mutate(weather = case_when( labs(x = "" , y = "Sales per year")
data new_data; new_data <- old_data if temp > 20 then weather = "Warm"; temp > 20 ~ "Warm",
set old_data; else if temp > 10 then weather = "Mild"; temp > 10 ~"Mild",
run; else weather = "Cold"; TRUE ~ "Cold" ) )
run;
data new_data (keep=id); new_data <- old_data %>%
set old_data (drop=job_title) ; select(-job_title) %>%
run; select(id) Counting and Summarising
data new_data (drop= temp: ); new_data <- old_data %>% proc freq data = old_data ; old_data %>%
set old_data; select( -starts_with("temp") ggplot(my_data, aes( year, sales, colour = dept) ) +
table job_type ; count( job_type )
run; For percent, add: geom_point( ) + geom_line( )
C.f. contains( ) , ends_with( ) run;
%>% mutate(percent = n*100/sum(n))
data new_data; new_data <- old_data %>% proc freq data = old_data ; old_data %>%
set old_data; rename(new_name = old_name) table job_type*region ; count( job_type , region )
rename old_name = new_name; run;
run; Note order differs
proc summary data = old_data nway ; new_data <- old_data %>%
Conditional filtering class job_type region ;
output out = new_data ;
group_by( job_type , region ) %>%
summarise( Count = n( ) )
run; ggplot( my_data , aes( year, sales, fill = dept) ) +
data new_data; new_data <- old_data %>% Equivalent without nway not trivially produced geom_col( )
set old_data; filter(Sex == "M") proc summary data = old_data nway ; new_data <- old_data %>%
if Sex = "M"; class job_type region ; group_by( job_type , region ) %>%
run; var salary ; summarise( total_salaries = sum( salary ) ,
output out = new_data Count = n( ) )
data new_data; new_data <- old_data %>% sum( salary ) = total_salaries ;
set old_data; filter(year %in% Lots of summary functions in both languages
run;
if year in (2010,2011,2012); c(2010,2011,2012)) Swap summarise( ) for mutate( ) to add summary data to original data
run;
Combining datasets Note ‘colour’ for lines & points, ‘fill’ for shapes

data new_data; new_data <- old_data %>% ggplot( my_data , aes( year, sales, fill = dept) ) +
set old_data; group_by( id ) %>% geom_col( position = "dodge" ) + coord_flip( )
data new_data ; new_data <- bind_rows( data_1 , data_2 )
by id ; slice(1) set data_1 data_2 ;
if first.id ; run;
run; C.f. rbind( ) which produces error if columns are not identical
Could use slice(n( )) for last
data new_data ; new_data <- left_join( data_1 , data_2 , by = "id")
data new_data; new_data <- old_data %>% merge data_1 (in= in_1) data_2 ;
set old_data; filter(dob > as.Date("1990-04-25")) by id ;
if dob > "25APR1990"d; if in_1 ;
run; run; C.f. full_join( ) , right_join( ) , inner_join( ) C.f. position = "fill" for 100% stacked bars/cols

CC BY SA Brendan O’Dowd • brendanjodowd@gmail.com • Updated 2021-09

Sorting and Row-Wise Operations Dealing with strings
proc sort data=old_data out=new_data; new_data <- old_data %>% data new_data; new_data <- old_data %>%
by id descending income ; arrange( id , desc( income ) ) set old_data; filter( str_detect( job_title , "Health" ))
run; if find( job_title , "Health" );
run;
proc sort data=old_data nodup; old_data <- old_data %>%
by id job_type; arrange( id , job_type)) %>% data new_data; new_data <- old_data %>%
run; distinct( ) set old_data; filter( str_detect( job_title , "^Health" ))
if job_title =: "Health" ;
Note nodup relies on adjacency of duplicate rows, distinct( ) does not
run;
Use ^ for start of string, $ for end of string, e.g. "Health$"
proc sort data=old_data nodupkey; old_data <- old_data %>%
by id ; arrange( id ) %>% data new_data; new_data <- old_data %>%
run; group_by( id ) %>% set old_data; mutate( substring = str_sub( big_string , 3 , 6 ))
slice( 1 ) substring = substr( big_string , 3 , 4 );
run;
Returns characters 3 to 6. Note SAS uses <start>, <length>, R uses <start>, <end>
data new_data; new_data <- old_data %>%
set old_data; group_by( id ) %>% data new_data; new_data <- old_data %>%
by id descending income ; slice(which.max( income )) set old_data; mutate( address = str_replace_all( address , "Street" , "St" ))
C.f.which.min( )
if first.id ; address = tranwrd( address , "Street" , "St" );
Swap to preserve duplicate maxima: … slice.max( income )
run; run;
Alternatively: … filter(income==max(income)) C.f. str_replace( ) for first instance of pattern only

data new_data; new_data <- old_data %>% data new_data; new_data <- old_data %>%
set old_data; mutate( prev_id = lag( id , 1 )) set old_data; mutate( full_name = str_c( first_name , surname , sep = " " ))
prev_id= lag( id ); full_name = catx(" " , first_name , surname );
run; run;
C.f. lead( ) for subsequent rows Drop sep = " " for equivalent to cats( ) in SAS
data new_data; new_data <- old_data %>% data new_data; new_data <- old_data %>%
set old_data; group_by( id ) %>% set old_data; mutate( first_word = word( sentence , 1 ))
by id; mutate( counter = row_number( ) ) first_word = scan( sentence , 1 );
counter +1 ; run;
R example preserves punctuation at the end of words, SAS doesn’t
if first.id then counter = 1;
run; data new_data; new_data <- old_data %>%
set old_data; mutate( house_number = str_extract( address , "\\d*" ))
house_number = compress( address , , "dk" );
Converting and Rounding run;
Wide range of regexps in both languages, this example extracts digits only

data new_data; new_data <- old_data %>%

set old_data ; mutate(num_var = as.numeric("5" )) %>% File operations
num_var = input("5" , 8. ); mutate(text_var = as.character( 5 ))
text_var = put( 5 , 8. ); Operate in ‘Work’ library. Operate in a particular ‘working directory’ (identify using getwd( ) )
run; Use libname to define file locations Move to other locations using setwd( )

data new_data ; new_data <- old_data %>% libname library_name "file_location"; save(data_in_use , file="file_location/saved_data.rda")
set old_data; mutate(nearest_5 = round(x/5)*5) %>% data library_name.saved_data; or
nearest_5 = round( x , 5 ) mutate(two_decimals = round( x , digits = 2) set data_in_use; setwd("file_location")
two_decimals = round( x , 0.01) run; save( data_in_use , file = "saved_data.rda")
run;
libname library_name "file_location"; load("file_location/saved_data.rda" )
Creating functions to modify datasets data data_in_use ;
set library_name.saved_data ;
or
setwd("file_location")
save( ) can store multiple data frames in a
run; load("saved_data.rda")
%macro add_variable(dataset_name); add_variable <- function( dataset_name ){ single .rda file, load( ) will restore all of these
data &dataset_name; dataset_name <- dataset_name %>% proc import datafile = "my_file.csv" my_data <- read_csv("my_file.csv")
set &dataset_name; mutate(new_variable = 1) out = my_data dbms = csv;
new_variable = 1; return( dataset_name ) run;
run; } Both examples assume column headers in csv file
%mend; my_data <- add_variable( my_data )
%add_variable( my_data );
Note SAS can modify within the macro,
whereas R creates a copy within the function
CC BY SA Brendan O’Dowd • brendanjodowd@gmail.com • Updated 2021-09

Nas 509
100% (1)
Nas 509
4 pages
Sas R
No ratings yet
Sas R
2 pages
Presentation 1
No ratings yet
Presentation 1
34 pages
Big Data - Lab 3
No ratings yet
Big Data - Lab 3
25 pages
Econ6067 R (I) 2022
No ratings yet
Econ6067 R (I) 2022
22 pages
R Studio Notes
No ratings yet
R Studio Notes
10 pages
R Tutorial #1: Applied Econometrics (Econ3005)
No ratings yet
R Tutorial #1: Applied Econometrics (Econ3005)
21 pages
Unit2
No ratings yet
Unit2
76 pages
Data Transformation
No ratings yet
Data Transformation
1 page
R Course Own English HS
No ratings yet
R Course Own English HS
70 pages
MBA Sem 1 Unit 3 Fundamentals of R (1)
No ratings yet
MBA Sem 1 Unit 3 Fundamentals of R (1)
41 pages
Introduction To R PDF
No ratings yet
Introduction To R PDF
56 pages
Da Session 4
No ratings yet
Da Session 4
75 pages
Tutorial 1
No ratings yet
Tutorial 1
29 pages
Data Manipulation Workshop Handout
No ratings yet
Data Manipulation Workshop Handout
46 pages
BIO259 Note
No ratings yet
BIO259 Note
55 pages
R Studio
No ratings yet
R Studio
8 pages
R Program Record Book Iba
No ratings yet
R Program Record Book Iba
24 pages
02 Data Processing
No ratings yet
02 Data Processing
65 pages
Lab1 411 Eman Yahya 7773225
No ratings yet
Lab1 411 Eman Yahya 7773225
16 pages
CRM Cheat Sheet
No ratings yet
CRM Cheat Sheet
7 pages
R study material I
No ratings yet
R study material I
8 pages
Data Management II
No ratings yet
Data Management II
15 pages
Getting Started With R
No ratings yet
Getting Started With R
155 pages
STATA - Subject Table of Contents
No ratings yet
STATA - Subject Table of Contents
15 pages
WWWWWW WWWWWW WWWWWW WWWWWW WWWW WWWW WWWWWW: Data Transformation With Dplyr
No ratings yet
WWWWWW WWWWWW WWWWWW WWWWWW WWWW WWWW WWWWWW: Data Transformation With Dplyr
2 pages
R Commands
No ratings yet
R Commands
18 pages
Week 7
No ratings yet
Week 7
10 pages
All Codes
No ratings yet
All Codes
10 pages
Rtips. Revival 2012!: Paul E. Johnson June 8, 2012
No ratings yet
Rtips. Revival 2012!: Paul E. Johnson June 8, 2012
72 pages
My Learning From Data Science Classes
No ratings yet
My Learning From Data Science Classes
16 pages
CH 3
No ratings yet
CH 3
33 pages
Week 1-3
No ratings yet
Week 1-3
17 pages
DSCI 100 Cheat Sheet
No ratings yet
DSCI 100 Cheat Sheet
3 pages
A Short List of The Most Useful R Commands
No ratings yet
A Short List of The Most Useful R Commands
11 pages
RSTUDIO
No ratings yet
RSTUDIO
44 pages
Chapter - 03 - Review of Basic Data
No ratings yet
Chapter - 03 - Review of Basic Data
92 pages
R_intro2021
No ratings yet
R_intro2021
23 pages
Programming With R: Lecture #4
No ratings yet
Programming With R: Lecture #4
34 pages
R For Machine Learning Lab Practical Work: Master of Business Administration in Business Analytics
0% (1)
R For Machine Learning Lab Practical Work: Master of Business Administration in Business Analytics
9 pages
DSR LAB MANUAL - 10 programs
No ratings yet
DSR LAB MANUAL - 10 programs
34 pages
Practical File R by Komal
No ratings yet
Practical File R by Komal
26 pages
Basics: TH TH TH TH TH TH TH
No ratings yet
Basics: TH TH TH TH TH TH TH
3 pages
Lecture 1
No ratings yet
Lecture 1
35 pages
Beginner Guide To R and R Studio V1
No ratings yet
Beginner Guide To R and R Studio V1
27 pages
Session For R
No ratings yet
Session For R
23 pages
Stat A Tutorial
No ratings yet
Stat A Tutorial
40 pages
Base-R
No ratings yet
Base-R
9 pages
Basic Data Science With R
100% (1)
Basic Data Science With R
364 pages
R-Basics.knit (1)
No ratings yet
R-Basics.knit (1)
13 pages
DSDA MANUAL
No ratings yet
DSDA MANUAL
64 pages
R Syntax Examples 1
No ratings yet
R Syntax Examples 1
6 pages
R Cheat Sheet (Updated)
No ratings yet
R Cheat Sheet (Updated)
13 pages
DR - Pierpaolo-Delser - Introduction R
No ratings yet
DR - Pierpaolo-Delser - Introduction R
83 pages
An R Tutorial Starting Out
No ratings yet
An R Tutorial Starting Out
9 pages
Data Transformation With Data - Table: Cheat Sheet
No ratings yet
Data Transformation With Data - Table: Cheat Sheet
2 pages
Amazing Java: Learn Java Quickly
From Everand
Amazing Java: Learn Java Quickly
Andrei Besedin
No ratings yet
C Language Programming Codes
From Everand
C Language Programming Codes
Durgesh
No ratings yet
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Digital and Microprocessor Techniques V10
From Everand
Digital and Microprocessor Techniques V10
Clive W. Humphris
No ratings yet
ID Checking Guidelines For Standard and Enhanced DBS Checks
No ratings yet
ID Checking Guidelines For Standard and Enhanced DBS Checks
16 pages
Payment Application Form: Applicant'S Particulars
No ratings yet
Payment Application Form: Applicant'S Particulars
2 pages
Dbs List of Acceptable Identification: WWW - Gov.uk
No ratings yet
Dbs List of Acceptable Identification: WWW - Gov.uk
2 pages
DBS (Disclosure and Barring Service) Document Checklist
No ratings yet
DBS (Disclosure and Barring Service) Document Checklist
1 page
SA III Grade 5 English
No ratings yet
SA III Grade 5 English
7 pages
Latsco Application Tubiera, Jc Moneses
No ratings yet
Latsco Application Tubiera, Jc Moneses
3 pages
Cure Activators To Improve Both ECO-Balance and Economics of A Cure System
No ratings yet
Cure Activators To Improve Both ECO-Balance and Economics of A Cure System
15 pages
BBC Products 1
No ratings yet
BBC Products 1
51 pages
Frankenstein - Pauline Francis
No ratings yet
Frankenstein - Pauline Francis
56 pages
CW & Acw
No ratings yet
CW & Acw
1 page
Tariel Kapanadze Patent WO2008103130A1
No ratings yet
Tariel Kapanadze Patent WO2008103130A1
12 pages
Practice Test 01
No ratings yet
Practice Test 01
5 pages
Jurnal Internasional 1
No ratings yet
Jurnal Internasional 1
11 pages
Software Engineer - Job Description
No ratings yet
Software Engineer - Job Description
2 pages
Unknown 2
No ratings yet
Unknown 2
6 pages
Trigonometry - Lesson 1 To 3
No ratings yet
Trigonometry - Lesson 1 To 3
29 pages
Islcollective Worksheets Preintermediate A2 Intermediate b1 Upperintermediate b2 Adult High School Reading Spe My Table 109954f3e6f32d56084 38193480
No ratings yet
Islcollective Worksheets Preintermediate A2 Intermediate b1 Upperintermediate b2 Adult High School Reading Spe My Table 109954f3e6f32d56084 38193480
2 pages
7699-Văn Bản Của Bài Báo-12940-1-10-20230105
No ratings yet
7699-Văn Bản Của Bài Báo-12940-1-10-20230105
9 pages
Aplio I800 - General Imaging
No ratings yet
Aplio I800 - General Imaging
20 pages
sample paper maths 1
No ratings yet
sample paper maths 1
8 pages
Glovebox Guide To Evs Esf
No ratings yet
Glovebox Guide To Evs Esf
20 pages
Salary Guide - UAE in 2023
100% (1)
Salary Guide - UAE in 2023
14 pages
How Should Heritage Decisions Be Made?: Increasing Participation From Where You Are
100% (2)
How Should Heritage Decisions Be Made?: Increasing Participation From Where You Are
35 pages
Building Economics Life Cycle Cost Analysis
No ratings yet
Building Economics Life Cycle Cost Analysis
4 pages
2021년_중2_NE능률(김성곤)_2과_[13]주관대 2
No ratings yet
2021년_중2_NE능률(김성곤)_2과_[13]주관대 2
13 pages
Siemens PLC Wiring
100% (1)
Siemens PLC Wiring
16 pages
Notes To Pharmacovigilance
100% (1)
Notes To Pharmacovigilance
58 pages
MATHS KS-3 Pupil Book 1.2
No ratings yet
MATHS KS-3 Pupil Book 1.2
25 pages
Product Information: Toshiba X-Ray Tube D-054 / D-054S / D-054SB
No ratings yet
Product Information: Toshiba X-Ray Tube D-054 / D-054S / D-054SB
9 pages
3. SECURE - Benchmarking Generative Large Language
No ratings yet
3. SECURE - Benchmarking Generative Large Language
14 pages
15647/Ltt Guwahati Ex Sleeper Class (SL)
No ratings yet
15647/Ltt Guwahati Ex Sleeper Class (SL)
2 pages
Complete Us It Recruitement Process
No ratings yet
Complete Us It Recruitement Process
10 pages
Baa3023-Project MGMT in Construction 21213
No ratings yet
Baa3023-Project MGMT in Construction 21213
6 pages
InControl 1 2013 en
No ratings yet
InControl 1 2013 en
12 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

SAS R::: Cheat Sheet

Uploaded by

SAS R::: Cheat Sheet

Uploaded by

SAS <-> R :: CHEAT SHEET

Introduction New variables, conditional editing Some plotting in R

CC BY SA Brendan O’Dowd • brendanjodowd@gmail.com • Updated 2021-09

data new_data; new_data <- old_data %>%

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.