Package Aasea': R Topics Documented
Package Aasea': R Topics Documented
Package Aasea': R Topics Documented
November 9, 2019
Type Package
Title Amino Acid Substitution Effect Analyser
Version 1.1.0
Author Raja Sekhara Reddy D.M
Maintainer Raja Sekhara Reddy D.M <raja.duvvuru@gmail.com>
Description
Given a protein multiple sequence alignment, it is daunting task to assess the effects of substitu-
tions along sequence length. 'aaSEA' package is intended to help researchers to rapidly anal-
yse property changes caused by single, multiple and correlated amino acid substitutions in pro-
teins. Methods for identification of co-evolving positions from multiple sequence align-
ment are as described in : Pelé et al., (2017) <doi:10.4172/2379-1764.1000250>.
Depends R(>= 3.4.0)
Imports DT(>= 0.4), networkD3(>= 0.4), shiny(>= 1.0.5),
shinydashboard(>= 0.7.0), magrittr(>= 1.5), Bios2cor(>= 2.0),
seqinr(>= 3.4-5), plotly(>= 4.7.1), Hmisc(>= 4.1-1)
License GPL-3
Encoding UTF-8
LazyData true
RoxygenNote 6.1.1
Suggests knitr, rmarkdown
VignetteBuilder knitr
NeedsCompilation no
Repository CRAN
Date/Publication 2019-11-09 16:20:02 UTC
R topics documented:
AAindex . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
corSubFile . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
Cruciani . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1
2 AAindex
Fasgai . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
getAASub . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
getCorPropChange . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
getCorSites . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
getPropChange . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
getPropCorr . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
getTopSub . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Kidera . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
matEncode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
plotCorNet . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
plotCorSubChanges . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
plotMultiSubChange . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
plotSingleSubChange . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
startSeaShiny . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
Index 13
Description
A data frame of 533 amino acid properties.
Usage
AAindex
Format
A data frame with 533 rows and 20 variables:
name amino acid property as in AAindex
A Alanine, Ala, A
C Cysteine, Cys, C
D Aspartate, Asp, D
E Glutamate, Glu, E
F Phenylalanine, Phe, F
G Glycine, Gly, G
H Histidine, His, H
I Isoleucine, Ile, I
K Lysine, Lys, K
L Leucine, Leu, L
M Methionine, Met, M
N Asparagine, Asn, N
corSubFile 3
P Proline, Pro, P
Q Glutamine, Gln, Q
R Arginine, Arg, R
S Serine, Ser, S
T Threonine, Thr, T
V Valine, Val, V
W Tryptophan, Trp, W
Y Tyrosine, Tyr, Y ...
Description
This file is output of ’getTopSub’ function with subset of alignment as input. This subset alignment
is output of ’getCorSite’ function
Usage
corSubFile
Format
A data frame with 2 Columns representing positional pairs:
Pos1 Position one prefixed with Wt amino acid and suffixed with substitution
Pos2 Position two prefixed with Wt amino acid and suffixed with substitution ...
Description
A data frame of 3 Cruciani properties for 20 amino acids.
Usage
Cruciani
Format
A data frame with 20 rows and 3 columns representing amino acids and Cruciani properties:
Names of amino acids
PP1 Polarity
PP2 Hydrophobicity
PP3 H-bonding ...
4 getAASub
Description
A data frame of six Fasgai vectors for 20 amino acids.
Usage
Fasgai
Format
A data frame with 20 rows and 6 columns representing amino acids and Fasgai vectors:
Names of amino acids
F1 Hydrophobicity index
F2 Alpha and turn propensities
F3 Bulky properties
F4 Compositional characteristic index
F5 Local flexibility
F6 Electronic properties ...
Description
Get amino acid substitutions from multiple sequence alignment
Usage
getAASub(fileLoc)
Arguments
fileLoc exact location of multiple sequence alignment file in "FASTA" format
Value
Returns a list of two data frames 1.Single substitutions 2.Multiple substitutions
Examples
file = system.file("extdata", "linB_Prot_ali.fasta", package = "aaSEA")
getAASub(fileLoc = file)
getCorPropChange 5
Description
Get property changes associated with correlated substitutions
Usage
getCorPropChange(corSubFile, propertyDF = "Cruciani",
propertyIndex = 1)
Arguments
corSubFile A dataframe with two columns i.e. Pos1 and Pos2. This file is generated by
getTopSub
propertyDF Choose one of Cruciani, Fasgai, Kidera or AAindex based amino acid properties
propertyIndex The amino acid property to analyse. It is row number in propertyDF data frame
Value
A data frame of 12 columns consisting data regarding Wild type and mutant resides and their posi-
tions alogn with their properties and net changes between Pos1 and Pos2
Examples
getCorPropChange(corSubFile = corSubFile, propertyDF = "Cruciani", propertyIndex = 1)
Description
Get correlated sites with substitutions
Usage
getCorSites(fileLoc, corMethod = "mcbasc")
Arguments
fileLoc exact location of multiple sequence alignment file in "FASTA" format
corMethod One of the methods to compute correlated sites viz. ’mip’, ’elsc’, ’mcbasc’ and
’omes’. Default is ’mcbasc’.
6 getPropChange
Value
A subset alignment matrix of original multiple sequence alignment with significant correlations.
Examples
file = system.file("extdata", "linB_toy_ali.fasta", package = "aaSEA")
getCorSites(fileLoc = file,corMethod="mcbasc")
getPropChange Get wild type and substituted amino acid properties and associated
property changes
Description
Get wild type and substituted amino acid properties and associated property changes
Usage
Arguments
Value
A substitution data frame with three additional columns i.e. wt.Prop, mu.Prop and Delta.Prop
Examples
ssFileLoc <- system.file("extdata", "singleSub.rda", package = "aaSEA")
singleSubFile <- readRDS(ssFileLoc)
msFileLoc <- system.file("extdata", "multiSub.rda", package = "aaSEA")
multiSubFile <- readRDS(msFileLoc)
getPropChange(subFile = singleSubFile, propertyDF = "Cruciani", propertyIndex = 1)
getPropChange(subFile = multiSubFile, propertyDF = "Cruciani", propertyIndex = 1)
getPropCorr 7
Description
Get amino acid property wise correlations of co-evolving columns of a multiple sequence alignment
Usage
getPropCorr(selMat, propertyDF = "Cruciani", propertyIndex = 1)
Arguments
selMat A subset matrix of original multiple sequence alignment with significant corre-
lations identified with ’getCorSites’ function
propertyDF One of the amino acid property data frames. viz. Cruciani, Fasgai, Kidera,
AAindex. Default is Cruciani properties
propertyIndex Specific property row number from the data frame of propertyDF
Value
A data frame of four columns viz. Pos1, Pos2, Cor and p Value. Results are filtered to find position
pairs with correlations above 0.8 and below -0.8
Examples
selMatLoc <- system.file("extdata", "selMat.rda", package = "aaSEA")
selMat <- readRDS(selMatLoc)
getPropCorr(selMat = selMat, propertyDF = "Cruciani", propertyIndex = 1)
getTopSub Get sites with more than one-correlated substitutions other than con-
served amino acids at that position
Description
Get sites with more than one-correlated substitutions other than conserved amino acids at that posi-
tion
Usage
getTopSub(selMat)
8 Kidera
Arguments
selMat A subset matrix of original multiple sequence alignment with significant corre-
lations identified with ’getCorSites’ function
Value
A data frame with two columns i.e. Pos1 and Pos2 which is a filtered subset of many correlated
substitutions based on frequency of substitution after consensus of a column in multiple sequence
alignment
Examples
selMatLoc <- system.file("extdata", "selMat.rda", package = "aaSEA")
selMat <- readRDS(selMatLoc)
getTopSub(selMat = selMat)
Description
A data frame of 10 Kidera factors for 20 amino acids.
Usage
Kidera
Format
A data frame with 20 rows and ten columns representing 20 amino acids and 10 kidera factors:
Description
Function to encode Correlated columns of alignment matrix with desired properties
Usage
matEncode(aliMat, pIndex, propDf)
Arguments
aliMat amino acid multiple sequence alignment in the form of a matrix
pIndex Amino acid property index to be encoded. It is the row number in the property
data frame
propDf The amino acid property to analyse. It is row number in propertyDF data frame
Value
A matrix of input dimensions with amino acid alphabets replaced by amino acid properties of choice
Examples
aliMatLoc <- system.file("extdata", "aliMat.rda", package = "aaSEA")
aliMat <- readRDS(aliMatLoc)
matEncode(aliMat = aliMat, pIndex = 1, propDf = "Cruciani" )
Description
simple interactive network diagram of correlated substitution pairs
Usage
plotCorNet(corSubDF)
Arguments
corSubDF A data frame with two columns i.e. Pos1 and Pos2. This file is generated by
’getTopSub’ function.
10 plotCorSubChanges
Value
Examples
Description
Usage
plotCorSubChanges(corSitePropChange)
Arguments
corSitePropChange
A data frame of coevolving sites and associated property changes obtained by
"getPropCorr" function with selected MSA matrix (selMat) and desired property
selected from property data frame and property index.
Value
Returns an interactive heat map of significant sites with selected property correlations
Examples
plotMultiSubChange Plots heat map of multiple substitution associated changes per site
Description
Plots heat map of multiple substitution associated changes per site
Usage
plotMultiSubChange(multiSubChangeDF)
Arguments
multiSubChangeDF
A data frame of multiple amino acid substitutions per site calculated with ’getAA-
Sub’ and associated property changes obtained by calling ’getPropChange’ func-
tion
Value
An interactive heat map of multiple substitution assocaiated changes per site
Examples
multiSubChangeLoc <- system.file("extdata", "multiSubChange.rda", package = "aaSEA")
multiSubChange <- readRDS(multiSubChangeLoc)
plotMultiSubChange(multiSubChangeDF = multiSubChange )
Description
Plots single substitution change histogram
Usage
plotSingleSubChange(singleSubChangeDF)
Arguments
singleSubChangeDF
A data frame of single amino acid substitutions per site calculated with ’getAA-
Sub’ and associated property changes obtained by calling ’getPropChange’ func-
tion
12 startSeaShiny
Value
An interactive histogram representing amino acid substitution associaited change
Examples
singleSubChangeLoc <- system.file("extdata", "singleSubChange.rda", package = "aaSEA")
singleSubChange <- readRDS(singleSubChangeLoc)
plotSingleSubChange(singleSubChangeDF = singleSubChange)
Description
Title Enables to start and run the app
Usage
startSeaShiny()
Value
a shiny app will be launched in browser
Examples
if(interactive()){
startSeaShiny()
}
Index
∗Topic datasets
AAindex, 2
corSubFile, 3
Cruciani, 3
Fasgai, 4
Kidera, 8
AAindex, 2
corSubFile, 3
Cruciani, 3
Fasgai, 4
getAASub, 4
getCorPropChange, 5
getCorSites, 5
getPropChange, 6
getPropCorr, 7
getTopSub, 7
Kidera, 8
matEncode, 9
plotCorNet, 9
plotCorSubChanges, 10
plotMultiSubChange, 11
plotSingleSubChange, 11
startSeaShiny, 12
13