Package Aasea': R Topics Documented

Download as pdf or txt
Download as pdf or txt
You are on page 1of 13

Package ‘aaSEA’

November 9, 2019
Type Package
Title Amino Acid Substitution Effect Analyser
Version 1.1.0
Author Raja Sekhara Reddy D.M
Maintainer Raja Sekhara Reddy D.M <raja.duvvuru@gmail.com>
Description
Given a protein multiple sequence alignment, it is daunting task to assess the effects of substitu-
tions along sequence length. 'aaSEA' package is intended to help researchers to rapidly anal-
yse property changes caused by single, multiple and correlated amino acid substitutions in pro-
teins. Methods for identification of co-evolving positions from multiple sequence align-
ment are as described in : Pelé et al., (2017) <doi:10.4172/2379-1764.1000250>.
Depends R(>= 3.4.0)
Imports DT(>= 0.4), networkD3(>= 0.4), shiny(>= 1.0.5),
shinydashboard(>= 0.7.0), magrittr(>= 1.5), Bios2cor(>= 2.0),
seqinr(>= 3.4-5), plotly(>= 4.7.1), Hmisc(>= 4.1-1)
License GPL-3
Encoding UTF-8
LazyData true
RoxygenNote 6.1.1
Suggests knitr, rmarkdown
VignetteBuilder knitr
NeedsCompilation no
Repository CRAN
Date/Publication 2019-11-09 16:20:02 UTC

R topics documented:
AAindex . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
corSubFile . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
Cruciani . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3

1
2 AAindex

Fasgai . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
getAASub . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
getCorPropChange . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
getCorSites . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
getPropChange . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
getPropCorr . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
getTopSub . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Kidera . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
matEncode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
plotCorNet . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
plotCorSubChanges . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
plotMultiSubChange . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
plotSingleSubChange . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
startSeaShiny . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12

Index 13

AAindex A data frame of 533 amino acid properties from AAindex.

Description
A data frame of 533 amino acid properties.

Usage
AAindex

Format
A data frame with 533 rows and 20 variables:
name amino acid property as in AAindex
A Alanine, Ala, A
C Cysteine, Cys, C
D Aspartate, Asp, D
E Glutamate, Glu, E
F Phenylalanine, Phe, F
G Glycine, Gly, G
H Histidine, His, H
I Isoleucine, Ile, I
K Lysine, Lys, K
L Leucine, Leu, L
M Methionine, Met, M
N Asparagine, Asn, N
corSubFile 3

P Proline, Pro, P
Q Glutamine, Gln, Q
R Arginine, Arg, R
S Serine, Ser, S
T Threonine, Thr, T
V Valine, Val, V
W Tryptophan, Trp, W
Y Tyrosine, Tyr, Y ...

corSubFile A data frame of correlated sites.

Description
This file is output of ’getTopSub’ function with subset of alignment as input. This subset alignment
is output of ’getCorSite’ function

Usage
corSubFile

Format
A data frame with 2 Columns representing positional pairs:
Pos1 Position one prefixed with Wt amino acid and suffixed with substitution
Pos2 Position two prefixed with Wt amino acid and suffixed with substitution ...

Cruciani A data frame of 3 Cruciani properties for 20 amino acids.

Description
A data frame of 3 Cruciani properties for 20 amino acids.

Usage
Cruciani

Format
A data frame with 20 rows and 3 columns representing amino acids and Cruciani properties:
Names of amino acids
PP1 Polarity
PP2 Hydrophobicity
PP3 H-bonding ...
4 getAASub

Fasgai A data frame of six Fasgai vectors for 20 amino acids.

Description
A data frame of six Fasgai vectors for 20 amino acids.

Usage
Fasgai

Format
A data frame with 20 rows and 6 columns representing amino acids and Fasgai vectors:
Names of amino acids
F1 Hydrophobicity index
F2 Alpha and turn propensities
F3 Bulky properties
F4 Compositional characteristic index
F5 Local flexibility
F6 Electronic properties ...

getAASub Get amino acid substitutions from multiple sequence alignment

Description
Get amino acid substitutions from multiple sequence alignment

Usage
getAASub(fileLoc)

Arguments
fileLoc exact location of multiple sequence alignment file in "FASTA" format

Value
Returns a list of two data frames 1.Single substitutions 2.Multiple substitutions

Examples
file = system.file("extdata", "linB_Prot_ali.fasta", package = "aaSEA")
getAASub(fileLoc = file)
getCorPropChange 5

getCorPropChange Get property changes associated with correlated substitutions

Description
Get property changes associated with correlated substitutions

Usage
getCorPropChange(corSubFile, propertyDF = "Cruciani",
propertyIndex = 1)

Arguments
corSubFile A dataframe with two columns i.e. Pos1 and Pos2. This file is generated by
getTopSub
propertyDF Choose one of Cruciani, Fasgai, Kidera or AAindex based amino acid properties
propertyIndex The amino acid property to analyse. It is row number in propertyDF data frame

Value
A data frame of 12 columns consisting data regarding Wild type and mutant resides and their posi-
tions alogn with their properties and net changes between Pos1 and Pos2

Examples
getCorPropChange(corSubFile = corSubFile, propertyDF = "Cruciani", propertyIndex = 1)

getCorSites Get correlated sites with substitutions

Description
Get correlated sites with substitutions

Usage
getCorSites(fileLoc, corMethod = "mcbasc")

Arguments
fileLoc exact location of multiple sequence alignment file in "FASTA" format
corMethod One of the methods to compute correlated sites viz. ’mip’, ’elsc’, ’mcbasc’ and
’omes’. Default is ’mcbasc’.
6 getPropChange

Value

A subset alignment matrix of original multiple sequence alignment with significant correlations.

Examples
file = system.file("extdata", "linB_toy_ali.fasta", package = "aaSEA")
getCorSites(fileLoc = file,corMethod="mcbasc")

getPropChange Get wild type and substituted amino acid properties and associated
property changes

Description

Get wild type and substituted amino acid properties and associated property changes

Usage

getPropChange(subFile, propertyDF = "Cruciani", propertyIndex = 1)

Arguments

subFile A data frame of single or multiple substitutions obtained using ’getAASub’


function
propertyDF Choose one of Cruciani, Fasgai, Kidera or AAindex based amino acid properties
propertyIndex The amino acid property to analyse. It is row number in propertyDF data frame

Value

A substitution data frame with three additional columns i.e. wt.Prop, mu.Prop and Delta.Prop

Examples
ssFileLoc <- system.file("extdata", "singleSub.rda", package = "aaSEA")
singleSubFile <- readRDS(ssFileLoc)
msFileLoc <- system.file("extdata", "multiSub.rda", package = "aaSEA")
multiSubFile <- readRDS(msFileLoc)
getPropChange(subFile = singleSubFile, propertyDF = "Cruciani", propertyIndex = 1)
getPropChange(subFile = multiSubFile, propertyDF = "Cruciani", propertyIndex = 1)
getPropCorr 7

getPropCorr Get amino acid property wise correlations of co-evolving columns of


a multiple sequence alignment

Description
Get amino acid property wise correlations of co-evolving columns of a multiple sequence alignment

Usage
getPropCorr(selMat, propertyDF = "Cruciani", propertyIndex = 1)

Arguments
selMat A subset matrix of original multiple sequence alignment with significant corre-
lations identified with ’getCorSites’ function
propertyDF One of the amino acid property data frames. viz. Cruciani, Fasgai, Kidera,
AAindex. Default is Cruciani properties
propertyIndex Specific property row number from the data frame of propertyDF

Value
A data frame of four columns viz. Pos1, Pos2, Cor and p Value. Results are filtered to find position
pairs with correlations above 0.8 and below -0.8

Examples
selMatLoc <- system.file("extdata", "selMat.rda", package = "aaSEA")
selMat <- readRDS(selMatLoc)
getPropCorr(selMat = selMat, propertyDF = "Cruciani", propertyIndex = 1)

getTopSub Get sites with more than one-correlated substitutions other than con-
served amino acids at that position

Description
Get sites with more than one-correlated substitutions other than conserved amino acids at that posi-
tion

Usage
getTopSub(selMat)
8 Kidera

Arguments
selMat A subset matrix of original multiple sequence alignment with significant corre-
lations identified with ’getCorSites’ function

Value
A data frame with two columns i.e. Pos1 and Pos2 which is a filtered subset of many correlated
substitutions based on frequency of substitution after consensus of a column in multiple sequence
alignment

Examples
selMatLoc <- system.file("extdata", "selMat.rda", package = "aaSEA")
selMat <- readRDS(selMatLoc)
getTopSub(selMat = selMat)

Kidera A data frame of 10 Kidera factors for 20 amino acids

Description
A data frame of 10 Kidera factors for 20 amino acids.

Usage
Kidera

Format
A data frame with 20 rows and ten columns representing 20 amino acids and 10 kidera factors:

Names of amino acids


KF1 Helix/bend preference
KF2 Side-chain size
KF3 Extended structure preference
KF4 Hydrophobicity
KF5 Double-bend preference
KF6 Partial specific volume
KF7 Flat extended preference
KF8 Occurrence in alpha region
KF9 pK-C
KF10 Surrounding hydrophobicity ...
matEncode 9

matEncode Function to encode Correlated columns of alignment matrix with de-


sired properties

Description
Function to encode Correlated columns of alignment matrix with desired properties

Usage
matEncode(aliMat, pIndex, propDf)

Arguments
aliMat amino acid multiple sequence alignment in the form of a matrix
pIndex Amino acid property index to be encoded. It is the row number in the property
data frame
propDf The amino acid property to analyse. It is row number in propertyDF data frame

Value
A matrix of input dimensions with amino acid alphabets replaced by amino acid properties of choice

Examples
aliMatLoc <- system.file("extdata", "aliMat.rda", package = "aaSEA")
aliMat <- readRDS(aliMatLoc)
matEncode(aliMat = aliMat, pIndex = 1, propDf = "Cruciani" )

plotCorNet simple interactive network diagram of correlated substitution pairs

Description
simple interactive network diagram of correlated substitution pairs

Usage
plotCorNet(corSubDF)

Arguments
corSubDF A data frame with two columns i.e. Pos1 and Pos2. This file is generated by
’getTopSub’ function.
10 plotCorSubChanges

Value

An interactive network diagram of correlated substitution pairs.

Examples

corSub <- corSubFile


plotCorNet(corSubDF = corSub)

plotCorSubChanges Plot co-evolving sites with selected property correlations

Description

Plot co-evolving sites with selected property correlations

Usage

plotCorSubChanges(corSitePropChange)

Arguments
corSitePropChange
A data frame of coevolving sites and associated property changes obtained by
"getPropCorr" function with selected MSA matrix (selMat) and desired property
selected from property data frame and property index.

Value

Returns an interactive heat map of significant sites with selected property correlations

Examples

fileLocation <- system.file("extdata", "corSitePropChangeDF.rda", package = "aaSEA")


corSitePropChange <- readRDS(fileLocation)
plotCorSubChanges(corSitePropChange = corSitePropChange)
plotMultiSubChange 11

plotMultiSubChange Plots heat map of multiple substitution associated changes per site

Description
Plots heat map of multiple substitution associated changes per site

Usage
plotMultiSubChange(multiSubChangeDF)

Arguments
multiSubChangeDF
A data frame of multiple amino acid substitutions per site calculated with ’getAA-
Sub’ and associated property changes obtained by calling ’getPropChange’ func-
tion

Value
An interactive heat map of multiple substitution assocaiated changes per site

Examples
multiSubChangeLoc <- system.file("extdata", "multiSubChange.rda", package = "aaSEA")
multiSubChange <- readRDS(multiSubChangeLoc)
plotMultiSubChange(multiSubChangeDF = multiSubChange )

plotSingleSubChange Plots single substitution change histogram

Description
Plots single substitution change histogram

Usage
plotSingleSubChange(singleSubChangeDF)

Arguments
singleSubChangeDF
A data frame of single amino acid substitutions per site calculated with ’getAA-
Sub’ and associated property changes obtained by calling ’getPropChange’ func-
tion
12 startSeaShiny

Value
An interactive histogram representing amino acid substitution associaited change

Examples
singleSubChangeLoc <- system.file("extdata", "singleSubChange.rda", package = "aaSEA")
singleSubChange <- readRDS(singleSubChangeLoc)
plotSingleSubChange(singleSubChangeDF = singleSubChange)

startSeaShiny Title Enables to start and run the app

Description
Title Enables to start and run the app

Usage
startSeaShiny()

Value
a shiny app will be launched in browser

Examples
if(interactive()){
startSeaShiny()
}
Index

∗Topic datasets
AAindex, 2
corSubFile, 3
Cruciani, 3
Fasgai, 4
Kidera, 8

AAindex, 2

corSubFile, 3
Cruciani, 3

Fasgai, 4

getAASub, 4
getCorPropChange, 5
getCorSites, 5
getPropChange, 6
getPropCorr, 7
getTopSub, 7

Kidera, 8

matEncode, 9

plotCorNet, 9
plotCorSubChanges, 10
plotMultiSubChange, 11
plotSingleSubChange, 11

startSeaShiny, 12

13

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy