Ggstatsplot

Package ‘ggstatsplot’
March 11, 2021

Type Package
Title 'ggplot2' Based Plots with Statistical Details
Version 0.7.1
Maintainer Indrajeet Patil <patilindrajeet.science@gmail.com>
Description Extension of 'ggplot2', 'ggstatsplot' creates graphics with
details from statistical tests included in the plots themselves. It
provides an easier API to generate information-rich plots for
statistical analysis of continuous (violin plots, scatterplots,
histograms, dot plots, dot-and-whisker plots) or categorical (pie and
bar charts) data. Currently, it supports the most common types of
statistical approaches and tests: parametric, nonparametric, robust,
and Bayesian versions of t-test/ANOVA, correlation analyses,
contingency table analysis, meta-analysis, and regression analyses.
License GPL-3 | file LICENSE
URL https://indrajeetpatil.github.io/ggstatsplot/,
https://github.com/IndrajeetPatil/ggstatsplot
BugReports https://github.com/IndrajeetPatil/ggstatsplot/issues
Depends R (>= 3.6.0)
Imports dplyr,
ggcorrplot,
ggExtra,
ggplot2,
ggrepel,
ggsignif (>= 0.6.1),
insight (>= 0.13.0),
ipmisc (>= 6.0.0),
pairwiseComparisons (>= 3.1.3),
paletteer,
parameters (>= 0.12.0),
patchwork,
performance,
purrr,
rlang,
stats,
statsExpressions (>= 1.0.0),
tidyr,
utils
1
2 R topics documented:
Suggests afex,
forcats,
knitr,
MASS,
metaBMA,
metafor,
rmarkdown,
spelling,
testthat,
tibble,
vdiffr,
WRS2
VignetteBuilder knitr
Config/testthat/edition 3
Config/testthat/parallel true
Encoding UTF-8
Language en-US
LazyData true
Roxygen list(markdown = TRUE, roclets = c(``rd'', ``namespace'', ``collate'',
``pkgapi::api_roclet''))
RoxygenNote 7.1.1.9001
R topics documented:
ggstatsplot-package . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
bugs_long . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
bugs_wide . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
combine_plots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
ggbarstats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
ggbetweenstats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
ggcoefstats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
ggcorrmat . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
ggdotplotstats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
gghistostats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
ggpiestats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
ggscatterstats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
ggwithinstats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
grouped_ggbarstats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
grouped_ggbetweenstats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
grouped_ggcorrmat . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
grouped_ggdotplotstats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
grouped_gghistostats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
grouped_ggpiestats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
grouped_ggscatterstats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
grouped_ggwithinstats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
iris_long . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
movies_long . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
movies_wide . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
theme_ggstatsplot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
ggstatsplot-package 3
Titanic_full . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
VR_dilemma . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
Index 68
ggstatsplot-package ggstatsplot: ’ggplot2’ Based Plots with Statistical Details
Description
Maturing
ggstatsplot is an extension of ggplot2 package. It creates graphics with details from statisti-
cal tests included in the plots themselves. It provides an easier API to generate information-rich
plots for statistical analysis of continuous (violin plots, scatterplots, histograms, dot plots, dot-and-
whisker plots) or categorical (pie and bar charts) data. Currently, it supports the most common types
of statistical tests: parametric, nonparametric, robust, and Bayesian versions of t-test/ANOVA, cor-
relation analyses, contingency table analysis, meta-analysis, and regression analyses.
Details
ggstatsplot
The main functions are-
• ggbetweenstats function to produce information-rich comparison plot between different

groups or conditions with ggplot2 and details from the statistical tests in the subtitle.
• ggwithinstats function to produce information-rich comparison plot within different groups
or conditions with ggplot2 and details from the statistical tests in the subtitle.
• ggscatterstats function to produce ggplot2 scatterplots along with a marginal distribution
plots from ggExtra and details from the statistical tests in the subtitle.
• ggpiestats function to produce pie chart with details from the statistical tests in the subtitle.
• ggbarstats function to produce stacked bar chart with details from the statistical tests in the
subtitle.
• gghistostats function to produce histogram for a single variable with results from one sam-
ple test displayed in the subtitle.
• ggdotplotstats function to produce Cleveland-style dot plots/charts for a single variable
with labels and results from one sample test displayed in the subtitle.
• ggcorrmat function to visualize the correlation matrix.
• ggcoefstats function to visualize results from regression analyses.
• combine_plots helper function to combine multiple ggstatsplot plots using patchwork::wrap_plots().
For more documentation, see the dedicated Website.
Author(s)
Maintainer: Indrajeet Patil <patilindrajeet.science@gmail.com> (ORCID) (@patilindrajeets)
[copyright holder]
4 bugs_long
See Also
Useful links:
• https://indrajeetpatil.github.io/ggstatsplot/
• https://github.com/IndrajeetPatil/ggstatsplot
• Report bugs at https://github.com/IndrajeetPatil/ggstatsplot/issues
bugs_long Tidy version of the "Bugs" dataset.
Description
Tidy version of the "Bugs" dataset.
Usage
bugs_long
Format
A data frame with 372 rows and 6 variables
• subject. Dummy identity number for each participant.

• gender. Participant’s gender (Female, Male).
• region. Region of the world the participant was from.
• education. Level of education.
• condition. Condition of the experiment the participant gave rating for (LDLF: low freight-
eningness and low disgustingness; LFHD: low freighteningness and high disgustingness;
HFHD: high freighteningness and low disgustingness; HFHD: high freighteningness and high
disgustingness).
• desire. The desire to kill an arthropod was indicated on a scale from 0 to 10.
Details
This data set, "Bugs", provides the extent to which men and women want to kill arthropods that vary
in freighteningness (low, high) and disgustingness (low, high). Each participant rates their attitudes
towards all anthropods. Subset of the data reported by Ryan et al. (2013).
Source
https://www.sciencedirect.com/science/article/pii/S0747563213000277
Examples
dim(bugs_long)
head(bugs_long)
dplyr::glimpse(bugs_long)
bugs_wide 5
bugs_wide Wide-format version of the "Bugs" dataset.
Description
Wide-format version of the "Bugs" dataset.
Usage
bugs_wide
Format
• subject. Dummy identity number for each participant.

• gender. Participant’s gender (Female, Male).
• region. Region of the world the participant was from.
• education. Level of education.
• ldlf,ldhf,hdlf,hdhf.The desire to kill an arthropod was indicated on a scale from 0 to 10 in each
condition of the experiment (LDLF: low freighteningness and low disgustingness; LFHD:
low freighteningness and high disgustingness; HFHD: high freighteningness and low disgust-
ingness; HFHD: high freighteningness and high disgustingness).
Details
This data set, "Bugs", provides the extent to which men and women want to kill arthropods that vary
in freighteningness (low, high) and disgustingness (low, high). Each participant rates their attitudes
towards all anthropods. Subset of the data reported by Ryan et al. (2013).
Source
https://www.sciencedirect.com/science/article/pii/S0747563213000277
Examples
dim(bugs_wide)
head(bugs_wide)
dplyr::glimpse(bugs_wide)
6 combine_plots
combine_plots Combining and arranging multiple plots in a grid
Description
Maturing
Wrapper around patchwork::wrap_plots that will return a combined grid of plots with annota-
tions.
Usage
combine_plots(
plotlist,
guides = "collect",
plotgrid.args = list(),
annotation.args = list(),
...
)
Arguments
plotlist A list containing ggplot objects.
guides A string specifying how guides should be treated in the layout. 'collect' will
collect guides below to the given nesting level, removing duplicates. 'keep'
will stop collection at this level and let guides be placed alongside their plot.
auto will allow guides to be collected if a upper level tries, but place them along-
side the plot if not. If you modify default guide "position" with theme(legend.position=...)
while also collecting guides you must apply that change to the overall patchwork
(see example).
plotgrid.args A list of additional arguments passed to patchwork::wrap_plots, except for
guides argument which is already separately specified here.
annotation.args
A list of additional arguments passed to patchwork::plot_annotation.
... Currently ignored.
Value
Combined plot with annotation labels
References
https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/combine_plots.html
Examples
# loading the necessary libraries
library(ggplot2)
# preparing the first plot

p1 <-
ggplot2::ggplot(
ggbarstats 7
data = subset(iris, iris$Species == "setosa"),

aes(x = Sepal.Length, y = Sepal.Width)
) +
geom_point() +
labs(title = "setosa")
# preparing the second plot

p2 <-
ggplot2::ggplot(
data = subset(iris, iris$Species == "versicolor"),
aes(x = Sepal.Length, y = Sepal.Width)
) +
geom_point() +
labs(title = "versicolor")
# combining the plot with a title and a caption

combine_plots(
plotlist = list(p1, p2),
annotation.args = list(
tag_levels = "a",
title = "Dataset: Iris Flower dataset",
subtitle = "Edgar Anderson collected this data",
caption = "Note: Only two species of flower are displayed"
)
)
ggbarstats Bar (column) charts with statistical tests
Description
Maturing
Bar charts for categorical data with statistical details included in the plot as a subtitle.
Usage
ggbarstats(
data,
x,
y,
counts = NULL,
type = "parametric",
paired = FALSE,
results.subtitle = TRUE,
label = "percentage",
label.args = list(alpha = 1, fill = "white"),
k = 2L,
proportion.test = TRUE,
perc.k = 0,
bf.message = TRUE,
ratio = NULL,
conf.level = 0.95,
sampling.plan = "indepMulti",
8 ggbarstats
fixed.margin = "rows",
prior.concentration = 1,
title = NULL,
subtitle = NULL,
caption = NULL,
legend.title = NULL,
xlab = NULL,
ylab = NULL,
ggtheme = ggplot2::theme_bw(),
ggstatsplot.layer = TRUE,
package = "RColorBrewer",
palette = "Dark2",
ggplot.component = NULL,
output = "plot",
...
)
Arguments
data A dataframe (or a tibble) from which variables specified are to be taken. Other
data types (e.g., matrix,table, array, etc.) will not be accepted.
x The variable to use as the rows in the contingency table. Please note that if there
are empty factor levels in your variable, they will be dropped.
y The variable to use as the columns in the contingency table. Please note that
if there are empty factor levels in your variable, they will be dropped. Default
is NULL. If NULL, one-sample proportion test (a goodness of fit test) will be run
for the x variable. Otherwise an appropriate association test will be run. This
argument can not be NULL for ggbarstats function.
counts A string naming a variable in data containing counts, or NULL if each row repre-
sents a single observation.
type A character specifying the type of statistical approach. Four possible options:
• "parametric"
• "nonparametric"
• "robust"
• "bayes"
Corresponding abbreviations are also accepted: "p" (for parametric), "np" (for
nonparametric), "r" (for robust), or "bf" (for Bayesian).
paired Logical indicating whether data came from a within-subjects or repeated mea-
sures design study (Default: FALSE). If TRUE, McNemar’s test expression will be
returned. If FALSE, Pearson’s chi-square test will be returned.
results.subtitle
Decides whether the results of statistical tests are to be displayed as a subtitle
(Default: TRUE). If set to FALSE, only the plot will be returned.
label Character decides what information needs to be displayed on the label in each
pie slice. Possible options are "percentage" (default), "counts", "both".
label.args Additional aesthetic arguments that will be passed to geom_label.
k Number of digits after decimal point (should be an integer) (Default: k = 2L).
ggbarstats 9
proportion.test
Decides whether proportion test for x variable is to be carried out for each level
of y (Default: TRUE). In ggbarstats, only p-values from this test will be dis-
played.
perc.k Numeric that decides number of decimal places for percentage labels (Default:
0).
bf.message Logical that decides whether to display Bayes Factor in favor of the null hypoth-
esis. This argument is relevant only for parametric test (Default: TRUE).
ratio A vector of proportions: the expected proportions for the proportion test (should
sum to 1). Default is NULL, which means the null is equal theoretical proportions
across the levels of the nominal variable. This means if there are two levels
this will be ratio = c(0.5,0.5) or if there are four levels this will be ratio =
c(0.25,0.25,0.25,0.25), etc.
conf.level Scalar between 0 and 1. If unspecified, the defaults return 95% confidence/credible
intervals (0.95).
sampling.plan Character describing the sampling plan. Possible options are "indepMulti"
(independent multinomial; default), "poisson", "jointMulti" (joint multino-
mial), "hypergeom" (hypergeometric). For more, see ?BayesFactor::contingencyTableBF().
fixed.margin For the independent multinomial sampling plan, which margin is fixed ("rows"
or "cols"). Defaults to "rows".
prior.concentration
Specifies the prior concentration parameter, set to 1 by default. It indexes the ex-
pected deviation from the null hypothesis under the alternative, and corresponds
to Gunel and Dickey’s (1974) "a" parameter.
title The text for the plot title.
subtitle The text for the plot subtitle. Will work only if results.subtitle = FALSE.
caption The text for the plot caption.
legend.title Title text for the legend.
xlab Custom text for the x axis label (Default: NULL, which will cause the x axis label
to be the x variable).
ylab Custom text for the y axis label (Default: NULL).
ggtheme A function, ggplot2 theme name. Default value is ggplot2::theme_bw().
Any of the ggplot2 themes, or themes from extension packages are allowed
(e.g., ggthemes::theme_fivethirtyeight(), hrbrthemes::theme_ipsum_ps(),
etc.).
ggstatsplot.layer
Logical that decides whether theme_ggstatsplot theme elements are to be dis-
played along with the selected ggtheme (Default: TRUE). theme_ggstatsplot is
an opinionated theme layer that override some aspects of the selected ggtheme.
package Name of the package from which the given palette is to be extracted. The avail-
able palettes and packages can be checked by running View(paletteer::palettes_d_names).
palette Name of the package from which the given palette is to be extracted. The avail-
ggplot.component
A ggplot component to be added to the plot prepared by ggstatsplot. This
argument is primarily helpful for grouped_ variants of all primary functions.
Default is NULL. The argument should be entered as a ggplot2 function or a list
of ggplot2 functions.
10 ggbetweenstats
output Character that describes what is to be returned: can be "plot" (default) or

"subtitle" or "caption". Setting this to "subtitle" will return the expres-
sion containing statistical results. If you have set results.subtitle = FALSE,
then this will return a NULL. Setting this to "caption" will return the expres-
sion containing details about Bayes Factor analysis, but valid only when type =
"parametric" and bf.message = TRUE, otherwise this will return a NULL.
See Also
grouped_ggbarstats, ggpiestats, grouped_ggpiestats
Examples
# for reproducibility
set.seed(123)
# association test (or contingency table analysis)

ggstatsplot::ggbarstats(
data = mtcars,
x = vs,
y = cyl
)
ggbetweenstats Box/Violin plots for group or condition comparisons in between-

subjects designs.
Description
Maturing
A combination of box and violin plots along with jittered data points for between-subjects designs
with statistical details included in the plot as a subtitle.
Usage
ggbetweenstats(
data,
x,
y,
plot.type = "boxviolin",
pairwise.comparisons = TRUE,
pairwise.display = "significant",
p.adjust.method = "holm",
effsize.type = "unbiased",
bf.prior = 0.707,
bf.message = TRUE,
xlab = NULL,
ylab = NULL,
ggbetweenstats 11
caption = NULL,
title = NULL,
subtitle = NULL,
k = 2L,
var.equal = FALSE,
conf.level = 0.95,
nboot = 100L,
tr = 0.2,
centrality.plotting = TRUE,
centrality.type = type,
centrality.point.args = list(size = 5, color = "darkred"),
centrality.label.args = list(size = 3, nudge_x = 0.4, segment.linetype = 4),
outlier.tagging = FALSE,
outlier.label = NULL,
outlier.coef = 1.5,
outlier.shape = 19,
outlier.color = "black",
outlier.label.args = list(size = 3),
point.args = list(position = ggplot2::position_jitterdodge(dodge.width = 0.6), alpha
= 0.4, size = 3, stroke = 0),
violin.args = list(width = 0.5, alpha = 0.2),
ggsignif.args = list(textsize = 3, tip_length = 0.01),
palette = "Dark2",
output = "plot",
...
)
Arguments
x The grouping (or independent) variable from the dataframe data.
y The response (or outcome or dependent) variable from the dataframe data.
plot.type Character describing the type of plot. Currently supported plots are "box" (for
only boxplots), "violin" (for only violin plots), and "boxviolin" (for a com-
bination of box and violin plots; default).
• "parametric"
• "nonparametric"
• "robust"
• "bayes"
pairwise.comparisons
Logical that decides whether pairwise comparisons are to be displayed (default:
TRUE). Please note that only significant comparisons will be shown by default.
12 ggbetweenstats
To change this behavior, select appropriate option with pairwise.display ar-

gument. The pairwise comparison dataframes are prepared using the pairwiseComparisons::pairw
function. For more details about pairwise comparisons, see the documentation
for that function.
pairwise.display
Decides which pairwise comparisons to display. Available options are:
• "significant" (abbreviation accepted: "s")
• "non-significant" (abbreviation accepted: "ns")
• "all"
You can use this argument to make sure that your plot is not uber-cluttered when
you have multiple groups being compared and scores of pairwise comparisons
being displayed.
p.adjust.method
Adjustment method for p-values for multiple comparisons. Possible methods
are: "holm" (default), "hochberg", "hommel", "bonferroni", "BH", "BY",
"fdr", "none".
effsize.type Type of effect size needed for parametric tests. The argument can be "eta"
(partial eta-squared) or "omega" (partial omega-squared).
bf.prior A number between 0.5 and 2 (default 0.707), the prior width to use in calculat-
ing Bayes factors.
results.subtitle
xlab, ylab Labels for x and y axis variables. If NULL (default), variable names for x and y
will be used.
var.equal a logical variable indicating whether to treat the two variances as being equal.
If TRUE then the pooled variance is used to estimate the variance otherwise the
Welch (or Satterthwaite) approximation to the degrees of freedom is used.
intervals (0.95).
nboot Number of bootstrap samples for computing confidence interval for the effect
size (Default: 100).
tr Trim level for the mean when carrying out robust tests. In case of an error,
try reducing the value of tr, which is by default set to 0.2. Lowering the value
might help.
centrality.plotting
Logical that decides whether centrality tendency measure is to be displayed as
a point with a label (Default: TRUE). Function decides which central tendency
measure to show depending on the type argument.
• mean for parametric statistics
• median for non-parametric statistics
ggbetweenstats 13
• trimmed mean for robust statistics

• MAP estimator for Bayesian statistics
If you want default centrality parameter, you can specify this using centrality.type
argument.
centrality.type
Decides which centrality parameter is to be displayed. The default is to choose
the same as type argument. You can specify this to be:
• "parameteric" (for mean)
• "nonparametric" (for median)
• robust (for trimmed mean)
• bayes (for MAP estimator)
Just as type argument, abbreviations are also accepted.
centrality.point.args, centrality.label.args
A list of additional aesthetic arguments to be passed to ggplot2::geom_point
and ggrepel::geom_label_repel geoms, which are involved in mean plotting.
outlier.tagging
Decides whether outliers should be tagged (Default: FALSE).
outlier.label Label to put on the outliers that have been tagged. This can’t be the same as x
argument.
outlier.coef Coefficient for outlier detection using Tukey’s method. With Tukey’s method,
outliers are below (1st Quartile) or above (3rd Quartile) outlier.coef times
the Inter-Quartile Range (IQR) (Default: 1.5).
outlier.shape Hiding the outliers can be achieved by setting outlier.shape = NA. Impor-
tantly, this does not remove the outliers, it only hides them, so the range cal-
culated for the y-axis will be the same with outliers shown and outliers hidden.
outlier.color Default aesthetics for outliers (Default: "black").
outlier.label.args
A list of additional aesthetic arguments to be passed to ggrepel::geom_label_repel
for outlier label plotting.
point.args A list of additional aesthetic arguments to be passed to the geom_point display-
ing the raw data.
violin.args A list of additional aesthetic arguments to be passed to the geom_violin.
ggsignif.args A list of additional aesthetic arguments to be passed to ggsignif::geom_signif.
etc.).
ggstatsplot.layer
package, palette
Name of the package from which the given palette is to be extracted. The avail-
ggplot.component
14 ggcoefstats

References
https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/ggbetweenstats.html
See Also
grouped_ggbetweenstats, ggwithinstats, grouped_ggwithinstats
Examples
# to get reproducible results from bootstrapping
set.seed(123)
library(ggstatsplot)
# simple function call with the defaults

ggstatsplot::ggbetweenstats(
data = mtcars,
x = am,
y = mpg,
title = "Fuel efficiency by type of car transmission",
caption = "Transmission (0 = automatic, 1 = manual)"
)
# more detailed function call

ggstatsplot::ggbetweenstats(
data = datasets::morley,
x = Expt,
y = Speed,
type = "nonparametric",
plot.type = "box",
xlab = "The experiment number",
ylab = "Speed-of-light measurement",
p.adjust.method = "fdr",
outlier.tagging = TRUE,
outlier.label = Run,
ggtheme = ggplot2::theme_grey(),
ggstatsplot.layer = FALSE
)
ggcoefstats Dot-and-whisker plots for regression analyses

ggcoefstats 15
Description
Maturing
Plot with the regression coefficients’ point estimates as dots with confidence interval whiskers and
other statistical details included as labels.
Usage
ggcoefstats(
x,
output = "plot",
statistic = NULL,
conf.int = TRUE,
conf.level = 0.95,
k = 2L,
exclude.intercept = FALSE,
effsize = "eta",
meta.analytic.effect = FALSE,
meta.type = "parametric",
bf.message = TRUE,
sort = "none",
xlab = "regression coefficient",
ylab = "term",
title = NULL,
subtitle = NULL,
caption = NULL,
only.significant = FALSE,
point.args = list(size = 3, color = "blue"),
errorbar.args = list(height = 0),
vline = TRUE,
vline.args = list(size = 1, linetype = "dashed"),
stats.labels = TRUE,
stats.label.color = NULL,
stats.label.args = list(size = 3, direction = "y"),
palette = "Dark2",
...
)
Arguments
x A model object to be tidied, or a tidy data frame containing results from a regres-
sion model. Function internally uses parameters::model_parameters to get
a tidy dataframe. If a data frame is used, it must contain columns named term
(names of predictors) and estimate (corresponding estimates of coefficients or
other quantities of interest).
output Character describing the expected output from this function: "plot" (visualiza-
tion of regression coefficients) or "tidy" (tidy dataframe of results parameters::model_parameters
or "glance" (object from performance::model_performance).
statistic Which statistic is to be displayed (either "t" or "f"or "z" or "chi") in the label.
This is relevant if the x argument is a dataframe.
16 ggcoefstats
conf.int Logical. Decides whether to display confidence intervals as error bars (Default:
TRUE).
conf.level Numeric deciding level of confidence or credible intervals (Default: 0.95).
exclude.intercept
Logical that decides whether the intercept should be excluded from the plot
(Default: FALSE).
effsize Character describing the effect size to be displayed: "eta" (default) or "omega".
This argument is relevant only for models objects of class aov, anova, aovlist,
"Gam", and "manova".
meta.analytic.effect
Logical that decides whether subtitle for meta-analysis via linear (mixed-effects)
models (default: FALSE). If TRUE, input to argument subtitle will be ignored.
This will be mostly relevant if a data frame with estimates and their standard
errors is entered.
meta.type Type of statistics used to carry out random-effects meta-analysis. If "parametric"
(default), metafor::rma function will be used. If "robust", metaplus::metaplus
function will be used. If "bayes", metaBMA::meta_random function will be
used.
bf.message Logical that decides whether results from running a Bayesian meta-analysis as-
suming that the effect size d varies across studies with standard deviation t (i.e.,
a random-effects analysis) should be displayed in caption. Defaults to TRUE.
sort If "none" (default) do not sort, "ascending" sort by increasing coefficient
value, or "descending" sort by decreasing coefficient value.
xlab, ylab Labels for x- and y- axis variables, respectively (Defaults: "regression coefficient"
and "term").
subtitle The text for the plot subtitle. The input to this argument will be ignored if
meta.analytic.effect is set to TRUE.
only.significant
If TRUE, only stats labels for significant effects is shown (Default: FALSE). This
can be helpful when a large number of regression coefficients are to be displayed
in a single plot. Relevant only when the output is a plot.
point.args Additional arguments that will be passed to ggplot2::geom_point geom. Please
see documentation for that function to know more about these arguments.
errorbar.args Additional arguments that will be passed to ggplot2::geom_errorbarh geom.
Please see documentation for that function to know more about these arguments.
vline Decides whether to display a vertical line (Default: "TRUE").
vline.args Additional arguments that will be passed to ggplot2::geom_vline geom. Please
see documentation for that function to know more about these arguments.
stats.labels Logical. Decides whether the statistic and p-values for each coefficient are to be
attached to each dot as a text label using ggrepel (Default: TRUE).
stats.label.color
Color for the labels. If set to NULL, colors will be chosen from the specified
package (Default: "RColorBrewer") and palette (Default: "Dark2").
ggcoefstats 17
stats.label.args
Additional arguments that will be passed to ggrepel::geom_label_repel geom.
Please see documentation for that function to know more about these arguments.
etc.).
ggstatsplot.layer
... Additional arguments to tidying method. For more, see parameters::model_parameters.
Note
Important: In case you want to carry out meta-analysis using this function, it assumes that you
have already downloaded the needed package (metafor, metaplus, or metaBMA) for meta-analysis.
1. All rows of regression estimates where either of the following quantities is NA will be removed
if labels are requested: estimate, statistic, p.value.
2. Given the rapid pace at which new methods are added to these packages, it is recommended
that you install the GitHub versions of parameters and performance in order to make most
of this function.
References
https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/ggcoefstats.html
Examples
set.seed(123)
# model object
mod <- lm(formula = mpg ~ cyl * am, data = mtcars)
# to get a plot
ggcoefstats(x = mod, output = "plot")
# to get a tidy dataframe

ggcoefstats(x = mod, output = "tidy")
# to get a glance summary

ggcoefstats(x = mod, output = "glance")
18 ggcorrmat
ggcorrmat Visualization of a correlation matrix
Description
Maturing
Correlation matrix plot or a dataframe containing results from pairwise correlation tests. The pack-
age internally uses ggcorrplot::ggcorrplot for creating the visualization matrix, while the cor-
relation analysis is carried out using the correlation::correlation function.
Usage
ggcorrmat(
data,
cor.vars = NULL,
cor.vars.names = NULL,
output = "plot",
matrix.type = "upper",
tr = 0.2,
partial = FALSE,
k = 2L,
sig.level = 0.05,
conf.level = 0.95,
bf.prior = 0.707,
pch = "cross",
ggcorrplot.args = list(method = "square", outline.color = "black"),
palette = "Dark2",
colors = c("#E69F00", "white", "#009E73"),
title = NULL,
subtitle = NULL,
caption = NULL,
...
)
Arguments
data Dataframe from which variables specified are preferentially to be taken.
cor.vars List of variables for which the correlation matrix is to be computed and visual-
ized. If NULL (default), all numeric variables from data will be used.
cor.vars.names Optional list of names to be used for cor.vars. The names should be entered
in the same order.
output Character that decides expected output from this function. If "plot", the visu-
alization matrix will be returned. If "dataframe" (or literally anything other
than "plot"), a dataframe containing all details from statistical analyses (e.g.,
ggcorrmat 19
correlation coefficients, statistic values, p-values, no. of observations, etc.) will

be returned.
matrix.type Character, "upper" (default), "lower", or "full", display full matrix, lower
triangular or upper triangular matrix.
• "parametric"
• "nonparametric"
• "robust"
• "bayes"
might help.
partial Can be TRUE for partial correlations. For Bayesian partial correlations, "full"
instead of pseudo-Bayesian partial correlations (i.e., Bayesian correlation based
on frequentist partialization) are returned.
sig.level Significance level (Default: 0.05). If the p-value in p-value matrix is bigger
than sig.level, then the corresponding correlation coefficient is regarded as
insignificant and flagged as such in the plot. Relevant only when output =
"plot".
intervals (0.95).
ing Bayes factors and posterior estimates.
p.adjust.method
"fdr", "none".
pch Decides the point shape to be used for insignificant correlation coefficients (only
valid when insig = "pch"). Default: pch = "cross".
ggcorrplot.args
A list of additional (mostly aesthetic) arguments that will be passed to ggcorrplot::ggcorrplot
function. The list should avoid any of the following arguments since they are
already internally being used: corr, method, p.mat, sig.level, ggtheme,
colors, lab, pch, legend.title, digits.
colors A vector of 3 colors for low, mid, and high correlation values. If set to NULL,
manual specification of colors will be turned off and 3 colors from the specified
palette from package will be selected.
etc.).
20 ggcorrmat
ggstatsplot.layer
ggplot.component
References
https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/ggcorrmat.html
See Also
grouped_ggcorrmat ggscatterstats grouped_ggscatterstats
Examples
set.seed(123)
# if `cor.vars` not specified, all numeric variables used

ggstatsplot::ggcorrmat(iris)
# to get the correlation matrix

# note that the function will run even if the vector with variable names is
# not of same length as the number of variables
ggstatsplot::ggcorrmat(
data = ggplot2::msleep,
type = "robust",
cor.vars = sleep_total:bodywt,
cor.vars.names = c("total sleep", "REM sleep"),
matrix.type = "lower"
)
# to get the correlation analyses results in a dataframe

ggstatsplot::ggcorrmat(
cor.vars = sleep_total:bodywt,
partial = TRUE,
output = "dataframe"
)
ggdotplotstats 21
ggdotplotstats Dot plot/chart for labeled numeric data.
Description
Maturing
A dot chart (as described by William S. Cleveland) with statistical details from one-sample test
included in the plot as a subtitle.
Usage
ggdotplotstats(
data,
x,
y,
xlab = NULL,
ylab = NULL,
title = NULL,
subtitle = NULL,
caption = NULL,
test.value = 0,
bf.prior = 0.707,
bf.message = TRUE,
effsize.type = "g",
conf.level = 0.95,
nboot = 100,
tr = 0.2,
k = 2,
point.args = list(color = "black", size = 3, shape = 16),
centrality.line.args = list(color = "blue", size = 1),
output = "plot",
...
)
Arguments
x A numeric variable from the dataframe data.
y Label or grouping variable.
xlab Labels for x and y axis variables. If NULL (default), variable names for x and y
will be used.
22 ggdotplotstats
ylab Labels for x- and y- axis variables, respectively (Defaults: "regression coefficient"
and "term").
• "parametric"
• "nonparametric"
• "robust"
• "bayes"
test.value A number indicating the true value of the mean (Default: 0).
effsize.type Type of effect size needed for parametric tests. The argument can be "d" (for
Cohen’s d) or "g" (for Hedge’s g).
conf.level Confidence/Credible Interval (CI) level. Default to 0.95 (95%).
might help.
results.subtitle
point.args A list of additional aesthetic arguments passed to geom_point.
centrality.plotting
argument.
centrality.type
ggdotplotstats 23

centrality.line.args
A list of additional aesthetic arguments to be passed to the geom_line used to
display the lines corresponding to the centrality parameter.
ggplot.component
etc.).
ggstatsplot.layer
References
https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/ggdotplotstats.html
See Also
grouped_gghistostats, gghistostats, grouped_ggdotplotstats
Examples
set.seed(123)
# plot
ggdotplotstats(
data = ggplot2::mpg,
x = cty,
y = manufacturer,
title = "Fuel economy data",
xlab = "city miles per gallon",
caption = substitute(
paste(italic("Source"), ": EPA dataset on http://fueleconomy.gov")
)
)
24 gghistostats
gghistostats Histogram for distribution of a numeric variable
Description
Maturing
Histogram with statistical details from one-sample test included in the plot as a subtitle.
Usage
gghistostats(
data,
x,
binwidth = NULL,
xlab = NULL,
title = NULL,
subtitle = NULL,
caption = NULL,
test.value = 0,
bf.prior = 0.707,
bf.message = TRUE,
effsize.type = "g",
conf.level = 0.95,
nboot = 100,
tr = 0.2,
k = 2L,
bar.fill = "grey50",
centrality.line.args = list(size = 1, color = "blue"),
normal.curve = FALSE,
normal.curve.args = list(size = 2),
output = "plot",
...
)
Arguments
binwidth The width of the histogram bins. Can be specified as a numeric value, or a
function that calculates width from x. The default is to use the max(x) -min(x)
/ sqrt(N). You should always check this value and explore multiple widths to
find the best to illustrate the stories in your data.
gghistostats 25
will be used.
• "parametric"
• "nonparametric"
• "robust"
• "bayes"
effsize.type Type of effect size needed for parametric tests. The argument can be "d" (for
Cohen’s d) or "g" (for Hedge’s g).
might help.
etc.).
ggstatsplot.layer
bar.fill Character input that decides which color will uniformly fill all the bars in the
histogram (Default: "grey50").
results.subtitle
centrality.plotting
26 gghistostats

argument.
centrality.type
centrality.line.args
A list of additional aesthetic arguments to be passed to the geom_line used to
display the lines corresponding to the centrality parameter.
normal.curve A logical value that decides whether to super-impose a normal curve using
stats::dnorm(mean(x),sd(x)). Default is FALSE.
normal.curve.args
A list of additional aesthetic arguments to be passed to the normal curve.
ggplot.component
References
https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/gghistostats.html
See Also
grouped_gghistostats, ggdotplotstats, grouped_ggdotplotstats
Examples
set.seed(123)
# using defaults, but modifying which centrality parameter is to be shown

gghistostats(
data = ToothGrowth,
x = len,
xlab = "Tooth length",
centrality.type = "np"
)
ggpiestats 27
ggpiestats Pie charts with statistical tests
Description
Maturing
Pie charts for categorical data with statistical details included in the plot as a subtitle.
Usage
ggpiestats(
data,
x,
y = NULL,
counts = NULL,
paired = FALSE,
label = "percentage",
label.args = list(direction = "both"),
label.repel = FALSE,
k = 2L,
proportion.test = TRUE,
perc.k = 0,
bf.message = TRUE,
ratio = NULL,
conf.level = 0.95,
sampling.plan = "indepMulti",
fixed.margin = "rows",
prior.concentration = 1,
title = NULL,
subtitle = NULL,
caption = NULL,
legend.title = NULL,
palette = "Dark2",
output = "plot",
...
)
Arguments
28 ggpiestats
• "parametric"
• "nonparametric"
• "robust"
• "bayes"
paired Logical indicating whether data came from a within-subjects or repeated mea-
sures design study (Default: FALSE). If TRUE, McNemar’s test expression will be
returned. If FALSE, Pearson’s chi-square test will be returned.
results.subtitle
label Character decides what information needs to be displayed on the label in each
pie slice. Possible options are "percentage" (default), "counts", "both".
label.repel Whether labels should be repelled using ggrepel package. This can be helpful
in case the labels are overlapping.
proportion.test
Decides whether proportion test for x variable is to be carried out for each level
of y (Default: TRUE). In ggbarstats, only p-values from this test will be dis-
played.
perc.k Numeric that decides number of decimal places for percentage labels (Default:
0).
ratio A vector of proportions: the expected proportions for the proportion test (should
sum to 1). Default is NULL, which means the null is equal theoretical proportions
across the levels of the nominal variable. This means if there are two levels
this will be ratio = c(0.5,0.5) or if there are four levels this will be ratio =
c(0.25,0.25,0.25,0.25), etc.
intervals (0.95).
sampling.plan Character describing the sampling plan. Possible options are "indepMulti"
(independent multinomial; default), "poisson", "jointMulti" (joint multino-
mial), "hypergeom" (hypergeometric). For more, see ?BayesFactor::contingencyTableBF().
fixed.margin For the independent multinomial sampling plan, which margin is fixed ("rows"
or "cols"). Defaults to "rows".
ggpiestats 29
prior.concentration
Specifies the prior concentration parameter, set to 1 by default. It indexes the ex-
pected deviation from the null hypothesis under the alternative, and corresponds
to Gunel and Dickey’s (1974) "a" parameter.
etc.).
ggstatsplot.layer
ggplot.component
References
https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/ggpiestats.html
See Also
grouped_ggpiestats, ggbarstats, grouped_ggbarstats
Examples
set.seed(123)
# one sample goodness of fit proportion test

ggstatsplot::ggpiestats(ggplot2::msleep, vore)
# association test (or contingency table analysis)

ggstatsplot::ggpiestats(
30 ggscatterstats
data = mtcars,
x = vs,
y = cyl
)
ggscatterstats Scatterplot with marginal distributions and statistical results
Description
Maturing
Scatterplots from ggplot2 combined with marginal histograms/boxplots/density plots with statisti-
cal details added as a subtitle.
Usage
ggscatterstats(
data,
x,
y,
conf.level = 0.95,
bf.prior = 0.707,
bf.message = TRUE,
tr = 0.2,
k = 2L,
label.var = NULL,
label.expression = NULL,
point.args = list(size = 3, alpha = 0.4),
point.width.jitter = 0,
point.height.jitter = 0,
point.label.args = list(size = 3),
smooth.line.args = list(size = 1.5, color = "blue"),
marginal = TRUE,
marginal.type = "densigram",
marginal.size = 5,
xfill = "#009E73",
yfill = "#D55E00",
xlab = NULL,
ylab = NULL,
title = NULL,
subtitle = NULL,
caption = NULL,
output = "plot",
...
)
ggscatterstats 31
Arguments
x The column in data containing the explanatory variable to be plotted on the x-
axis. Can be entered either as a character string (e.g., "x") or as a bare expression
(e.g, x).
y The column in data containing the response (outcome) variable to be plotted on
the y-axis. Can be entered either as a character string (e.g., "y") or as a bare
expression (e.g, y).
• "parametric"
• "nonparametric"
• "robust"
• "bayes"
intervals (0.95).
might help.
results.subtitle
label.var Variable to use for points labels. Can be entered either as a bare expression (e.g,
var1) or as a string (e.g., "var1").
label.expression
An expression evaluating to a logical vector that determines the subset of data
points to label. This argument can be entered either as a bare expression (e.g., y
< 4 & z < 20) or as a string (e.g., "y < 4 & z < 20").
point.args A list of additional aesthetic arguments to be passed to ggplot2::geom_point
geom used to display the raw data points.
point.width.jitter, point.height.jitter
Degree of jitter in x and y direction, respectively. Defaults to 0 (0%) of the reso-
lution of the data. Note that the jitter should not be specified in the point.args
because this information will be passed to two different geoms: one displaying
the points and the other displaying the *labels for these points.
point.label.args
geom used to display the labels.
smooth.line.args
A list of additional aesthetic arguments to be passed to ggplot2::geom_smooth
geom used to display the regression line.
32 ggscatterstats
marginal Decides whether ggExtra::ggMarginal() plots will be displayed; the default

is TRUE.
marginal.type Type of marginal distribution to be plotted on the axes ("histogram", "boxplot",
"density", "violin", "densigram").
marginal.size Integer describing the relative size of the marginal plots compared to the main
plot. A size of 5 means that the main plot is 5x wider and 5x taller than the
marginal plots.
xfill, yfill Character describing color fill for x and y axes marginal distributions (default:
"#009E73" (for x) and "#D55E00" (for y)). Note that the defaults are colorblind-
friendly.
will be used.
ylab Labels for x and y axis variables. If NULL (default), variable names for x and y
will be used.
etc.).
ggstatsplot.layer
ggplot.component
Note
• If you set marginal = TRUE, the resulting plot can not be further modified with ggplot2 func-
tions since it is no longer a ggplot object. In case you want a ggplot object, set marginal =
FALSE. Also have a look at the ggplot.component argument.
• The plot uses ggrepel::geom_label_repel to attempt to keep labels from over-lapping to
the largest degree possible. As a consequence plot times will slow down massively (and the
plot file will grow in size) if you have a lot of labels that overlap.
References
https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/ggscatterstats.html
ggwithinstats 33
See Also
grouped_ggscatterstats, ggcorrmat, grouped_ggcorrmat
Examples
set.seed(123)
# creating dataframe with rownames converted to a new column

mtcars_new <- as_tibble(mtcars, rownames = "car")
# simple function call with the defaults

ggstatsplot::ggscatterstats(
data = mtcars_new,
x = wt,
y = mpg,
label.var = car,
label.expression = wt < 4 & mpg < 20,
# making further customizations with `ggplot2` functions
ggplot.component = list(ggplot2::scale_y_continuous(
limits = c(5, 35),
breaks = seq(5, 35, 5)
))
)
ggwithinstats Box/Violin plots for group or condition comparisons in within-subjects

(or repeated measures) designs.
Description
Maturing
A combination of box and violin plots along with raw (unjittered) data points for within-subjects
designs with statistical details included in the plot as a subtitle.
Usage
ggwithinstats(
data,
x,
y,
pairwise.display = "significant",
effsize.type = "unbiased",
bf.prior = 0.707,
bf.message = TRUE,
xlab = NULL,
34 ggwithinstats
ylab = NULL,
caption = NULL,
title = NULL,
subtitle = NULL,
k = 2L,
conf.level = 0.95,
nboot = 100L,
tr = 0.2,
centrality.point.args = list(size = 5, color = "darkred"),
centrality.label.args = list(size = 3, nudge_x = 0.4, segment.linetype = 4),
centrality.path = TRUE,
centrality.path.args = list(color = "red", size = 1, alpha = 0.5),
point.path = TRUE,
point.path.args = list(alpha = 0.5, linetype = "dashed"),
outlier.tagging = FALSE,
outlier.coef = 1.5,
outlier.label.args = list(size = 3),
violin.args = list(width = 0.5, alpha = 0.2),
ggsignif.args = list(textsize = 3, tip_length = 0.01),
palette = "Dark2",
output = "plot",
...
)
Arguments
• "parametric"
• "nonparametric"
• "robust"
• "bayes"
pairwise.comparisons
Logical that decides whether pairwise comparisons are to be displayed (default:
TRUE). Please note that only significant comparisons will be shown by default.
To change this behavior, select appropriate option with pairwise.display ar-
gument. The pairwise comparison dataframes are prepared using the pairwiseComparisons::pairw
function. For more details about pairwise comparisons, see the documentation
for that function.
ggwithinstats 35
pairwise.display
Decides which pairwise comparisons to display. Available options are:
• "all"
You can use this argument to make sure that your plot is not uber-cluttered when
you have multiple groups being compared and scores of pairwise comparisons
being displayed.
p.adjust.method
"fdr", "none".
effsize.type Type of effect size needed for parametric tests. The argument can be "eta"
(partial eta-squared) or "omega" (partial omega-squared).
ing Bayes factors.
results.subtitle
will be used.
ylab Labels for x and y axis variables. If NULL (default), variable names for x and y
will be used.
intervals (0.95).
might help.
centrality.plotting
argument.
36 ggwithinstats
centrality.type
centrality.point.args
centrality.label.args
centrality.path.args, point.path.args
A list of additional aesthetic arguments passed on to geom_path connecting raw
data points and mean points.
point.path, centrality.path
Logical that decides whether individual data points and means, respectively,
should be connected using geom_path. Both default to TRUE. Note that point.path
argument is relevant only when there are two groups (i.e., in case of a t-test). In
case of large number of data points, it is advisable to set point.path = FALSE
as these lines can overwhelm the plot.
outlier.tagging
Decides whether outliers should be tagged (Default: FALSE).
argument.
outlier.coef Coefficient for outlier detection using Tukey’s method. With Tukey’s method,
outliers are below (1st Quartile) or above (3rd Quartile) outlier.coef times
the Inter-Quartile Range (IQR) (Default: 1.5).
outlier.label.args
for outlier label plotting.
etc.).
ggstatsplot.layer
ggwithinstats 37
ggplot.component
Note
1. Please note that the function expects that the data is already sorted by subject/repeated mea-
sures ID.
2. To carry out Bayesian analysis for ANOVA designs, you will need to install the development
version of BayesFactor (0.9.12-4.3). You can download it by running: remotes::install_github("richarddmo
References
https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/ggwithinstats.html
See Also
grouped_ggbetweenstats, ggbetweenstats, grouped_ggwithinstats
Examples
# setup
set.seed(123)
# two groups (*t*-test)

ggwithinstats(
data = VR_dilemma,
x = modality,
y = score,
xlab = "Presentation modality",
ylab = "Proportion of utilitarian decisions"
)
# more than two groups (anova)

library(WRS2)
ggstatsplot::ggwithinstats(
data = WineTasting,
x = Wine,
y = Taste,
type = "np", # non-parametric test
outlier.tagging = TRUE,
outlier.label = Taster
)
38 grouped_ggbarstats
grouped_ggbarstats Grouped bar (column) charts with statistical tests
Description
Maturing
Helper function for ggstatsplot::ggbarstats to apply this function across multiple levels of a
given factor and combining the resulting plots using ggstatsplot::combine_plots.
Usage
grouped_ggbarstats(
data,
x,
y,
counts = NULL,
grouping.var,
output = "plot",
...
)
Arguments
grouping.var A single grouping variable (can be entered either as a bare name x or as a string
"x").
grouped_ggbarstats 39
annotation.args
... Arguments passed on to ggbarstats
xlab Custom text for the x axis label (Default: NULL, which will cause the x
axis label to be the x variable).
ylab Custom text for the y axis label (Default: NULL).
type A character specifying the type of statistical approach. Four possible op-
tions:
• "parametric"
• "nonparametric"
• "robust"
• "bayes"
Corresponding abbreviations are also accepted: "p" (for parametric), "np"
(for nonparametric), "r" (for robust), or "bf" (for Bayesian).
paired Logical indicating whether data came from a within-subjects or re-
peated measures design study (Default: FALSE). If TRUE, McNemar’s test
expression will be returned. If FALSE, Pearson’s chi-square test will be re-
turned.
results.subtitle Decides whether the results of statistical tests are to be dis-
played as a subtitle (Default: TRUE). If set to FALSE, only the plot will be
returned.
label Character decides what information needs to be displayed on the label
in each pie slice. Possible options are "percentage" (default), "counts",
"both".
proportion.test Decides whether proportion test for x variable is to be car-
ried out for each level of y (Default: TRUE). In ggbarstats, only p-values
from this test will be displayed.
perc.k Numeric that decides number of decimal places for percentage labels
(Default: 0).
bf.message Logical that decides whether to display Bayes Factor in favor of
the null hypothesis. This argument is relevant only for parametric test
(Default: TRUE).
ratio A vector of proportions: the expected proportions for the proportion test
(should sum to 1). Default is NULL, which means the null is equal theoretical
proportions across the levels of the nominal variable. This means if there
are two levels this will be ratio = c(0.5,0.5) or if there are four levels
this will be ratio = c(0.25,0.25,0.25,0.25), etc.
conf.level Scalar between 0 and 1. If unspecified, the defaults return 95%
confidence/credible intervals (0.95).
sampling.plan Character describing the sampling plan. Possible options are
"indepMulti" (independent multinomial; default), "poisson", "jointMulti"
(joint multinomial), "hypergeom" (hypergeometric). For more, see ?BayesFactor::contingenc
fixed.margin For the independent multinomial sampling plan, which margin
is fixed ("rows" or "cols"). Defaults to "rows".
prior.concentration Specifies the prior concentration parameter, set to 1 by
default. It indexes the expected deviation from the null hypothesis under the
alternative, and corresponds to Gunel and Dickey’s (1974) "a" parameter.
40 grouped_ggbetweenstats
subtitle The text for the plot subtitle. Will work only if results.subtitle
= FALSE.
etc.).
ggstatsplot.layer Logical that decides whether theme_ggstatsplot theme
elements are to be displayed along with the selected ggtheme (Default:
TRUE). theme_ggstatsplot is an opinionated theme layer that override
some aspects of the selected ggtheme.
package Name of the package from which the given palette is to be extracted.
The available palettes and packages can be checked by running View(paletteer::palettes_d_
palette Name of the package from which the given palette is to be extracted.
ggplot.component A ggplot component to be added to the plot prepared by
ggstatsplot. This argument is primarily helpful for grouped_ variants of
all primary functions. Default is NULL. The argument should be entered as
a ggplot2 function or a list of ggplot2 functions.
See Also
ggbarstats, ggpiestats, grouped_ggpiestats
Examples
set.seed(123)
# let's create a smaller dataframe

diamonds_short <- ggplot2::diamonds %>%
dplyr::filter(.data = ., cut %in% c("Very Good", "Ideal")) %>%
dplyr::filter(.data = ., clarity %in% c("SI1", "SI2", "VS1", "VS2")) %>%
dplyr::sample_frac(tbl = ., size = 0.05)
# plot
# let's skip statistical analysis
ggstatsplot::grouped_ggbarstats(
data = diamonds_short,
x = color,
y = clarity,
grouping.var = cut,
plotgrid.args = list(nrow = 2)
)
grouped_ggbetweenstats
Violin plots for group or condition comparisons in between-subjects
designs repeated across all levels of a grouping variable.
grouped_ggbetweenstats 41
Description
Maturing
Helper function for ggstatsplot::ggbetweenstats to apply this function across multiple levels
of a given factor and combining the resulting plots using ggstatsplot::combine_plots.
Usage
grouped_ggbetweenstats(
data,
x,
y,
grouping.var,
output = "plot",
...
)
Arguments
"x").
argument.
annotation.args
... Arguments passed on to ggbetweenstats
plot.type Character describing the type of plot. Currently supported plots are
"box" (for only boxplots), "violin" (for only violin plots), and "boxviolin"
(for a combination of box and violin plots; default).
xlab Labels for x and y axis variables. If NULL (default), variable names for x
and y will be used.
ylab Labels for x and y axis variables. If NULL (default), variable names for x
and y will be used.
42 grouped_ggbetweenstats
pairwise.comparisons Logical that decides whether pairwise comparisons

are to be displayed (default: TRUE). Please note that only significant com-
parisons will be shown by default. To change this behavior, select appro-
priate option with pairwise.display argument. The pairwise comparison
dataframes are prepared using the pairwiseComparisons::pairwise_comparisons
function. For more details about pairwise comparisons, see the documenta-
tion for that function.
p.adjust.method Adjustment method for p-values for multiple comparisons.
Possible methods are: "holm" (default), "hochberg", "hommel", "bonferroni",
"BH", "BY", "fdr", "none".
pairwise.display Decides which pairwise comparisons to display. Available
options are:
• "all"
You can use this argument to make sure that your plot is not uber-cluttered
when you have multiple groups being compared and scores of pairwise
comparisons being displayed.
bf.prior A number between 0.5 and 2 (default 0.707), the prior width to use
in calculating Bayes factors.
(Default: TRUE).
returned.
= FALSE.
outlier.color Default aesthetics for outliers (Default: "black").
outlier.tagging Decides whether outliers should be tagged (Default: FALSE).
outlier.shape Hiding the outliers can be achieved by setting outlier.shape
= NA. Importantly, this does not remove the outliers, it only hides them, so
the range calculated for the y-axis will be the same with outliers shown and
outliers hidden.
outlier.label.args A list of additional aesthetic arguments to be passed to
ggrepel::geom_label_repel for outlier label plotting.
outlier.coef Coefficient for outlier detection using Tukey’s method. With
Tukey’s method, outliers are below (1st Quartile) or above (3rd Quartile)
outlier.coef times the Inter-Quartile Range (IQR) (Default: 1.5).
centrality.plotting Logical that decides whether centrality tendency mea-
sure is to be displayed as a point with a label (Default: TRUE). Function
decides which central tendency measure to show depending on the type
argument.
argument.
grouped_ggbetweenstats 43
centrality.type Decides which centrality parameter is to be displayed. The

default is to choose the same as type argument. You can specify this to be:
point.args A list of additional aesthetic arguments to be passed to the geom_point
displaying the raw data.
centrality.point.args A list of additional aesthetic arguments to be passed
to ggplot2::geom_point and ggrepel::geom_label_repel geoms, which
are involved in mean plotting.
centrality.label.args A list of additional aesthetic arguments to be passed
etc.).
tions:
• "parametric"
• "nonparametric"
• "robust"
• "bayes"
effsize.type Type of effect size needed for parametric tests. The argument
can be "eta" (partial eta-squared) or "omega" (partial omega-squared).
var.equal a logical variable indicating whether to treat the two variances as
being equal. If TRUE then the pooled variance is used to estimate the vari-
ance otherwise the Welch (or Satterthwaite) approximation to the degrees
of freedom is used.
44 grouped_ggcorrmat

nboot Number of bootstrap samples for computing confidence interval for the
effect size (Default: 100).
try reducing the value of tr, which is by default set to 0.2. Lowering the
value might help.
References
https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/ggbetweenstats.html
See Also
ggbetweenstats, ggwithinstats, grouped_ggwithinstats
Examples
set.seed(123)
# the most basic function call

ggstatsplot::grouped_ggbetweenstats(
data = dplyr::filter(ggplot2::mpg, drv != "4"),
x = year,
y = hwy,
grouping.var = drv,
conf.level = 0.99
)
# modifying individual plots using `ggplot.component` argument

ggstatsplot::grouped_ggbetweenstats(
data = dplyr::filter(
ggstatsplot::movies_long,
genre %in% c("Action", "Comedy"),
mpaa %in% c("R", "PG")
),
x = genre,
y = rating,
grouping.var = mpaa,
results.subtitle = FALSE,
ggplot.component = ggplot2::scale_y_continuous(
breaks = seq(1, 9, 1),
limits = (c(1, 9))
)
)
grouped_ggcorrmat Visualization of a correlalogram (or correlation matrix) for all levels

of a grouping variable
grouped_ggcorrmat 45
Description
Maturing
Helper function for ggstatsplot::ggcorrmat to apply this function across multiple levels of a
Usage
grouped_ggcorrmat(
data,
cor.vars = NULL,
grouping.var,
output = "plot",
...
)
Arguments
data Dataframe from which variables specified are preferentially to be taken.
cor.vars List of variables for which the correlation matrix is to be computed and visual-
ized. If NULL (default), all numeric variables from data will be used.
"x").
output Character that decides expected output from this function. If "plot", the visu-
alization matrix will be returned. If "dataframe" (or literally anything other
than "plot"), a dataframe containing all details from statistical analyses (e.g.,
correlation coefficients, statistic values, p-values, no. of observations, etc.) will
be returned.
annotation.args
... Arguments passed on to ggcorrmat
cor.vars.names Optional list of names to be used for cor.vars. The names
should be entered in the same order.
partial Can be TRUE for partial correlations. For Bayesian partial correlations,
"full" instead of pseudo-Bayesian partial correlations (i.e., Bayesian corre-
lation based on frequentist partialization) are returned.
matrix.type Character, "upper" (default), "lower", or "full", display full
matrix, lower triangular or upper triangular matrix.
sig.level Significance level (Default: 0.05). If the p-value in p-value matrix
is bigger than sig.level, then the corresponding correlation coefficient
is regarded as insignificant and flagged as such in the plot. Relevant only
when output = "plot".
colors A vector of 3 colors for low, mid, and high correlation values. If set to
NULL, manual specification of colors will be turned off and 3 colors from
the specified palette from package will be selected.
pch Decides the point shape to be used for insignificant correlation coefficients
(only valid when insig = "pch"). Default: pch = "cross".
46 grouped_ggcorrmat
ggcorrplot.args A list of additional (mostly aesthetic) arguments that will be

passed to ggcorrplot::ggcorrplot function. The list should avoid any of
the following arguments since they are already internally being used: corr,
method, p.mat, sig.level, ggtheme, colors, lab, pch, legend.title,
digits.
tions:
• "parametric"
• "nonparametric"
• "robust"
• "bayes"
value might help.
in calculating Bayes factors and posterior estimates.
etc.).
= FALSE.
References
https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/ggcorrmat.html
See Also
ggcorrmat, ggscatterstats, grouped_ggscatterstats
grouped_ggdotplotstats 47
Examples
set.seed(123)
# for plot
ggstatsplot::grouped_ggcorrmat(
data = iris,
grouping.var = Species,
type = "robust",
p.adjust.method = "holm"
)
# for dataframe
ggstatsplot::grouped_ggcorrmat(
grouping.var = vore,
type = "bayes",
output = "dataframe"
)
grouped_ggdotplotstats
Grouped histograms for distribution of a labeled numeric variable
Description
Maturing
Helper function for ggstatsplot::ggdotplotstats to apply this function across multiple levels
of a given factor and combining the resulting plots using ggstatsplot::combine_plots.
Usage
grouped_ggdotplotstats(
data,
x,
y,
grouping.var,
output = "plot",
...
)
Arguments
y Label or grouping variable.
48 grouped_ggdotplotstats
"x").
annotation.args
... Arguments passed on to ggdotplotstats
point.args A list of additional aesthetic arguments passed to geom_point.
tions:
• "parametric"
• "nonparametric"
• "robust"
• "bayes"
value might help.
centrality.line.args A list of additional aesthetic arguments to be passed
to the geom_line used to display the lines corresponding to the centrality
parameter.
and y will be used.
= FALSE.
(Default: TRUE).
can be "d" (for Cohen’s d) or "g" (for Hedge’s g).
returned.
grouped_ggdotplotstats 49

argument.
argument.
etc.).
ylab Labels for x- and y- axis variables, respectively (Defaults: "regression
coefficient" and "term").
References
https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/ggdotplotstats.html
See Also
grouped_gghistostats, ggdotplotstats, gghistostats
Examples
set.seed(123)
# removing factor level with very few no. of observations

df <- dplyr::filter(.data = ggplot2::mpg, cyl %in% c("4", "6", "8"))
# plot
ggstatsplot::grouped_ggdotplotstats(
data = df,
x = cty,
50 grouped_gghistostats
y = manufacturer,
grouping.var = cyl,
test.value = 15.5,
ggplot.component = ggplot2::scale_x_continuous(
sec.axis = ggplot2::dup_axis(),
limits = c(12, 24),
breaks = seq(12, 24, 2)
)
)
grouped_gghistostats Grouped histograms for distribution of a numeric variable
Description
Maturing
Helper function for ggstatsplot::gghistostats to apply this function across multiple levels of
a given factor and combining the resulting plots using ggstatsplot::combine_plots.
Usage
grouped_gghistostats(
data,
x,
grouping.var,
binwidth = NULL,
output = "plot",
...
)
Arguments
"x").
binwidth The width of the histogram bins. Can be specified as a numeric value, or a
function that calculates width from x. The default is to use the max(x) -min(x)
/ sqrt(N). You should always check this value and explore multiple widths to
find the best to illustrate the stories in your data.
grouped_gghistostats 51

annotation.args
... Arguments passed on to gghistostats
normal.curve A logical value that decides whether to super-impose a normal
curve using stats::dnorm(mean(x),sd(x)). Default is FALSE.
normal.curve.args A list of additional aesthetic arguments to be passed to
the normal curve.
bar.fill Character input that decides which color will uniformly fill all the
bars in the histogram (Default: "grey50").
tions:
• "parametric"
• "nonparametric"
• "robust"
• "bayes"
can be "d" (for Cohen’s d) or "g" (for Hedge’s g).
value might help.
centrality.line.args A list of additional aesthetic arguments to be passed
to the geom_line used to display the lines corresponding to the centrality
parameter.
and y will be used.
= FALSE.
(Default: TRUE).
etc.).
52 grouped_gghistostats

returned.
argument.
argument.
References
https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/gghistostats.html
See Also
gghistostats, ggdotplotstats, grouped_ggdotplotstats
Examples
set.seed(123)
# plot
ggstatsplot::grouped_gghistostats(
data = iris,
x = Sepal.Length,
test.value = 5,
grouping.var = Species,
bar.fill = "orange",
ggplot.component = list(
ggplot2::scale_x_continuous(breaks = seq(3, 9, 1), limits = (c(3, 9))),
ggplot2::scale_y_continuous(breaks = seq(0, 25, 5), limits = (c(0, 25)))
),
plotgrid.args = list(nrow = 1),
annotation.args = list(tag_levels = "i"),
)
grouped_ggpiestats 53
grouped_ggpiestats Grouped pie charts with statistical tests
Description
Helper function for ggstatsplot::ggpiestats to apply this function across multiple levels of a
Usage
grouped_ggpiestats(
data,
x,
y = NULL,
counts = NULL,
grouping.var,
output = "plot",
...
)
Arguments
"x").
annotation.args
... Arguments passed on to ggpiestats
54 grouped_ggpiestats
proportion.test Decides whether proportion test for x variable is to be car-

ried out for each level of y (Default: TRUE). In ggbarstats, only p-values
from this test will be displayed.
perc.k Numeric that decides number of decimal places for percentage labels
(Default: 0).
label Character decides what information needs to be displayed on the label
in each pie slice. Possible options are "percentage" (default), "counts",
"both".
label.repel Whether labels should be repelled using ggrepel package. This
can be helpful in case the labels are overlapping.
tions:
• "parametric"
• "nonparametric"
• "robust"
• "bayes"
returned.
(Default: TRUE).
= FALSE.
etc.).
grouped_ggscatterstats 55
paired Logical indicating whether data came from a within-subjects or re-

peated measures design study (Default: FALSE). If TRUE, McNemar’s test
expression will be returned. If FALSE, Pearson’s chi-square test will be re-
turned.
ratio A vector of proportions: the expected proportions for the proportion test
(should sum to 1). Default is NULL, which means the null is equal theoretical
proportions across the levels of the nominal variable. This means if there
are two levels this will be ratio = c(0.5,0.5) or if there are four levels
this will be ratio = c(0.25,0.25,0.25,0.25), etc.
sampling.plan Character describing the sampling plan. Possible options are
"indepMulti" (independent multinomial; default), "poisson", "jointMulti"
(joint multinomial), "hypergeom" (hypergeometric). For more, see ?BayesFactor::contingenc
fixed.margin For the independent multinomial sampling plan, which margin
is fixed ("rows" or "cols"). Defaults to "rows".
prior.concentration Specifies the prior concentration parameter, set to 1 by
default. It indexes the expected deviation from the null hypothesis under the
alternative, and corresponds to Gunel and Dickey’s (1974) "a" parameter.
References
https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/ggpiestats.html
See Also
ggbarstats, ggpiestats, grouped_ggbarstats
Examples
# grouped one-sample proportion test
# let's skip statistical analysis
ggstatsplot::grouped_ggpiestats(
data = mtcars,
grouping.var = am,
x = cyl
)
grouped_ggscatterstats
Scatterplot with marginal distributions for all levels of a grouping
variable
Description
Maturing
Grouped scatterplots from ggplot2 combined with marginal distribution plots with statistical de-
tails added as a subtitle.
56 grouped_ggscatterstats
Usage
grouped_ggscatterstats(
data,
x,
y,
grouping.var,
label.var = NULL,
label.expression = NULL,
output = "plot",
...
)
Arguments
x The column in data containing the explanatory variable to be plotted on the x-
axis. Can be entered either as a character string (e.g., "x") or as a bare expression
(e.g, x).
y The column in data containing the response (outcome) variable to be plotted on
the y-axis. Can be entered either as a character string (e.g., "y") or as a bare
expression (e.g, y).
"x").
label.var Variable to use for points labels. Can be entered either as a bare expression (e.g,
var1) or as a string (e.g., "var1").
label.expression
An expression evaluating to a logical vector that determines the subset of data
points to label. This argument can be entered either as a bare expression (e.g., y
< 4 & z < 20) or as a string (e.g., "y < 4 & z < 20").
annotation.args
... Arguments passed on to ggscatterstats
point.label.args A list of additional aesthetic arguments to be passed to
ggrepel::geom_label_repel geom used to display the labels.
smooth.line.args A list of additional aesthetic arguments to be passed to
ggplot2::geom_smooth geom used to display the regression line.
point.args A list of additional aesthetic arguments to be passed to ggplot2::geom_point
geom used to display the raw data points.
grouped_ggscatterstats 57
marginal Decides whether ggExtra::ggMarginal() plots will be displayed;

the default is TRUE.
point.width.jitter Degree of jitter in x and y direction, respectively. De-
faults to 0 (0%) of the resolution of the data. Note that the jitter should not
be specified in the point.args because this information will be passed to
two different geoms: one displaying the points and the other displaying the
*labels for these points.
point.height.jitter Degree of jitter in x and y direction, respectively. De-
faults to 0 (0%) of the resolution of the data. Note that the jitter should not
be specified in the point.args because this information will be passed to
two different geoms: one displaying the points and the other displaying the
*labels for these points.
marginal.type Type of marginal distribution to be plotted on the axes ("histogram",
"boxplot", "density", "violin", "densigram").
marginal.size Integer describing the relative size of the marginal plots com-
pared to the main plot. A size of 5 means that the main plot is 5x wider and
5x taller than the marginal plots.
xfill Character describing color fill for x and y axes marginal distributions
(default: "#009E73" (for x) and "#D55E00" (for y)). Note that the defaults
are colorblind-friendly.
yfill Character describing color fill for x and y axes marginal distributions
(default: "#009E73" (for x) and "#D55E00" (for y)). Note that the defaults
are colorblind-friendly.
tions:
• "parametric"
• "nonparametric"
• "robust"
• "bayes"
value might help.
etc.).
(Default: TRUE).
58 grouped_ggscatterstats

returned.
and y will be used.
and y will be used.
= FALSE.
References
https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/ggscatterstats.html
See Also
ggscatterstats, ggcorrmat, grouped_ggcorrmat
Examples
# to ensure reproducibility
set.seed(123)
# basic function call

data = dplyr::filter(movies_long, genre == "Comedy" | genre == "Drama"),
x = length,
y = rating,
type = "robust",
grouping.var = genre
)
# using labeling
# (also show how to modify basic plot from within function call)
data = dplyr::filter(ggplot2::mpg, cyl != 5),
x = displ,
y = hwy,
grouping.var = cyl,
type = "robust",
label.var = manufacturer,
label.expression = hwy > 25 & displ > 2.5,
ggplot.component = ggplot2::scale_y_continuous(sec.axis = ggplot2::dup_axis())
)
# labeling without expression
data = dplyr::filter(
grouped_ggwithinstats 59
.data = movies_long,
rating == 7,
genre %in% c("Drama", "Comedy")
),
x = budget,
y = length,
grouping.var = genre,
bf.message = FALSE,
label.var = "title",
marginal = FALSE,
annotation.args = list(caption = "All movies have IMDB rating greater than 7")
)
grouped_ggwithinstats Violin plots for group or condition comparisons in within-subjects de-

signs repeated across all levels of a grouping variable.
Description
A combined plot of comparison plot created for levels of a grouping variable.
Usage
grouped_ggwithinstats(
data,
x,
y,
grouping.var,
output = "plot",
...
)
Arguments
"x").
argument.
60 grouped_ggwithinstats

annotation.args
... Arguments passed on to ggwithinstats
point.path Logical that decides whether individual data points and means, re-
spectively, should be connected using geom_path. Both default to TRUE.
Note that point.path argument is relevant only when there are two groups
(i.e., in case of a t-test). In case of large number of data points, it is advis-
able to set point.path = FALSE as these lines can overwhelm the plot.
centrality.path Logical that decides whether individual data points and means,
respectively, should be connected using geom_path. Both default to TRUE.
Note that point.path argument is relevant only when there are two groups
(i.e., in case of a t-test). In case of large number of data points, it is advis-
able to set point.path = FALSE as these lines can overwhelm the plot.
centrality.path.args A list of additional aesthetic arguments passed on to
geom_path connecting raw data points and mean points.
point.path.args A list of additional aesthetic arguments passed on to geom_path
connecting raw data points and mean points.
tions:
• "parametric"
• "nonparametric"
• "robust"
• "bayes"
pairwise.comparisons Logical that decides whether pairwise comparisons
are to be displayed (default: TRUE). Please note that only significant com-
parisons will be shown by default. To change this behavior, select appro-
priate option with pairwise.display argument. The pairwise comparison
dataframes are prepared using the pairwiseComparisons::pairwise_comparisons
function. For more details about pairwise comparisons, see the documenta-
tion for that function.
pairwise.display Decides which pairwise comparisons to display. Available
options are:
• "all"
You can use this argument to make sure that your plot is not uber-cluttered
when you have multiple groups being compared and scores of pairwise
comparisons being displayed.
can be "eta" (partial eta-squared) or "omega" (partial omega-squared).
in calculating Bayes factors.
grouped_ggwithinstats 61

(Default: TRUE).
returned.
and y will be used.
and y will be used.
= FALSE.
value might help.
argument.
argument.
centrality.point.args A list of additional aesthetic arguments to be passed
centrality.label.args A list of additional aesthetic arguments to be passed
outlier.tagging Decides whether outliers should be tagged (Default: FALSE).
outlier.coef Coefficient for outlier detection using Tukey’s method. With
Tukey’s method, outliers are below (1st Quartile) or above (3rd Quartile)
outlier.coef times the Inter-Quartile Range (IQR) (Default: 1.5).
62 grouped_ggwithinstats
outlier.label.args A list of additional aesthetic arguments to be passed to

ggrepel::geom_label_repel for outlier label plotting.
etc.).
References
https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/ggwithinstats.html
See Also
ggwithinstats, ggbetweenstats, grouped_ggbetweenstats
Examples
set.seed(123)
# the most basic function call

grouped_ggwithinstats(
data = VR_dilemma,
x = modality,
y = score,
grouping.var = order,
type = "np", # non-parametric test
# additional modifications for **each** plot using `ggplot2` functions
ggplot.component = ggplot2::scale_y_continuous(
breaks = seq(0, 1, 0.1),
limits = c(0, 1)
)
)
iris_long 63
iris_long Edgar Anderson’s Iris Data in long format.
Description
Edgar Anderson’s Iris Data in long format.
Usage
iris_long
Format
• id. Dummy identity number for each flower (150 flowers in total).
• Species. The species are Iris setosa, versicolor, and virginica.
• condition. Factor giving a detailed description of the attribute (Four levels: "Petal.Length",
"Petal.Width", "Sepal.Length", "Sepal.Width").
• attribute. What attribute is being measured ("Sepal" or "Pepal").
• measure. What aspect of the attribute is being measured ("Length" or "Width").
• value. Value of the measurement.
Details
This famous (Fisher’s or Anderson’s) iris data set gives the measurements in centimeters of the
variables sepal length and width and petal length and width, respectively, for 50 flowers from each
of 3 species of iris. The species are Iris setosa, versicolor, and virginica.
This is a modified dataset from datasets package.
Examples
dim(iris_long)
head(iris_long)
dplyr::glimpse(iris_long)
movies_long Movie information and user ratings from IMDB.com (long format).
Description
Movie information and user ratings from IMDB.com (long format).
Usage
movies_long
64 movies_wide
Format
A data frame with 1,579 rows and 8 variables
• title. Title of the movie.

• year. Year of release.
• budget. Total budget (if known) in US dollars
• length. Length in minutes.
• rating. Average IMDB user rating.
• votes. Number of IMDB users who rated this movie.
• mpaa. MPAA rating.
• genre. Different genres of movies (action, animation, comedy, drama, documentary, romance,
short).
Details
Modified dataset from ggplot2movies package.

The internet movie database, https://imdb.com/, is a website devoted to collecting movie data
supplied by studios and fans. It claims to be the biggest movie database on the web and is run by
amazon.
Movies were are identical to those selected for inclusion in movies_wide but this dataset has been
constructed such that every movie appears in one and only one genre category.
Source
https://CRAN.R-project.org/package=ggplot2movies
Examples
dim(movies_long)
head(movies_long)
dplyr::glimpse(movies_long)
movies_wide Movie information and user ratings from IMDB.com (wide format).
Description
Movie information and user ratings from IMDB.com (wide format).
Usage
movies_wide
theme_ggstatsplot 65
Format
A data frame with 1,579 rows and 13 variables
• title. Title of the movie.
• year. Year of release.
• budget. Total budget in millions of US dollars
• length. Length in minutes.
• rating. Average IMDB user rating.
• votes. Number of IMDB users who rated this movie.
• mpaa. MPAA rating.
• action, animation, comedy, drama, documentary, romance, short. Binary variables represent-
ing if movie was classified as belonging to that genre.
• NumGenre. The number of different genres a film was classified in an integer between one
and four
Details
Modified dataset from ggplot2movies package.
The internet movie database, https://imdb.com/, is a website devoted to collecting movie data
supplied by studios and fans. It claims to be the biggest movie database on the web and is run by
amazon.
Movies were selected for inclusion if they had a known length and had been rated by at least one
imdb user. Small categories such as documentaries and NC-17 movies were removed.
Source
https://CRAN.R-project.org/package=ggplot2movies
Examples
dim(movies_wide)
head(movies_wide)
dplyr::glimpse(movies_wide)
theme_ggstatsplot Default theme used in all ggstatsplot package plots
Description
Maturing
Common theme used across all plots generated in ggstatsplot and assumed by the author to be
aesthetically pleasing to the user/reader.
Usage
theme_ggstatsplot(ggtheme = ggplot2::theme_bw(), ggstatsplot.layer = TRUE)
theme_corrmat()
theme_pie(ggtheme = ggplot2::theme_bw(), ggstatsplot.layer = TRUE)

66 Titanic_full
Arguments
etc.).
ggstatsplot.layer
Value
A ggplot2 object with the theme_ggstatsplot theme overlaid.
References
https://indrajeetpatil.github.io/ggstatsplot/articles/web_only/theme_ggstatsplot.
html
Titanic_full Titanic dataset.
Description
Titanic dataset.
Usage
Titanic_full
Format
• id. Dummy identity number for each person.
• Class. 1st, 2nd, 3rd, Crew.
• Sex. Male, Female.
• Age. Child, Adult.
• Survived. No, Yes.
Details
This data set provides information on the fate of passengers on the fatal maiden voyage of the ocean
liner ’Titanic’, summarized according to economic status (class), sex, age and survival.
This is a modified dataset from datasets package.
Examples
dim(Titanic_full)
head(Titanic_full)
dplyr::glimpse(Titanic_full)
VR_dilemma 67
VR_dilemma Virtual reality moral dilemmas.
Description
Virtual reality moral dilemmas.
Usage
VR_dilemma
Format
• id. Dummy identity number for each participant.
• order. The order in which the participants completed the two sessions: "text_first" (0) or
"text_second" (1).
• modality. Describes how the moral dilemmas were presented to the participants: either in text
format ("text") or in Virtual Reality ("vr").
• score. Proportion of "utilitarian" decisions. In other words, of the 4 decisions, how many
affirmative were responses. Range: 0 (all utilitarian) - 1 (none utilitarian).
Details
Dataset from a study where participants completed identical moral dilemmas in two different ses-
sions held on separate days: in one session, they read text description of the scenario, while
in another session they completed the same scenarios in Virtual Reality (videos: https://www.
youtube.com/watch?v=ebdU3HhhYs8). The study investigated if there was a discrepancy between
how people judged the same scenarios while reading them in text versus experiencing them in vir-
tual reality.
Source
https://psyarxiv.com/ry3ap/
Examples
dim(VR_dilemma)
head(VR_dilemma)
dplyr::glimpse(VR_dilemma)
Index
∗ datasets VR_dilemma, 67
bugs_long, 4
bugs_wide, 5
iris_long, 63
movies_long, 63
movies_wide, 64
Titanic_full, 66
VR_dilemma, 67
_PACKAGE (ggstatsplot-package), 3
bugs_long, 4
bugs_wide, 5
combine_plots, 6
ggbarstats, 7, 29, 39, 40, 55

ggbetweenstats, 10, 37, 41, 44, 62
ggcoefstats, 14
ggcorrmat, 18, 33, 45, 46, 58
ggdotplotstats, 21, 26, 48, 49, 52
gghistostats, 23, 24, 49, 51, 52
ggpiestats, 10, 27, 40, 53, 55
ggscatterstats, 20, 30, 46, 56, 58
ggstatsplot (ggstatsplot-package), 3
ggstatsplot-package, 3
ggwithinstats, 14, 33, 44, 60, 62
grouped_ggbarstats, 10, 29, 38, 55
grouped_ggbetweenstats, 14, 37, 40, 62
grouped_ggcorrmat, 20, 33, 44, 58
grouped_ggdotplotstats, 23, 26, 47, 52
grouped_gghistostats, 23, 26, 49, 50
grouped_ggpiestats, 10, 29, 40, 53
grouped_ggscatterstats, 20, 33, 46, 55
grouped_ggwithinstats, 14, 37, 44, 59
iris_long, 63
movies_long, 63
movies_wide, 64
theme(legend.position=...), 6
theme_corrmat (theme_ggstatsplot), 65
theme_ggstatsplot, 65
theme_pie (theme_ggstatsplot), 65
Titanic_full, 66
68

Ggstatsplot

Uploaded by

Copyright:

Available Formats

Ggstatsplot

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Ggstatsplot

Uploaded by

Copyright:

Available Formats

Package ‘ggstatsplot’

March 11, 2021

ggstatsplot-package ggstatsplot: ’ggplot2’ Based Plots with Statistical Details

• ggbetweenstats function to produce information-rich comparison plot between different

For more documentation, see the dedicated Website.

bugs_long Tidy version of the "Bugs" dataset.

• subject. Dummy identity number for each participant.

bugs_wide Wide-format version of the "Bugs" dataset.

Wide-format version of the "Bugs" dataset.

A data frame with 93 rows and 6 variables

• subject. Dummy identity number for each participant.

combine_plots Combining and arranging multiple plots in a grid

# preparing the first plot

data = subset(iris, iris$Species == "setosa"),

# preparing the second plot

# combining the plot with a title and a caption

ggbarstats Bar (column) charts with statistical tests

output Character that describes what is to be returned: can be "plot" (default) or

# association test (or contingency table analysis)

ggbetweenstats Box/Violin plots for group or condition comparisons in between-

To change this behavior, select appropriate option with pairwise.display ar-

• trimmed mean for robust statistics

output Character that describes what is to be returned: can be "plot" (default) or

# simple function call with the defaults

# more detailed function call

ggcoefstats Dot-and-whisker plots for regression analyses

# to get a tidy dataframe

# to get a glance summary

ggcorrmat Visualization of a correlation matrix

correlation coefficients, statistic values, p-values, no. of observations, etc.) will

grouped_ggcorrmat ggscatterstats grouped_ggscatterstats

# if `cor.vars` not specified, all numeric variables used

# to get the correlation matrix

# to get the correlation analyses results in a dataframe

ggdotplotstats Dot plot/chart for labeled numeric data.

• robust (for trimmed mean)

gghistostats Histogram for distribution of a numeric variable

• MAP estimator for Bayesian statistics

# using defaults, but modifying which centrality parameter is to be shown

ggpiestats Pie charts with statistical tests

# one sample goodness of fit proportion test

# association test (or contingency table analysis)

ggscatterstats Scatterplot with marginal distributions and statistical results

marginal Decides whether ggExtra::ggMarginal() plots will be displayed; the default

# creating dataframe with rownames converted to a new column

# simple function call with the defaults

ggwithinstats Box/Violin plots for group or condition comparisons in within-subjects

# two groups (*t*-test)

# more than two groups (anova)

grouped_ggbarstats Grouped bar (column) charts with statistical tests

# let's create a smaller dataframe

pairwise.comparisons Logical that decides whether pairwise comparisons

centrality.type Decides which centrality parameter is to be displayed. The

conf.level Scalar between 0 and 1. If unspecified, the defaults return 95%

# the most basic function call

# modifying individual plots using `ggplot.component` argument

grouped_ggcorrmat Visualization of a correlalogram (or correlation matrix) for all levels

ggcorrplot.args A list of additional (mostly aesthetic) arguments that will be

centrality.plotting Logical that decides whether centrality tendency mea-

# two groups (t-test)