Meta-Analyses of Experimental Data in Animal Nutrition : D. Sauvant, P. Schmidely, J. J. Daudin and N. R. St-Pierre
Meta-Analyses of Experimental Data in Animal Nutrition : D. Sauvant, P. Schmidely, J. J. Daudin and N. R. St-Pierre
Research in animal sciences, especially nutrition, increasingly requires processing and modeling of databases. In certain areas
of research, the number of publications and results per publications is increasing, thus periodically requiring quantitative
summarizations of literature data. In such instances, statistical methods dealing with the analysis of summary (literature) data,
known as meta-analyses, must be used. The implementation of a meta-analysis is done in several phases. The rst phase
concerns the denition of the study objectives and the identication of the criteria to be used in the selection of prior
publications to be used in the construction of the database. Publications must be scrupulously evaluated before being entered
into the database. During this phase, it is important to carefully encode each record with pertinent descriptive attributes
(experiments, treatments, etc.) to serve as important reference points for the rest of the analysis. Databases from literature
data are inherently unbalanced statistically, leading to considerable analytical and interpretation difculties; missing data are
frequent, and data structures are not the outcomes of a classical experimental system. An initial graphical examination of the
data is recommended to enhance a global view as well as to identify specic relationships to be investigated. This phase is
followed by a study of the meta-system made up of the database to be interpreted. These steps condition the denition of the
applied statistical model. Variance decomposition must account for inter- and intrastudy sources; dependent and independent
variables must be identied either as discrete (qualitative) or continuous (quantitative). Effects must be dened as either xed
or random. Often, observations must be weighed to account for differences in the precision of the reported means. Once model
parameters are estimated, extensive analyses of residual variations must be performed. The roles of the different treatments
and studies in the results obtained must be identied. Often, this requires returning to an earlier step in the process. Thus,
meta-analyses have inherent heuristic qualities.
Keywords: meta-analysis, mixed models, nutrition
Introduction
The research environment in the animal sciences, especially
nutrition, has markedly evolved in the recent past. In
particular, there is a noticeable increase in the number
of publications, each containing an increasing number of
quantitative measurements. Meanwhile, treatments often
have smaller effects on the systems being studied than in
the past. Additionally, controlled and non-controlled factors,
such as the basal plane of nutrition, vary from study
to study, thus requiring at some point a quantitative summarization of past research.
*Salaries and research support were provided by state and federal funds
appropriated to the Ohio Agricultural Research and Development Center, the
Ohio State University.
E-mail: sauvant@agroparistech.fr
Fundamental research in the basic animal science disciplines generates results that increasingly are at a much
lower level of aggregation than those of applied research
(organs, whole animals), thus supporting the necessity of
integrative research. Research stakeholders, those who ultimately use the research outcome, increasingly want more
quantitative knowledge, particularly on animal response
to diet, and of better precision. Forecasting and decisionsupport software require quantitative information. Additionally, research prioritization by funding sources may force
abandoning active research activities in certain elds. In such
instances, meta-analyses can still support discovery activities
based on the published literature.
The objectives of this paper are to describe the application
of meta-analytic methods to animal nutrition studies, including the development and validation of literature-derived
1203
>
1204
>
>
Meta-analyses in nutrition
be considered as either random or xed. If a dataset
comprised many individual studies from multiple research
centers, the study effect should be considered random
because each study is conceptually a random outcome from
a large population of studies to which inference is to be
made (St-Pierre, 2001). This is especially important if the
meta-analysis has for objective the empirical modeling of
biological responses, or the collective summarizations of
measurements that only had a secondary or minor role in
prior experiments, because it is likely that the researcher in
those instances has a targeted range of inference much
larger than the limited conditions represented by the specic studies. There are instances, however, where each
experiment can be considered as an outcome each from a
different population. In such instances, the levels of study
or trial are, in essence, considered arbitrarily chosen by
the research community, and the study effect must then be
considered xed. In such an instance, the range of inference
for the meta-analysis is limited to the domain of the specic
experiments in the dataset. This is of little concern if the
objective of the meta-analysis is that of global hypothesis
testing, but it does severely limit the applicability of its
results for other objectives.
x
x
x
x
In general, the variance between studies is large compared to the variance within studies, hence underlying the
importance of including the study effect into the metaanalytical model. The study effect represents the sum of the
effects of many factors that differ between studies, but
factors that are not in the model because they either were
not measured, or have been excluded from the model, or
for which the functional form in the model is inadequately
representing the true but unknown functional form (e.g. the
model assumes a linear relationship between the dependent and one independent continuous variable whereas the
true relationship is nonlinear). In the absence of interactions
between design variables (e.g. studies) and the covariates
(e.g. all model variables of interest), parameter estimates
for the covariates are unbiased, but the study effect can add
a large uncertainty to future predictions (St-Pierre, 2001).
The presence of signicant interactions between studies
and at least one covariate is more problematic since this
indicates that the effect of the covariate is dependent on
the study, implying that the effect of a factor is dependent
on the levels of unidentied factors.
OBJECTIVES
CONCEPTUAL
Basis
oo
SELECTION
OF EXPERIMENTS
& DATA ENTRY
GRAPHICAL
ANALYSIS
Literature search,
Experiments
Survey,
DETERMINATION OF
META-DESIGN
SELECTION OF
STATISTICAL
MODEL
Adjustments
USERS
POST-ANALYSIS
EVALUATION
Data ltering
There are at least three steps necessary to effective
data ltering. First, the analyst must ensure that the study
under consideration is coherent with the objectives of the
meta-analysis. That is, the meta-analytic objectives dictate
that some traits must be measured and reported. If, for
example, the objective of meta-analysis is to quantify the
relationship between dietary neutral detergent ber (NDF)
concentration and DMI, then one must ensure that both
NDF concentration and DMI were measured and reported
in all studies. The second step consists of a thorough and
critical review of each publication under consideration,
focusing on the detection of errors in the reporting of
quantitative results. This underlines the importance of
having a highly trained professional involved in this phase
of the study. Only after publications have passed this
expert quality lter should their results be entered in the
database. Verication of data entries is then another
essential component to the process. In this third step, it is
important to ensure that a selected publication does not
appear to be an outlier with respect to the characteristics
and relations under consideration.
1206
Meta-analyses in nutrition
1100
Chewing (Min/d)
1000
900
800
700
600
500
400
300
20
30
40
50
60
70
NDF (% of DM)
60
n = 517
m = 38.4
s = 13
50
Frequency
40
30
20
10
0
10
20
30
40
50
60
70
80
NDF (% of DM)
>
>
>
10
0
15
25
35
45
55
Mean NDF (%DM)
65
75
hi 1=n Xi Xm 2 =SXi Xm 2 ;
>
1208
>
Meta-analyses in nutrition
some sort of weighing scheme. Systems used for weighing
observations form two broad categories.
RMSE 1:02:
RMSE 1:56:
Statistical models
The independent variable can be either discrete or continuous. With binary data (healthy/sick, for example), generalized linear models (GLM) based on the logit or probit
link functions are generally recommended (Agresti, 2002).
Because of advances in computational power, the GLM has
been extended to include random effects in what is called
the generalized linear mixed model (GLMM). In its version
9, the SAS system includes a beta release of the GLIMMX
procedure to t these complicated models. In nutrition,
however, the large majority of the dependent variables
subjected to meta-analyses are continuous, and their analyses are treated at length in the remainder of this paper.
St-Pierre (2001) made a compelling argument to include
the study effect in all meta-analytic models. Because of the
severe imbalance in most databases used for meta-analyses, the exclusion of the study effect in the model leads to
biased parameter estimates of the effects of other factors
under investigation, and severe biases in variance estimates. In general, the study effect should be considered
random because it represents, in essence, the sum of the
effects of a great many factors, all with relatively small
effects on the dependent variable. Statistical theory indicates that these effects would be close to Gaussian
(normal), thus much better estimated if treated as random
effects. Practical recommendations regarding the selection
of the type of effect for the studies are presented in Table 2.
In short, the choice depends on the size of the conceptual
population, and the sample size (the number of studies in
the meta-analysis).
The ultimate (and correct) meta-analysis would be one
where all the primary (raw) data used to perform the
analyses in each of the selected publications were available
to the analyst. In such an instance, a large segmented
model that includes all the design effects of the original
studies (e.g. the columns and rows effects in Latin squares)
plus the effects to be investigated by the meta-analysis
could be tted by least-squares or maximum likelihood
methods. Although computationally complex, such huge
meta-analytic models should be no more difcult to solve
than the large models used by geneticists to estimate
the breeding values of animals using very large national
Table 2 Guidelines to establish whether an effect should be considered xed or random in a meta-analytic model*
Population
Case
Case
Case
Case
1
2
3
4
T is
T is
T is
T is
smalllarge
large
large
Experiment
tT
t5T
tT
t 5 T, and t is very small
Fixed effect
Random effect
Should be xed but random works better
Should be random but xed may work better
(i.e. variance components can be poorly estimated)
1210
Meta-analyses in nutrition
research. The following SAS statements can be used to
solve this model:
PROC MIXED DATA Mydata CL COVTEST;
CLASSES study tau;
MODEL Y tau;
RUN;
where Yij 5 the dependent variable, B0 5 overall (interstudy) intercept (a xed effect equivalent to m in (4)),
Si 5 the random effect of the i th study, assumed , iidN
(0, s2S ), B1 5 the overall regression coefcient of Y on X
(a xed effect), Xij 5 the value of the continuous predictor
variable, bi 5 the random effect of study on the regression
coefcient of Y on X, assumed , iidN (0, s2b ), and eij 5 the
residual errors, assumed , iidN (0, s2e ). Also, eij, bi and Si
are assumed to be independent random variables.
The following SAS statements can be used to solve this
model:
PROC MIXED DATA Mydata CL COVTEST;
CLASSES study;
MODEL Y X=SOLUTION;
RANDOM study study X;
RUN;
>
>
1212
Meta-analyses in nutrition
1000
10
800
Y = 55.9 + 29.4 X - 0.242 X2
700
Frequency
Chewing (min/d)
900
?
5
600
500
0
20
30
40
50
NDF (% DM)
60
70
References
Agresti A 2002. Categorical data analysis, 2nd edition. John Wiley & Sons,
Hoboken, New Jersey.
Bravo D, Sauvant D, Bogaert C and Meschy F III 2003. Quantitative aspects of
phosphorus excretion in ruminants. Reproduction, Nutrition, Development 43,
285300.
Cochran WG 1968. The effectiveness of adjustment by subclassication in
removing bias in observational studies. Biometrics 24, 295313.
De Groot MH 1970. Optimal statistical decision. Mc Graw-Hill, New York.
Draper NR and Smith H 1998. Applied regression analysis. John Wiley & Sons,
Inc., New York.
Euge`ne M, Archime`de H and Sauvant D 2004. Quantitative meta-analysis on
the effects of defaunation of the rumen on growth, intake and digestion in
ruminants. Livestock Production Science 85, 8191.
Firkins JL, Eastridge ML, St-Pierre N and Noftsger SM 2001. Invited: Effects of
grain variability and processing on starch utilization by lactating dairy cattle.
Journal of Animal Science 79(E. Suppl.), E218E238.
Glass GV 1976. Primary, secondary and meta-analysis of research. Education
Research 5, 38.
Grosman M and Koops WJ 1988. Multiphasic analysis of lactation curves in
dairy cattle. Journal of Dairy Science 71, 15981608.
Lovatto PA and Sauvant D 2002. Meta-analyse et modelisation de lingestion
volontaire chez le porc. Journees de la Recherche Porcine, Paris (FRA) 2002/02/
0507, 129134.
Mantel N and Haenszel W 1959. Statistical aspects of the analysis of data from
retrospective studies of disease. Journal National of Cancer Institute 22,
719748.
1214