Saldanha Thompson, 2002
Saldanha Thompson, 2002
Saldanha Thompson, 2002
1. BACKGROUND
Figure 1. Part of an instructional activity designed to help students make sense of com-
puter simulations of drawing many random samples from a population. Simulation input
(left) and output (right) windows were displayed in the classroom and the instructor posed
questions designed to orchestrate reflective discussions about the simulations.
who performed better on the instructional activities and those who were
able to hold coherent discourse about the mathematical ideas highlighted
in instruction – had developed a multi-tiered scheme of conceptual oper-
ations centered around the images of repeatedly sampling from a popu-
lation, recording a statistic, and tracking the accumulation of statistics as
they distribute themselves along a range of possibilities. These images and
operations were tightly aligned with those promoted in classroom instruc-
tional tasks and discussions. As such, we conjecture that these students’
engagement in the instructional activities played an important role in their
developing such a scheme. For instance, we had students practice ima-
gining and describing a coordinated multi-level process that gives rise to
sampling distributions (and to the simulations’ results):
Level 1: Randomly select items to accumulate a sample of a given size
from a population. Record a sample statistic of interest.
Level 2: Repeat Level 1 process a large number of times and accumulate
a collection of statistics.
Level 3: Partition the collection in Level 2 to determine what proportion
of statistics lie beyond (below) a given threshold value.
In classroom discussions the instructor employed a metaphor designed
to help students distinguish and coordinate these different levels. The meta-
phor entails imagining a collected sample of dichotomous opinions (“yes”
or “no”) in Level 1 as a box containing ‘1’s (for “yes”) and ‘0’s (for “no”).
It then entails labeling each box with a ‘1’ (or a “0”) if the proportion
of its contents is greater (or less) than a given threshold value. In this
metaphor, what accumulates in Level 2 is a collection of ‘1’s and ‘0’s,
each of which represents a sample whose statistic is greater (less) than the
threshold value. At Level 3, the metaphor entails calculating the percent of
the Level 2 collection that are ‘1’ or that are ‘0’, depending on the required
comparison.
The following excerpt illustrates one student’s coherent image of the
multi-tiered sampling process, the development of which appeared to have
been facilitated by his use of this metaphor. We take this student’s coher-
ent image as an expression of the stable scheme of conceptual operations
characterized above. In the excerpt, the student (D) interpreted a sampling
simulation’s command and the result of running it as he viewed familiar
simulation windows on a computer screen (see Figure 1)2 :
262 LUIS SALDANHA AND PATRICK THOMPSON
D: Ok. It’s asking. . .the question is. . .like “do you like Garth Brooks?”.
You’re gonna go out and ask 30 people, it’s gonna ask 30 people 4500
times if they like Garth Brooks. The uh. . .(talks to himself) what’s
this? let’s see. . .the actual. . .like the amount of people who actually
like Garth Brooks are. . .or 3 out of 10 people actually prefer like
Garth Brooks’ music. And uh. . .for the 30. . .when you go out and
take one sample of 30 people, the cut off fraction means that if you’re
gonna count, you’re gonna count that sample, if like 37% of the 30
people preferred Garth Brooks. And then it’s going to tally up how
many of the samples had 37% people that preferred Garth Brooks.
So like the answer would be I don’t know, like whatever, 2000 out
of 4500 samples had at least 37% of people preferring Garth Brooks.
[. . .]
I: How was it that you thought about it that allowed you to keep things
straight? [. . .]
D: I just thought of it like . . . I don’t know, I sort of thought of it like
how you were saying. Like. . .if the like 1s and the 0s if you ask 30
uh if like 10 of them say they like Garth Brooks – or for every person
who likes Garth Brooks you put a 1 down, if they don’t you put a
zero. You do that 30 times and you’re gonna get like I don’t know,
15 ones and 15 zeros you add up, you add them up. Then it says the
cutoff fraction for each sample is 37% so you have like at least 37%
of the. . .like those or. . .30 – if you add it up and divided it by the
30 and it’s at least 37% then you have like another pile of like little
papers and you put a one on like the big, the big one for the sample
or a zero if it’s less than – if the whole sample is less than 37%. The
1s and 0s I don’t know. . .you said something about like. . .that sort of
helped.
A significant feature of student D’s thinking was his ability to clearly dis-
tinguish different levels of the resampling processes – never confounding
the number of people in a sample with the number of samples taken – while
coordinating the various levels into a structured whole. Additionally, and
relatedly, student D interpreted the result of the simulation as an amount
(percentage) of sample proportions, thus suggesting that he understood
that the multi-level process generated a collection of sample proportions.3
Student D’s coherent image contrasts sharply with that of many poorer-
performing students who persistently confounded numbers of people in a
sample with numbers of samples drawn. The following interview excerpt
illustrates one such student’s (M) difficulties in the context of explaining
similar computer simulations:
CONCEPTIONS OF SAMPLE 263
Segment 1
I: Ok, Suppose that, here’s what I’m gonna do, uhh instead of 4500
samples I’m gonna take uhh, 1000 samples. Everything’s gonna stay
the same – sample size is 30, population fraction is 3/10ths, but now
were’ just taking 1000 samples. What would you expect the results to
be?
[. . .]
M: Uhh, somewhere around like (short silence), hmm around like 25 to
30% of those 1000 samples.
I: Why 25 to 30%?
M: Because it’s uhh . . . easier to uhh, I mean
I: What are you basing that judgment on?
M: Uhh, the actual population percentage, of 30
I: Ok, so you figure it’ll be about 30%, 25 to 30, because the population
fraction is 30%?
M: Yeah, somewhere close to that.
[. . .]
Segment 2
I: Alright (runs simulation, result displayed on output screen is “189 of
these 1000 repetitions . . .”)
M: 2/10ths, 20%. Hmm, it’s still a little less
I: So it’s a little less than 20%, right?
M: Hmm hmm, huh (seems surprised)
[. . .]
Segment 3
I: Alright. Suppose that now we, let’s do this, let’s make 2500 samples
(changes parameter value in command window). What fraction of
those samples, I mean what result would you now expect, for the
number of samples that we’re going to get that exceed 37% preferring
Garth Brooks?
M: About 1/5 of those.
[. . .]
I: Now, before you would have said “well, 3/10ths of the 2500 samples,
the 2500 repetitions”
M: Hmm hmm
I: Do you still sort of lean that way, that you should get around 3/10ths
of the –?
M: I think it should, but I don’t understand why it’s not, why it keeps
coming out with 1/5th rather than 1/3rd .
264 LUIS SALDANHA AND PATRICK THOMPSON
of a statistic were based largely on how they thought the value compared
to the underlying population parameter per se, instead of on how it might
compare to a clustering of the statistic’s values.
On the basis of such characteristics, we conjecture that these students’
encompassing image of sample was additive – that is, in these instructional
settings they tended to view a sample simply as a subset of a population
and to view multiple samples as multiple subsets.
A contrasting image of sample is suggested in the following excerpt of
student D explaining the purpose of simulating resampling:
D: If like. . .if you represent – if you give it like the split of the population
and then you run it through the how – number of samples or whatever
it’ll give you the same results as if – because in real life the population
like of America actually has a split on whatever, on Pepsi, so it’ll give
you the same results as if you actually went out, did a survey with
people of that split.
I: Ok, now. What do you mean by “same results”? On any particular
survey at all – you’ll get exactly what it –?
D: No, no. Each sample won’t be the same but it’s a. . .it’d be. . .could be
close, closer. . .
266 LUIS SALDANHA AND PATRICK THOMPSON
4. C ONCLUSION
ACKNOWLEDGEMENT
N OTES
1. Similar samples share a common size, selection method, and parent population. Fur-
thermore, they are selected to obtain information about a common population charac-
teristic.
2. The simulation was of sampling people’s preference for a particular musician from a
hypothetical population having a known proportion of it preferring the musician.
3. We note that student D’s prediction of the simulation result was highly inaccurate
in this excerpt. Shortly thereafter, however, he quickly revised his prediction with a
highly accurate one and continued to make such accurate predictions throughout the
rest of the interview. We thus believe that his initial prediction was not an indication
of a poor sense of how the sample proportions were distributed, rather it was merely
the result of his focus, in the moment, on explaining how the simulation worked and
what it generated.
R EFERENCES
delMas, R.C., Garfield, J. and Chance, B.L.: 1999, Exploring the Role of Computer Simu-
lations in Developing Understanding of Sampling Distributions, Paper presented at the
American Educational Research Association, Montreal.
Glasersfeld, E. v.: 1995, Radical Constructivism: A Way of Knowing and Learning, Falmer
Press, London.
Harel, G. and Confrey, J. (eds.): 1994, The Development of Multiplicative Reasoning in the
Learning of Mathematics, SUNY Press, Albany, NY.
Inhelder, B. and Piaget, J.: 1964, The Early Growth of Logic in the Child: Classification
and Seriation, W.W. Norton and Company Inc., New York.
Kahneman, D. and Tversky, A.: 1982, ‘Variants of uncertainty,’ in D. Kahneman, P. Slovic
and A. Tversky (eds.), Judgment under Uncertainty: Heuristics and Biases, Cambridge
University press, New York, pp. 509–521.
Kahneman, D. and Tversky, A.: 1972, ‘Subjective probability: A judgement of represent-
ativeness,’ Cognitive Psychology 3, 430–454.
Konold, C.: 1989, ‘Informal conceptions of probability,’ Cognition and Instruction 6(1),
59–98.
Rubin, A., Bruce, B. and Tenney, Y.: 1991, Learning about Sampling: Trouble at the Core
of Statistics, In D. Vere-Jones (ed.), Proceedings of the Third International Conference
on Teaching Statistics, ISI Publications in Statistical Education, Dunedin, New Zealand,
Vol. 1, pp. 314–319.
270 LUIS SALDANHA AND PATRICK THOMPSON
Saldanha, L.A. and Thompson, P.W.: 2001, ‘Students’ reasoning about sampling distri-
butions and statistical inference,’ in R. Speiser and C. Maher (eds.), Proceedings of
the Twenty Third Annual Meeting of the North American Chapter of the International
Group for the Psychology of Mathematics Education, ERIC Clearinghouse for Science,
Mathematics, and Environmental Education, Columbus, OH, Snowbird, Utah, Vol. 1, pp.
449–454.
Schwartz, D.L., Goldman, S.R., Vye, N.J. and Barron, B.J.: 1998, ‘Aligning everyday
and mathematical reasoning: The case of sampling assumptions,’ in S.P. Lajoie (ed.),
Reflections on Statistics: Learning, Teaching, and Assessment in Grades K-12, Lawrence
Erlbaum Associates, Mahwah, NJ, pp. 233–273.
Sedlmeier, P.: 1999, Improving Statistical Reasoning: Theoretical Models and Practical
Implications, Lawrence Erlbaum, Mahwah, NJ.
Sedlmeier, P. and Gigerenzer, G.: 1997, ‘Intuitions about sample size: The empirical law
of large numbers,’ Journal of Behavioral Decision Making 10, 33–51.
Shaugnessy, J.M., Watson, J., Moritz, J. and Reading, C.: 1999, School Students’ Acknow-
ledgment of Statistical Variation, Paper presented at the Research Presession Symposium
of the 77th Annual NCTM Conference, San Francisco, CA.
Simon, M.A.: 1995, ‘Reconstructing mathematics pedagogy from a constructivist perspect-
ive,’ Journal for Research in Mathematics Education 26(2), 114–145.
Steffe, L.P. and Thompson, P.W.: 2000, ‘Teaching experiment methodology: Underlying
principles and essential elements,’ in A.E. Kelly and R.A. Lesh (eds.), Handbook of
Research Design in Mathematics and Science Education, Lawrence Erlbaum Associates,
inc., Mahwah, NJ, pp. 267–306.
Thompson, P.W. and Saldanha, L.A.: in press, ‘Fractions and multiplicative reasoning,’ in
J. Kilpatrick and G. Martin (eds.), Research Companion to the Principles and Standards
for School Mathematics, NCTM, Reston, VA, pp. 95–114.
Thompson, P.W. and Saldanha, L.A.: 2000, ‘Epistemological analyses of mathemat-
ical ideas: A research methodology,’ in M.L. Fernandez (ed.), Proceedings of the
Twenty Second Annual Meeting of the North American Chapter of the International
Group for the Psychology of Mathematics Education, ERIC Clearinghouse for Science,
Mathematics, and Environmental Education, Columbus, OH, Tucson, AZ, Vol. 2, pp.
403–408.
Watson, J.M. and Moritz, J.B.: 2000, ‘Developing concepts of sampling,’ Journal for
Research in Mathematics Education 31(1), 44–70.
Vanderbilt University,
Dept. of Teaching and Learning,
GPC Box 330, 240 Wyatt Center,
Nashville, TN 37203,
Telephone (615) 322-8100, Fax (615) 322-8999,
E-mail: luis.a.saldanha@vanderbilt.edu