Mathematics Statistics
Mathematics Statistics
Key Concepts
1. Statistics:
• The study of data collection, analysis, interpretation, presentation, and organization.
• It helps make decisions based on data and predict trends.
• It involves summarizing large amounts of data in a way that is easy to understand
and analyze.
2. Types of Data:
• Qualitative (Categorical) Data: Data that describes qualities or characteristics (e.g.,
colors, gender, names).
• Quantitative (Numerical) Data: Data that represents numbers and can be measured
(e.g., height, weight, age).
3. Data Collection Methods:
• Primary Data: Data collected firsthand through surveys, experiments, or
observations.
• Secondary Data: Data collected by someone else and used by the researcher for
analysis (e.g., census data, research papers).
4. Sampling:
• Population: The entire group being studied.
• Sample: A subset of the population, often used when studying the whole population
is not feasible.
• Sampling Methods:
• Random Sampling: Every member of the population has an equal chance of
being selected.
• Systematic Sampling: Select every n-th member from a list.
• Stratified Sampling: Divide the population into subgroups (strata) and
sample from each subgroup.
• Convenience Sampling: Use data that is easiest to collect.
3. Probability
Probability is the measure of the likelihood that a certain event will occur.
• Basic Probability:
• The probability of an event A occurring is given by:
of favorable outcomes number of outcomesP(A)=Total number of outcomesNumber of favo
rable outcomes
• Complementary Events:
• The probability of the complement of event A, denoted A′, is:
P(A′)=1−P(A)
• Addition Rule (for two events):
• If events A and B are mutually exclusive (they cannot happen at the same time), then:
P(A∪B)=P(A)+P(B)
• If the events are not mutually exclusive (they can happen at the same time), then:
P(A∪B)=P(A)+P(B)−P(A∩B)
• Multiplication Rule (for independent events):
• If events A and B are independent, then:
P(A∩B)=P(A)×P(B)
• Conditional Probability:
• The probability of event A given that event B has occurred is:
P(A∣B)=P(B)P(A∩B)
4. Data Representation
Various methods exist to represent data visually.
• Bar Graphs:
• Used for categorical data, where the height of each bar represents the frequency or
count of each category.
• Histograms:
• Used for continuous numerical data. The data is divided into bins, and the height of
each bar represents the frequency of data points in that bin.
• Pie Charts:
• Used for categorical data. The circle is divided into sections to represent different
categories or proportions.
• Line Graphs:
• Used to display data points in a continuous time series (e.g., stock prices over time).
• Box-and-Whisker Plots:
• Used to represent the distribution of data based on five-number summary (minimum,
first quartile, median, third quartile, and maximum).
• Scatter Plots:
• Used to show the relationship between two variables by plotting data points on a
Cartesian plane.
5. Measures of Position
• Quartiles:
• Divide a data set into four equal parts.
• First Quartile (Q1): The median of the lower half of the data (25th percentile).
• Second Quartile (Q2): The median of the entire data set (50th percentile).
• Third Quartile (Q3): The median of the upper half of the data (75th percentile).
• Interquartile Range (IQR):
• The difference between the third and first quartile. It represents the spread of the
middle 50% of the data.
IQR=Q3−Q1
• Z-scores:
• The Z-score indicates how many standard deviations a data point is from the mean.
Z=σx−μ
Where x is a data point, μ is the mean, and σ is the standard deviation.