Blog Posts
Statistics Formulas Cheat Sheet for Students

Statistics Formulas Cheat Sheet for Students: Every Formula You Need

You open your statistics textbook to review for the exam. Chapter 3 has one set of formulas. Chapter 7 introduces completely different notation. Chapter 10 adds Greek letters that look identical but mean different things. The formulas are scattered across 400 pages with no single reference that pulls them together. You flip back and forth, losing time and confidence.

This is not a study problem. It is a format problem. Statistics textbooks teach concepts sequentially but never consolidate the formulas into one place. The PDFs floating around online are either too dense (Stanford's 12-page reference) or too sparse (a single page missing half the formulas you need). Most are not mobile-friendly, and none explain when to use each formula.

This cheat sheet solves that. Every core formula from introductory statistics is organized by topic, presented in clean tables, and paired with a plain-English explanation of what it calculates and when you need it. Bookmark this page and use it alongside your textbook, homework, and exam prep.

Descriptive Statistics

Descriptive statistics summarize a dataset. These formulas tell you the center, spread, and shape of your data.

Measures of Central Tendency

FormulaNameWhat It CalculatesWhen to Use
x̄ = Σxᵢ / nSample MeanAverage value of the datasetDefault measure of center for symmetric data
Median = middle value when sortedMedianThe value that splits the dataset in halfUse when data is skewed or has outliers
Mode = most frequent valueModeThe value that appears most oftenUse for categorical data or to find peaks

Measures of Spread

FormulaNameWhat It CalculatesWhen to Use
Range = max − minRangeDistance between the largest and smallest valuesQuick but rough measure of spread
s² = Σ(xᵢ − x̄)² / (n − 1)Sample VarianceAverage squared deviation from the meanWhen you need spread in squared units
s = √[Σ(xᵢ − x̄)² / (n − 1)]Sample Standard DeviationAverage distance from the meanMost common measure of spread
σ² = Σ(xᵢ − μ)² / NPopulation VarianceVariance using the entire populationOnly when you have the full population
σ = √[Σ(xᵢ − μ)² / N]Population Standard DeviationStandard deviation for the full populationRare — usually use sample version
IQR = Q3 − Q1Interquartile RangeSpread of the middle 50% of dataUse with median for skewed data

Position and Shape

FormulaNameWhat It CalculatesWhen to Use
z = (x − x̄) / sZ-Score (sample)How many standard deviations x is from the meanComparing values across different datasets
z = (x − μ) / σZ-Score (population)Standard score using population parametersWhen population mean and SD are known
Percentile rank = (values below x / n) × 100PercentilePercentage of data below a given valueStandardized test scores, rankings
CV = (s / x̄) × 100%Coefficient of VariationRelative variability as a percentageComparing spread across different scales

Probability

Probability formulas calculate the likelihood of events.

Basic Probability Rules

FormulaNameWhat It CalculatesWhen to Use
P(A) = favorable outcomes / total outcomesClassical ProbabilityProbability of event AEqual-likelihood outcomes (coins, dice, cards)
P(A') = 1 − P(A)Complement RuleProbability that A does not happenWhen "not A" is easier to calculate
P(A ∪ B) = P(A) + P(B) − P(A ∩ B)Addition Rule (General)Probability of A or B or bothTwo events that can overlap
P(A ∪ B) = P(A) + P(B)Addition Rule (Mutually Exclusive)Probability of A or BEvents that cannot happen together
P(A ∩ B) = P(A) × P(B)Multiplication Rule (Independent)Probability of A and B bothEvents that do not affect each other
P(A ∩ B) = P(A) × P(B|A)Multiplication Rule (Dependent)Probability of A and B bothEvents where A affects B
P(A|B) = P(A ∩ B) / P(B)Conditional ProbabilityProbability of A given B occurredWhen one event is already known

Bayes' Theorem

FormulaNameWhen to Use
P(A|B) = [P(B|A) × P(A)] / P(B)Bayes' TheoremUpdating probability with new evidence; medical testing, spam filters

Counting Methods

FormulaNameWhat It CalculatesWhen to Use
n! = n × (n−1) × (n−2) × ... × 1FactorialTotal arrangements of n itemsPermutations and combinations
P(n,r) = n! / (n−r)!PermutationArrangements where order mattersRanked lists, sequences
C(n,r) = n! / [r!(n−r)!]CombinationSelections where order does not matterCommittees, card hands, groups

Probability Distributions

Binomial Distribution

Use when counting the number of successes in a fixed number of independent trials with the same probability of success.

FormulaWhat It Calculates
P(X = k) = C(n,k) × p^k × (1−p)^(n−k)Probability of exactly k successes in n trials
μ = npMean of the binomial distribution
σ = √[np(1−p)]Standard deviation of the binomial distribution

Requirements: Fixed number of trials (n), two outcomes (success/failure), constant probability (p), independent trials.

Normal Distribution

The bell-shaped curve that describes many natural phenomena. Most statistical inference is based on the normal distribution.

FormulaWhat It Calculates
z = (x − μ) / σConvert any normal value to a standard normal score
x = μ + zσConvert a z-score back to the original scale

Key properties: Mean = median = mode. About 68% of data falls within 1 SD, 95% within 2 SD, 99.7% within 3 SD (the 68-95-99.7 rule).

Poisson Distribution

Use when counting the number of events in a fixed interval of time or space when events occur independently at a constant average rate.

FormulaWhat It Calculates
P(X = k) = (λ^k × e^(−λ)) / k!Probability of exactly k events when average rate is λ
μ = λMean equals the rate parameter
σ = √λStandard deviation of the Poisson distribution

Sampling Distributions

These formulas describe the behavior of sample statistics when you take repeated samples from a population.

FormulaNameWhat It CalculatesWhen to Use
μ_x̄ = μMean of Sampling DistributionExpected value of the sample meanCenter of all possible sample means
σ_x̄ = σ / √nStandard Error of the MeanSpread of the sampling distributionMeasures precision of the sample mean
σ_p̂ = √[p(1−p) / n]Standard Error of a ProportionSpread of sample proportionsPolls, surveys, proportion estimates

Central Limit Theorem: For large enough n (typically n ≥ 30), the sampling distribution of x̄ is approximately normal regardless of the population shape.

Confidence Intervals

Confidence intervals estimate a population parameter within a range.

FormulaNameWhen to Use
x̄ ± z*(σ / √n)CI for Mean (σ known)Large sample, population SD known
x̄ ± t*(s / √n)CI for Mean (σ unknown)Small sample or population SD unknown
p̂ ± z*√[p̂(1−p̂) / n]CI for ProportionEstimating a population proportion
(n−1)s² / χ²_upper, (n−1)s² / χ²_lowerCI for VarianceEstimating population variance

Common Z-Values for Confidence Levels

Confidence Levelz* Value
90%1.645
95%1.960
99%2.576

Sample Size Formulas

FormulaWhat It DeterminesWhen to Use
n = (z*σ / E)²Sample size for meanPlanning a study to estimate a mean with margin of error E
n = p̂(1−p̂)(z* / E)²Sample size for proportionPlanning a survey with margin of error E

Hypothesis Testing

Hypothesis testing determines whether sample data provides enough evidence to reject a claim about a population.

Test Statistics

FormulaNameWhen to Use
z = (x̄ − μ₀) / (σ / √n)Z-Test for Meanσ known, large sample
t = (x̄ − μ₀) / (s / √n)T-Test for Meanσ unknown (most common case)
z = (p̂ − p₀) / √[p₀(1−p₀) / n]Z-Test for ProportionTesting a population proportion
t = (x̄₁ − x̄₂) / √(s₁²/n₁ + s₂²/n₂)Two-Sample T-TestComparing means of two groups
t = d̄ / (s_d / √n)Paired T-TestBefore/after measurements on same subjects
χ² = Σ[(O − E)² / E]Chi-Square TestCategorical data, goodness of fit, independence

Hypothesis Testing Decision Table

StepAction
1State null hypothesis (H₀) and alternative hypothesis (H₁)
2Choose significance level (α), typically 0.05
3Calculate the test statistic using the appropriate formula
4Find the p-value or compare to critical value
5If p-value < α, reject H₀. If p-value ≥ α, fail to reject H₀

Type I and Type II Errors

Error TypeWhat HappensProbabilityAlso Called
Type IReject H₀ when H₀ is trueα (significance level)False positive
Type IIFail to reject H₀ when H₁ is trueβFalse negative
PowerCorrectly reject H₀ when H₁ is true1 − βSensitivity

Linear Regression

Linear regression models the relationship between two variables.

FormulaNameWhat It Calculates
ŷ = b₀ + b₁xRegression EquationPredicted value of y for a given x
b₁ = Σ[(xᵢ − x̄)(yᵢ − ȳ)] / Σ(xᵢ − x̄)²SlopeChange in y for each unit change in x
b₀ = ȳ − b₁x̄Y-InterceptPredicted y when x equals zero
r = Σ[(xᵢ − x̄)(yᵢ − ȳ)] / [(n−1) × sₓ × sᵧ]Correlation CoefficientStrength and direction of linear relationship (−1 to 1)
r² = (explained variation) / (total variation)Coefficient of DeterminationProportion of y's variation explained by x
sₑ = √[Σ(yᵢ − ŷᵢ)² / (n−2)]Standard Error of EstimateAverage distance of data points from the regression line

Interpreting r Values

r Value RangeInterpretation
0.00 to 0.19Very weak
0.20 to 0.39Weak
0.40 to 0.59Moderate
0.60 to 0.79Strong
0.80 to 1.00Very strong

Negative values indicate an inverse relationship. The sign of r matches the sign of the slope b₁.

ANOVA (Analysis of Variance)

ANOVA tests whether the means of three or more groups are significantly different.

FormulaNameWhat It Calculates
F = MS_between / MS_withinF-StatisticRatio of between-group variance to within-group variance
SS_between = Σnⱼ(x̄ⱼ − x̄)²Between-Group Sum of SquaresVariation due to differences between group means
SS_within = ΣΣ(xᵢⱼ − x̄ⱼ)²Within-Group Sum of SquaresVariation within each group
MS = SS / dfMean SquareSum of squares divided by degrees of freedom

Decision rule: If F > F_critical (or p-value < α), at least one group mean differs significantly.

Quick Reference: Which Formula to Use

Use this decision table during exams to quickly identify the correct formula.

You Want To...Data TypeUse This
Describe center of dataQuantitativeMean or median
Describe spread of dataQuantitativeStandard deviation or IQR
Find probability of an eventCategoricalProbability rules
Count successes in trialsBinary outcomesBinomial distribution
Estimate a population meanQuantitative sampleConfidence interval for mean
Estimate a population proportionCategorical sampleConfidence interval for proportion
Test a claim about a meanQuantitative, σ knownZ-test
Test a claim about a meanQuantitative, σ unknownT-test
Compare two group meansTwo quantitative samplesTwo-sample t-test
Compare three or more meansMultiple quantitative groupsANOVA
Test relationship between two variablesTwo quantitative variablesLinear regression / correlation
Test categorical associationTwo categorical variablesChi-square test

How to Use This Cheat Sheet Effectively

During lectures: Keep this page open alongside your notes. When the professor introduces a new formula, find it on the cheat sheet to see where it fits in the bigger picture.

During homework: Use the "Which Formula to Use" table above to identify the right formula for each problem. The "When to Use" column tells you the conditions that must be met.

During exam prep: If your professor allows a formula sheet, use the tables in this article as a starting template and customize it with your professor's specific notation. If no formula sheet is allowed, use the cheat sheet to practice until the formulas are memorized.

For handwritten formula notes: Many students write formulas by hand for better retention. If you want a digital backup of your handwritten formula sheets, photograph them with Pixno (opens in a new tab) to create searchable, organized digital copies. This is especially useful for math-heavy courses where symbols and notation are easier to write by hand than to type.

Symbol Reference

SymbolMeaning
x̄ (x-bar)Sample mean
μ (mu)Population mean
sSample standard deviation
σ (sigma)Population standard deviation
nSample size
NPopulation size
p̂ (p-hat)Sample proportion
pPopulation proportion
α (alpha)Significance level
β (beta)Probability of Type II error
λ (lambda)Rate parameter (Poisson)
χ² (chi-square)Chi-square statistic
dfDegrees of freedom
H₀Null hypothesis
H₁ or HₐAlternative hypothesis
ΣSummation

Related Reading