← All books
Book · Intermediate · 50+ hours

Probability & Statistics for AI/ML

From Foundations to Advanced Statistical Inference

Master probability distributions and mathematical statistics from foundations to advanced inference. Learn Bayesian methods, information theory, and statistical learning for data science and machine learning.

31Chapters
175Sections
74hReading
13Parts
Part I·1 chapter · 6 sections

PrerequisitesMathematical foundations.

Prerequisites & Mathematical Foundations

Review essential mathematics: set theory, combinatorics, calculus, and linear algebra

6 sections120 min read
Start chapter
  1. 01Mode of Convergence and Stat Distributions25m
  2. 02Set Theory Essentials15m
  3. 03Combinatorics Review20m
  4. 04Calculus Refresher25m
  5. 05Linear Algebra Essentials20m
  6. 06Python and NumPy Setup15m
Part II·3 chapters · 18 sections

Probability BasicsCore probability theory.

Part IV·2 chapters · 11 sections

MultivariateJoint distributions and transformations.

Transformations of Random Variables

Functions of random variables, Jacobian method, and order statistics

5 sections125 min read
Start chapter
  1. 01Functions of Random Variables25m
  2. 02Jacobian Transformation Method30m
  3. 03Sum of Random Variables25m
  4. 04Order Statistics25m
  5. 05Convolutions20m
Part V·2 chapters · 14 sections

Limit TheoremsConvergence and fundamental theorems.

Part VII·3 chapters · 16 sections

TestingHypothesis testing.

Fundamentals of Testing

Hypothesis testing framework, Type I/II errors, power, and p-values

5 sections125 min read
Start chapter
  1. 01Hypothesis Testing Framework25m
  2. 02Type I and Type II Errors20m
  3. 03Power of a Test25m
  4. 04P-Values - Proper Interpretation25m
  5. 05Neyman-Pearson Lemma30m

Common Statistical Tests

Z-tests, t-tests, Chi-square tests, F-tests, and likelihood ratio tests

6 sections150 min read
Start chapter
  1. 01Z-Tests and T-Tests25m
  2. 02Chi-Square Tests25m
  3. 03F-Tests20m
  4. 04Likelihood Ratio Tests30m
  5. 05Wald and Score Tests25m
  6. 06Permutation Tests25m

Multiple Testing and Modern Issues

Multiple comparisons, FDR, A/B testing, and sequential testing

5 sections125 min read
Start chapter
  1. 01Multiple Comparisons Problem25m
  2. 02Bonferroni Correction20m
  3. 03False Discovery Rate25m
  4. 04A/B Testing for ML Applications30m
  5. 05Sequential Testing25m
Part VIII·3 chapters · 16 sections

BayesianBayesian statistics.

Bayesian Foundations

Bayesian paradigm, prior and posterior distributions, and conjugate priors

5 sections140 min read
Start chapter
  1. 01Bayesian vs Frequentist Paradigm25m
  2. 02Prior Distributions30m
  3. 03Posterior Distributions30m
  4. 04Conjugate Priors30m
  5. 05Non-informative and Jeffreys Priors25m

Bayesian Inference

Bayesian point estimation, MAP, credible intervals, and model comparison

5 sections125 min read
Start chapter
  1. 01Bayesian Point Estimation25m
  2. 02Maximum A Posteriori25m
  3. 03Bayesian Credible Intervals20m
  4. 04Bayes Factors and Model Comparison30m
  5. 05Empirical Bayes25m

Computational Bayesian Methods

Monte Carlo, MCMC, Metropolis-Hastings, Gibbs sampling, and variational inference

6 sections185 min read
Start chapter
  1. 01Monte Carlo Integration25m
  2. 02Markov Chain Monte Carlo35m
  3. 03Metropolis-Hastings Algorithm30m
  4. 04Gibbs Sampling30m
  5. 05Variational Inference35m
  6. 06Hamiltonian Monte Carlo30m
Part IX·1 chapter · 6 sections

Information TheoryEntropy and divergences.

Information Theoretic Foundations

Entropy, cross-entropy, KL divergence, and mutual information

6 sections160 min read
Start chapter
  1. 01Shannon Entropy25m
  2. 02Cross-Entropy25m
  3. 03KL Divergence30m
  4. 04Mutual Information25m
  5. 05Information Theory in ML Loss Functions30m
  6. 06Maximum Entropy Principle25m
Part X·2 chapters · 9 sections

Multivariate AnalysisPCA and dimensionality reduction.

Dimensionality Reduction Deep Dive

Eigendecomposition, SVD, t-SNE, UMAP, and random projections

4 sections110 min read
Start chapter
  1. 01Eigenvalue Decomposition for Statistics25m
  2. 02SVD and Statistical Applications30m
  3. 03t-SNE and UMAP30m
  4. 04Random Projections25m
Part XI·2 chapters · 12 sections

RegressionLinear models and GLMs.

Part XII·3 chapters · 12 sections

Advanced MLStochastic processes and PGMs.

Stochastic Processes

Markov chains, HMMs, Gaussian processes, and Poisson processes

4 sections120 min read
Start chapter
  1. 01Markov Chains30m
  2. 02Hidden Markov Models30m
  3. 03Gaussian Processes35m
  4. 04Poisson Processes25m

Probabilistic Graphical Models

Bayesian networks, Markov random fields, and inference in graphical models

4 sections110 min read
Start chapter
  1. 01Bayesian Networks30m
  2. 02Markov Random Fields25m
  3. 03Inference in Graphical Models30m
  4. 04Learning Graphical Model Structure25m

Statistical Decision Theory

Loss functions, risk, admissibility, and minimax decision rules

4 sections95 min read
Start chapter
  1. 01Loss Functions25m
  2. 02Risk and Bayes Risk25m
  3. 03Admissibility20m
  4. 04Minimax Decision Rules25m
Part 13·3 chapters · 11 sections

ApplicationsDeep learning and causal inference.

Statistics in Deep Learning

Weight initialization, batch normalization, dropout, and uncertainty quantification

4 sections105 min read
Start chapter
  1. 01Weight Initialization Theory25m
  2. 02Batch Normalization Statistics25m
  3. 03Dropout as Bayesian Approximation25m
  4. 04Uncertainty Quantification30m

Causal Inference

Potential outcomes, propensity scores, instrumental variables, and causal graphs

4 sections105 min read
Start chapter
  1. 01Potential Outcomes Framework30m
  2. 02Propensity Score Methods25m
  3. 03Instrumental Variables25m
  4. 04Causal Graphs25m

Putting It All Together

Statistical thinking for ML, common pitfalls, and further resources

3 sections65 min read
Start chapter
  1. 01Statistical Thinking for ML Problems25m
  2. 02Common Statistical Pitfalls25m
  3. 03Further Reading and Resources15m
The capstone

Where the book lands in practice.

Chapter 28·4 sections

Statistics in Deep Learning

Weight initialization, batch normalization, dropout, and uncertainty quantification

Open chapter
Chapter 29·4 sections

Causal Inference

Potential outcomes, propensity scores, instrumental variables, and causal graphs

Open chapter
Chapter 30·3 sections

Putting It All Together

Statistical thinking for ML, common pitfalls, and further resources

Open chapter

175 sections. Begin with one.

Chapter 0 — Prerequisites & Mathematical Foundations — is where every reader starts.