Measure Theory, Probability, and Stochastic Processes
Page Contents
ToggleCourse Overview
This page brings together graduate-level lectures and exercises on measure theory, probability, and stochastic processes. The material follows a unified measure-theoretic approach, structured in three parts: Part I — Measure Theory and Integration, Part II — Probability and Random Sequences, and Part III — Stochastic Processes and Ergodic Theory.
Each lecture is available as a downloadable PDF, accompanied by exercises with detailed solutions. Selected lectures also include video recordings in English and French.
This foundational framework underpins the resources available on Statistics Theory, Machine & Deep Learning, Information Theory, Queueing Theory, and Point Processes.
Part I: Measure Theory and Integration
1 Measure Theory
PDF, video, and exercises of the lecture on Measure Theory, the cornerstone of modern integration and probability theory.
We begin with the fundamental concept of measurability, exploring algebras and σ-algebras, the construction of measurable sets and generated σ-algebras, and the pivotal Borel σ-algebra. We then study measurable functions, their properties such as measurability at the limit, the concept of simple measurable functions, and the indispensable simple approximation theorem. We move on to a rigorous examination of measures, starting with their definition and essential lemmas. Through insightful examples including the Dirac measure and the weighted counting measure, we elucidate the concepts of finite, probability, and σ-finite measures. We then address the continuity of measures, the extension and uniqueness of measures from algebras and semirings, the cumulative distribution functions, the concept of almost everywhere, and conclude with the existence and uniqueness of the Lebesgue measure on ℝⁿ.
Course Outline:
- Measurability
- Algebra and σ-algebra
- Measurable functions
- Measures
- Extension and uniqueness of measures
- Cumulative distribution function
- Almost everywhere
- Lebesgue measure
2 Lebesgue Integral
PDF, video, and exercises of the lecture on the Lebesgue Integral, a profound extension of the classical Riemann integral.
We begin by detailing the construction of the Lebesgue integral, with essential results on the exchange of integral and limit operations, exemplified by the Monotone Convergence Theorem, Fatou’s Lemma, and the Dominated Convergence Theorem. We present the Change of Variable formula, providing a rigorous framework for transforming integrals between spaces. We then focus on Fubini’s theorem, investigating the existence and uniqueness of product measures and the Fubini-Tonelli theorem, which provides conditions under which the order of integration can be interchanged. We explore the Radon-Nikodym theorem, central to understanding the relationship between measures and functions, with the notions of absolute continuity and the Lebesgue decomposition. Finally, we turn to Lp spaces, covering essentially bounded functions, Minkowski’s inequality, Hölder’s inequality, the celebrated Riesz-Fischer theorem, the theory of the Hilbert space L², and convergence properties within Lp spaces.
Course Outline:
- Construction of the Lebesgue integral
- Exchange of integral and limit
- Change of variable
- Fubini’s theorem
- Integration by parts for measures
- Radon-Nikodym theorem
- Lebesgue decomposition
- Lp spaces
3 Integration of Functions of a Real Variable
PDF, videos and exercises of the lecture on Integration of Functions of a Real Variable, adapting to the Lebesgue framework classical results that are traditionally presented for the Riemann integral.
After constructing the Lebesgue integral on general measure spaces in the previous lectures, this chapter restricts to mappings of a real variable, with ℝ equipped with the Lebesgue measure. Many classical results of integration on ℝ are traditionally stated for the Riemann integral, as in the well-known textbook Cours de Mathématiques Spéciales by Ramis, Deschamps and Odoux. We adapt them here to the Lebesgue framework, for mappings with values in ℝ̄n or ℂn. We develop the Chasles relation, integral mapping and antiderivatives, the mean value formulas, the Newton-Leibniz formula, integration by parts, change of variables, Taylor’s formula with integral remainder, integrability by comparison, the integration of comparison relations, and finally the generalized integral with its calculation techniques. A pedagogical highlight is that integration is a regularizing operation: starting from a possibly discontinuous integrand, the integral mapping is always continuous, and differentiable almost everywhere thanks to the Lebesgue Differentiation Theorem.
Course Outline:
- Introduction
- Motivation: from Riemann to Lebesgue
- Oriented Integral
- Properties of the Integral on ℝ
- Chasles Relation
- Integral Mapping and Regularity
- Antiderivatives
- Mean Value Formulas
- Integral vs. Antiderivatives
- Newton-Leibniz Formula
- Integration by Parts
- Change of Variables
- Taylor’s Formula with Integral Remainder
- Integrability by Comparison
- Comparison by Inequality
- Domination and Equivalence
- Integration of Comparison Relations
- Integrable Case
- Non-Integrable Case
- Reference Functions and Comparison Rules
- Bertrand Integrals
- Generalized Integral
- Link with the Lebesgue Integral
- Correspondence with Riemann Terminology
- Cauchy Criterion for Integrals
- Abel’s Rule and Oscillating Integrals
- Bilateral Generalized Integral
- Calculation Techniques for a Generalized Integral
- Generalized Newton-Leibniz
- Change of Variables for the Generalized Integral
- Generalized Integration by Parts
Part II: Probability and Random Sequences
4 Probability Theory
PDF, video, and exercises of the lecture on Probability Theory, developed from its measure-theoretic foundations.
We begin by defining the probability space, encompassing the probability measure, events, and the concept of « almost surely » within probability theory. We then explore random variables and their associated probability distributions, covering the definition and characteristics of random variables, the σ-algebra generated by them, and the expectation of random variables with its key properties. We discuss moments, characteristic functions, cumulative distribution functions (CDFs), and probability density functions (PDFs), along with the notion of independence among random variables. Finally, we examine the monotone and dominated convergence theorems for random variables, providing crucial insights into the behavior of sequences of random variables.
Course Outline:
- Probability Space
- Random Variable
- Expectation of Random Variable
- Moments and Characteristic Function
- CDF and PDF
- Independence
- Monotone and Dominated Convergence for Random Variables
- Monotone Convergence for Random Variables
- Dominated Convergence for Random Variables
5 Discrete Random Variables and their Transform
PDF and exercises of the lecture on Discrete Random Variables and Their Transform.
We begin with the fundamental types of discrete random variables: the Bernoulli random variable, representing the simplest form of a random process with two outcomes; the Binomial random variable, modeling the number of successes in independent trials; the Geometric random variable, focusing on the number of trials needed to achieve the first success; and the Poisson random variable, modeling the number of events occurring in a fixed interval. We then explore the probability generating function (PGF), starting with its definition and fundamental properties, illustrating how it encapsulates the probabilistic information of a discrete random variable. We discuss how the PGF can be used to analyze the distribution of a random variable, derive its factorial moments, handle random sums of random variables, and exploit its monotonicity and convexity properties.
Course Outline:
- Fundamental discrete random variables
- Bernoulli random variable
- Binomial random variable
- Geometric random variable
- Poisson random variable
- Probability generating function
- Definition and properties
- Examples of probability generating functions
- Probability generating function and distribution analysis
- Factorial moments from probability generating function
- Random sum of random variables
- Monotonicity and convexity of the probability generating function
6 Continuous Random Variables and their Transforms
PDF and exercises of the lecture on Continuous Random Variables and Their Transforms.
We begin with the fundamental continuous random variables, focusing on the Gaussian random variable, renowned for its bell curve and central role in the Central Limit Theorem, and the Exponential random variable, crucial in modeling time until an event. We then explore the moment generating function (MGF), with its formal definition, the conditions for its existence through the radius of convergence, and its infinite series expansion. We cover the characteristic function, a Fourier transform of the probability density function, discussing its differentiability, the implications of its finite expansion, the pivotal Lévy’s inversion formula, and its infinite expansion. Finally, we discuss the Laplace transform, particularly useful in solving differential equations and analyzing systems, covering its differentiability, finite expansion, and infinite expansion, and noting that the Laplace transform is real analytic.
Course Outline:
- Fundamental continuous random variables
- Gaussian random variable
- Exponential random variable
- Moment generating function
- Definition and radius of convergence
- Infinite expansion of the moment generating function
- Characteristic function
- Differentiability and finite expansion of characteristic function
- Lévy’s inversion formula
- Characteristic function of random vectors
- Infinite expansion of the characteristic function
- Laplace transform
- Differentiability and finite expansion of Laplace transform
- Infinite expansion of the Laplace transform
- Laplace Transform is Real Analytic
7 Random Vectors and Gaussian Distribution
PDF and exercises of the lecture on Random Vectors and Gaussian Distribution.
We begin with the mathematical backbone of random vectors through the lens of characteristic functions, starting with Lévy’s inversion formula for random vectors, pivotal for recovering the distribution of random vectors from their characteristic functions, and the independence criterion, which provides a method to determine the statistical independence of components within random vectors. We then focus on square-integrable random vectors, introducing the covariance matrix, a critical tool for understanding the variance and correlation structure of random vectors, covering degenerate random vectors and the role of affine transformations. We turn to Gaussian random vectors, beginning with their definition and characteristic function, examining the relationship between independence and uncorrelation, and analyzing the probability density function of a nondegenerate Gaussian vector. Finally, we explore complex Gaussian random vectors, starting with the basics of complex random vectors and progressing to the properties of symmetric complex Gaussian vectors, the criterion for the independence of jointly CN random vectors, the spectral decomposition and characterization of CN vectors, the real representations of complex vectors and matrices, the covariance and linear combinations of CN random vectors, and the probability density function of CN random vectors.
Course Outline:
- Characteristic Functions of Random Vectors
- Lévy’s Inversion Formula for Random Vectors
- Independence Criterion
- Square-Integrable Random Vectors
- Covariance Matrix
- Degenerate Random Vectors
- Affine Transformations
- Gaussian Random Vectors
- Definition and Characteristic Function
- Independence Versus Uncorrelation
- Probability Density Function of a Nondegenerate Gaussian Vector
- Complex Gaussian Random Vectors
- Basics of Complex Random Vectors
- Symmetric Complex Gaussian Vector
- Criterion for Independence of Jointly CN Random Vectors
- Spectral Decomposition and Characterization of CN Vectors
- Real Representations of Complex Vectors and Matrices
- Covariance and Linear Combinations of CN Random Vectors
- Probability Density Function of CN Random Vectors
8 Convergence for Sequences of Random Variables
PDF and exercises of the lecture on Convergence for Sequences of Random Variables.
We begin with almost-sure convergence, covering the Strong Law of Large Numbers, the Borel-Cantelli lemmas, and various conditions guaranteeing almost-sure convergence. We then focus on convergence in probability, a weaker form of convergence, with the Continuous Mapping Theorem for convergence in probability. We discuss convergence in quadratic mean (L² convergence), particularly important in the context of mean-square error analysis and signal processing. We progress to weak convergence (convergence in distribution), covering the Portmanteau theorem, the Continuous Mapping Theorem for weak convergence, the Characteristic Function Criterion, and the fundamental Central Limit Theorem. We synthesize the different types of convergence through the connections between convergence types, explore the concept of stochastic order for comparing the magnitudes of random variables, and address the concept of tightness of probability measures with the pivotal Prohorov’s Theorem.
Course Outline:
- Almost-sure convergence
- Strong law of large numbers
- Borel-Cantelli lemmas
- Conditions for almost-sure convergence
- Convergence in probability
- Continuous mapping theorem for convergence in probability
- Convergence in quadratic mean
- Weak convergence – Convergence in distribution
- Portmanteau theorem
- Continuous mapping theorem for weak convergence
- Characteristic function criterion
- Central limit theorem
- Connections between convergence types
- Stochastic order
- Tightness of probability measures
- Tightness and uniform tightness
- Lemmas on Tightness
- Prohorov’s Theorem
Part III: Stochastic Processes and Ergodic Theory
9 Stochastic Processes
PDF and exercises of the lecture on Stochastic Processes, mathematical models depicting the evolution of systems over time through probabilistic mechanisms.
We begin by defining stochastic processes and discussing the concept of independence within these processes. This foundation sets the stage for understanding finite-dimensional distributions (fidis), essential for grasping the behavior of processes across various dimensions. We delve into Kolmogorov’s Extension Theorem, which provides a method for constructing a process from its finite-dimensional distributions, and we analyze sample paths, exploring their properties and behaviors, before concluding with second-order stochastic processes, focusing on their mean and covariance functions. We then differentiate between strict stationarity, which requires the joint distribution of any set of points to be invariant under time shifts, and wide-sense stationarity, which relaxes this to only require invariance in the mean and autocovariance functions. We explore the measurability of stochastic processes and introduce the stochastic integral, a critical tool in stochastic calculus. Finally, we cover Gaussian processes, starting with their definition and characteristic functions, the conditions for their existence and stationarity properties, and conclude with an in-depth look at the Wiener process, also known as Brownian motion, a cornerstone of stochastic modeling in finance and physics.
Course Outline:
- Foundations of Stochastic Processes
- Definition and Independence of Stochastic Processes
- Finite-Dimensional Distributions (fidis)
- Kolmogorov’s Extension Theorem
- Sample Paths of a Stochastic Process
- Second-Order Stochastic Process
- Strict and Wide-Sense Stationarity
- Stationary Stochastic Processes
- Wide-Sense Stationarity
- Measurability and Stochastic Integral
- Measurable Stochastic Processes
- Stochastic Integral
- Gaussian
10 Conditional Expectation and MMSE Estimation
PDF and exercises of the lecture on Conditional Expectation and MMSE Estimation, two fundamental concepts in probability theory and statistical estimation.
We begin with conditional expectation, starting with the fundamentals of conditioning with respect to a sigma-algebra and a random variable. We explore its properties and the important convergence theorems (monotone and dominated convergence) in the context of conditional expectation. We distinguish between conditional expectations for discrete and continuous variables and examine the special case of conditional expectation in Gaussian distributions. We then focus on Minimum Mean Square Error (MMSE) estimation, highlighting its relationship with conditional expectation, comparing MMSE estimation to conditional expectation and emphasizing their similarities. We finally introduce linear MMSE estimation, discussing its practical applications and advantages in various scenarios such as statistical analysis and signal processing.
Course Outline:
- Conditional Expectation
- Conditioning With Respect to a Sigma-Algebra
- Conditioning With Respect to a Random Variable
- Properties of Conditional Expectation
- Monotone and Dominated Convergence for Conditional Expectation
- Conditional Expectation for Discrete and Continuous Variables
- Conditional Expectation in Gaussian Case
- MMSE Estimation
- MMSE Estimation vs. Conditional Expectation
- Linear MMSE Estimation
11 Martingales and Ergodic Theory
PDF of the lecture on Martingales and Ergodic Theory, two pivotal areas in probability and dynamical systems.
We begin with an introduction to martingales, focusing on their definition and basic properties, highlighting their significance in stochastic processes. We delve into the optional stopping theorem and martingale convergence theorems, crucial for understanding the behavior of martingales under various conditions, and examine several examples of martingales illustrating their application in different probabilistic contexts. We then turn to ergodic theory, starting with the notions of flow, shift, and compatibility, essential for understanding the dynamics of systems over time. We introduce the stationary framework, providing a foundation for analyzing systems that exhibit statistical regularity. A key highlight is Birkhoff’s Pointwise Ergodic Theorem, a fundamental result that connects individual trajectories with long-term statistical behavior. Finally, we explore the ergodic stationary framework, offering insights into the behavior of systems that are both stationary and ergodic.
Course Outline:
- Martingales
- Definition and Basic Properties of Martingales
- Optional Stopping and Martingale Convergence Theorems
- Examples of Martingales
- Ergodic Theory
- Flow, Shift and Compatibility
- Stationary Framework
- Birkhoff’s Pointwise Ergodic Theorem
- Ergodic Stationary Framework
Book Coming Soon
A book based on this material is currently in preparation:
Mohamed Kadhem KARRAY — « Measure Theory, Probability, and Stochastic Processes: An Integrated Approach ».
The book will provide a unified treatment of measure theory, probability, and stochastic processes, expanding and consolidating the lectures available on this page. Stay tuned for the publication announcement.
About These Topics
These graduate-level lectures on measure theory, probability, and stochastic processes are designed for students and researchers seeking a rigorous mathematical foundation for advanced applications in statistics, machine learning, information theory, queueing theory, and stochastic geometry. The material adopts a unified measure-theoretic approach, where probability theory is naturally derived from measure theory, providing a coherent framework across the three parts.
Part I covers the foundations of measure theory, the construction of the Lebesgue integral, key results such as Fubini’s theorem, the Radon-Nikodym theorem, and Lp spaces, and the classical theory of integration on the real line.
Part II develops probability theory from its measure-theoretic foundations, covering random variables and their transforms (probability generating functions, moment generating functions, characteristic functions, Laplace transforms), random vectors and Gaussian distributions, and the various modes of convergence of sequences of random variables including the Central Limit Theorem and Prohorov’s theorem.
Part III introduces stochastic processes, covering foundations through Kolmogorov’s extension theorem, stationarity, the stochastic integral, Gaussian processes, the Wiener process, conditional expectation and MMSE estimation, martingales, and ergodic theory including Birkhoff’s pointwise ergodic theorem.
Each lecture is accompanied by exercises with detailed solutions, designed to reinforce the theoretical material and build problem-solving skills. The material draws on classical references in measure theory and probability, presented in a self-contained format suitable for independent study, graduate coursework, or research preparation.
Last Updated on 29 mai 2026 by Mohamed Kadhem KARRAY