Lecture Notes

A structured curriculum in statistical inference, causal inference, machine learning, AI, and decision science.

This page collects course notes developed for my teaching in computational mathematics and data science degree programs at Embry-Riddle Aeronautical University. The curriculum is organized across four connected modules: (i) Statistical Inference, (ii) Core Causal Inference, (iii) Advanced Causal Inference, and (iv) AI and Machine Learning. Together, these modules move from statistical foundations to causal design, advanced causal workflows, machine learning, AI, diagnostics, simulations, and decision-focused interpretation.

Scroll through the modules below and click course names to open the full lecture sequences. If you notice an error or typo, please let me know.

Module 1: Statistical Inference (MA 412 and MA 506)

Statistical Inference is the entry point for the lecture curriculum. It builds the quantitative habits needed for later causal, machine-learning, and AI courses, including careful data description, uncertainty reasoning, assumption checks, and evidence-to-decision translation.

This module is designed for readers who want statistical work to feel connected to action. The first course builds the language of data, variation, bias, and modeling. The second course turns that language into formal inference, where likelihoods, intervals, tests, Bayesian updating, and decision reports support defensible analysis.

The two courses work as a pair. One trains the eye to see structure in data. The other trains the judgment to recognize where evidence is strong, weak, or incomplete.

Course 1: Statistical Foundations for Data Science

Histogram showing a skewed resolution-time distribution from the statistical foundations course
Figure: A skewed service-resolution distribution used to show why summary statistics need context before they become decision evidence (adapted from Lecture 04: Distributions, Variability, and Tail Risk).

This course builds the habits behind responsible data analysis. It covers units and variables, randomness, distributions, sampling problems, simulation, and careful communication about what a model can support.

Course 2: Statistical Inference for Decision Science

Coverage comparison plot for Wald, likelihood, and bootstrap confidence intervals
Figure: Empirical interval coverage across scenarios, illustrating that uncertainty methods must be matched to the decision setting (adapted from Lecture 04: Confidence Intervals: Wald, Likelihood, and Bootstrap).

This course moves from descriptive analysis to evidence claims. It covers likelihood, uncertainty intervals, testing, resampling, regression inference, Bayesian updating, missing data, model checking, and value-of-information reasoning.

Module 2: Core Causal Inference (MA 412 and MA 413)

Core Causal Inference is the backbone of the lecture curriculum. It develops the conceptual grammar needed for the rest of the site, including causal questions, interventions, estimands, randomization, credible adjustment, and quasi-experimental variation.

The sequence is split into four courses. Together they move from causal language to design practice. The reader first learns how to define the question, then how to create evidence through experiments, how to repair observational comparisons, and how to use policy variation in applied settings.

This module is the conceptual center of the site. Everything later, from causal ML to AI-assisted analysis, depends on getting this design logic right.

Course 1: Foundations of Causal Inference

Density plot showing imbalance created by a ranking policy
Figure: A ranking-policy imbalance plot used to make confounding visible before any estimator is introduced (adapted from Lecture 08: Confounders, Mediators, Colliders, and Selection Bias).

This course builds the language of causal reasoning, moving from interventions and potential outcomes to estimands, identification assumptions, DAGs, and common adjustment mistakes.

Course 2: Randomized Experiments and Product Experimentation

Experiment allocation comparison showing clean and bugged A/B test assignment
Figure: A clean-versus-bugged allocation diagnostic, showing why experiment validity depends on the assignment running as designed (adapted from Lecture 02: A/B Testing and Product Experimentation).

This course treats randomized evidence as an operational workflow. It covers experiment design, power, guardrail metrics, clustered assignment, noncompliance, interference, and clear reporting for launch decisions.

Course 3: Observational Adjustment

Risk-score distributional balance plot comparing raw, weighted, and matched groups
Figure: Balance diagnostics across raw, weighted, and matched comparisons, showing how adjustment is audited visually (adapted from Lecture 05: Covariate Balance Diagnostics).

When treatment assignment comes from observed behavior, credibility depends on measured confounding, overlap, and diagnostics. This course covers regression adjustment, propensity scores, matching, weighting, balance checks, doubly robust estimation, TMLE, and sensitivity analysis.

Course 4: Quasi-Experiments and Natural Experiments

Difference-in-differences estimate comparison across specifications
Figure: DiD estimates across near-equivalent specifications, highlighting how design credibility is checked through specification comparisons (adapted from Lecture 01: Difference-in-Differences).

This course studies designs that exploit timing, thresholds, instruments, shocks, or policy variation. Topics include difference-in-differences, event studies, staggered adoption, synthetic control, regression discontinuity, instrumental variables, encouragement designs, and interrupted time series.

Module 3: Advanced Causal Inference (MA 505 and MA 506)

Advanced Causal Inference is the second major causal module in the lecture sequence. It assumes the reader has worked through the core causal inference material, then moves toward the professional settings where causal work becomes more technical, more operational, and more decision-facing.

The module has three courses. Causal Machine Learning covers modern estimation and policy-learning workflows once the estimand is clear. Industry Applications shows how causal designs map to product, marketing, marketplace, and policy decisions. Advanced Topics covers the complications that mature causal projects must confront, including mechanisms, missingness, measurement error, transportability, interference, panels, discovery, Bayesian workflows, and AI-system complications.

Here the goal is to estimate effects and decide which effect matters, which method is credible, and how the result changes a real decision.

Course 1: Causal Machine Learning

Scatter plots comparing predicted uplift to true uplift
Figure: Predicted uplift against true uplift, showing why causal ML is judged by treatment-effect recovery and policy value (adapted from Lecture 02: CATE and Uplift Modeling).

This course focuses on heterogeneous treatment effects, CATE and uplift modeling, meta-learners, causal forests, Double ML, policy learning, treatment targeting, off-policy evaluation, and validation.

Course 2: Industry Applications of Causal Inference

Revenue trend before and after a campaign launch
Figure: Campaign-launch revenue trends, connecting causal design to the kind of business question that teams actually face (adapted from Lecture 01: Marketing Incrementality).

This course translates causal inference into applied decision workflows for marketing incrementality, pricing, promotions, ranking, retention, product launches, marketplaces, public-sector applications, and leadership decision summaries.

Course 3: Advanced Topics in Causal Inference

Network experiment plot with treated nodes highlighted
Figure: A network experiment where spillovers make the usual independent-unit assumption visibly fragile (adapted from Lecture 06: Interference and Spillovers).

This course studies the issues that make advanced causal projects difficult, including mediation, principal stratification, missing data, measurement error, transportability, interference, panel complications, causal discovery, Bayesian causal inference, and AI-system complications.

Module 4: AI and Machine Learning (DS 440, DS 540, DS 625 and DS 699)

AI and Machine Learning complements the statistics and causal curriculum with courses on predictive modeling, explanation, anomaly detection, and AI-assisted causal work. The common theme is decision science: models are useful when they support auditable choices, expose uncertainty, and remain connected to operational guardrails.

The module has four courses. The first covers supervised and unsupervised machine learning foundations for operational decision systems. The second develops interpretable ML and XAI as reviewable explanation workflows. The third treats anomaly detection as a triage and monitoring problem. The fourth shows how LLMs, RAG, agents, and structured evaluation can assist causal analysts while preserving identification discipline.

The through-line is practical governance. A useful model can score, cluster, explain, detect, or draft while making uncertainty and failure modes visible enough for responsible use.

Course 1: Machine Learning Basics for Decision Science

ROC and precision-recall curves comparing classification models
Figure: ROC and precision-recall curves, used to connect model performance to threshold choice and operational cost (adapted from Lecture 09: Classification Metrics, Thresholds, and Confusion Matrices).

A supervised and unsupervised learning course for decision systems, covering framing, leakage, baselines, regularization, trees, metrics, calibration, clustering, monitoring, and guardrails.

Course 2: Interpretable ML and XAI

SHAP attribution comparison under different baselines
Figure: SHAP attribution under different baselines, showing how explanation choices affect the story a model appears to tell (adapted from Lecture 08: SHAP Values: Intuition, Computation, and Failure Modes).

A mathematically grounded interpretability course covering transparent baselines, feature effects, SHAP, LIME, recourse, explanation stability, fairness, governance, and stakeholder communication.

Course 3: Anomaly Detection for Decision Systems

Checkout reliability time series with incident and benign event windows
Figure: Incident windows on checkout reliability metrics, showing why anomaly detection is a monitoring and triage workflow (adapted from Lecture 10: Time-Series Anomaly Detection).

A decision-focused anomaly detection course covering statistical detectors, distance and density methods, isolation forests, reconstruction, time series, thresholds, triage, root-cause analysis, and governance.

Course 4: AI for Causal Inference

Rubric heatmap evaluating good and bad AI-generated causal outputs
Figure: An evaluation rubric heatmap for AI-generated causal outputs, emphasizing auditability over novelty (adapted from Lecture 22: Evaluating AI Outputs in Causal Workflows).

A course on using local LLMs, RAG, structured outputs, agents, diagnostics, code generation, sensitivity workflows, and evaluation to support causal analysts while preserving identification judgment.