DoubleML tutorial series

DoubleML
Orthogonalization
Python
Causal ML
A hands-on tutorial series for double/debiased machine learning, orthogonal scores, cross-fitting, inference, sensitivity analysis, and policy learning with DoubleML.

DoubleML is the tutorial track for orthogonal scores, sample splitting, flexible nuisance models, and transparent treatment-effect inference.

DoubleML workflow diagram showing nuisance learning, orthogonalization, cross-fitting, and inference
Figure: DoubleML workflow diagram showing how nuisance learning, orthogonalization, cross-fitting, and inference fit together (adapted from Tutorial 00: Environment And Library Tour).

Tutorial Sequence

00: Environment And Library Tour

Introduces DoubleML, orthogonal-score workflows, synthetic PLR examples, and the package objects used throughout the tutorial series.

03: Partially Linear Regression PLR

Covers the PLR model for continuous outcomes and treatments, including residualization, nuisance prediction quality, score behavior, and repeated-split stability.

04: Partially Linear IV PLIV

Adds instrumental variables to the partially linear setup, highlighting first-stage relevance, exclusion-risk stress tests, and residualized IV diagnostics.

06: Interactive IV Model IIVM

Uses binary instruments and treatment take-up to estimate local causal effects, with attention to compliance, weak instruments, and exclusion violations.

07: Difference-In-Differences DID

Connects DoubleML to panel-style causal designs by combining DiD logic, nuisance adjustment, parallel-trend diagnostics, and treatment-effect estimation.

08: Sample Selection Models

Studies missing or selectively observed outcomes, separating selection mechanisms from treatment effects and auditing positivity for observed samples.

09: Regression Discontinuity Design RDD

Frames sharp and fuzzy RDD as local causal designs, with running-score plots, bandwidth sensitivity, first-stage checks, and local effect interpretation.

16: Custom Scores And Advanced API

Looks under the hood at custom scores, advanced API patterns, and how to extend DoubleML while preserving the statistical logic of the estimator.

18: End-To-End DoubleML Case Study

Completes the series with a full applied workflow from design setup and nuisance modeling through estimates, diagnostics, sensitivity, and final communication.