CausalRL: Estimand-first causal RL and off-policy evaluation

Know what you're estimating. Know when to trust it. Know how it was produced.

📚 Docs • 🎓 Tutorials • 💡 Examples • 🖼️ Gallery • 📄 Cite


CausalRL is a research-grade Python library for off-policy evaluation (OPE) that makes causal assumptions explicit. It goes beyond point estimates, combining estimand-first design, diagnostics-first reporting, and reproducible benchmarks, so you can tell not just what a policy is worth but whether you should trust the estimate.

📦 v0.2.0 (research preview, alpha) · Import: import crl


✨ Why CausalRL?

🎯 Estimand-First
Every estimator is tied to a formal estimand with explicit identification assumptions.

🔍 Diagnostics by Default
Overlap, ESS, weight-tail, and shift checks run automatically with every evaluation.

📊 20+ Estimators
IS, DR, WDR, MAGIC, MRDR, MIS, FQE, DualDICE, GenDICE, DRL, and more.

📈 Sensitivity Analysis
Bounded-confounding curves quantify robustness to hidden confounders.

📦 D4RL Compatible
Load D4RL and RL Unplugged datasets with built-in adapters.

📝 Audit-Ready Reports
HTML reports with tables, figures, and full metadata bundles.

🧪 Ground-Truth Benchmarks
Synthetic bandit/MDP suites with known true values for validation.

⚡ Engineered for Reliability
Type-checked, tested, and deterministically seeded throughout.


🚀 Quickstart

Installation

# Install from PyPI
pip install causalrl

# With all extras
pip install "causalrl[all]"

# Clone and install from source
git clone https://github.com/gsaco/causalrl
cd causalrl
pip install -e .

Your First OPE Evaluation

from crl.benchmarks.bandit_synth import SyntheticBandit, SyntheticBanditConfig
from crl.ope import evaluate_ope

# Create a synthetic benchmark with known ground truth
benchmark = SyntheticBandit(SyntheticBanditConfig(seed=0))
dataset = benchmark.sample(num_samples=1000, seed=1)

# Run end-to-end evaluation
report = evaluate_ope(dataset=dataset, policy=benchmark.target_policy)

# View results
print(report.summary_table())

# Generate audit-ready HTML report
report.save_html("report.html")

Output:

              Estimator    Value     Std      ESS  OverlapWarning
0                    IS   0.8234  0.0821   412.3           False
1                   WIS   0.8156  0.0634   412.3           False
2                    DR   0.8189  0.0512   412.3           False
3                   WDR   0.8167  0.0498   412.3           False
Ground Truth: 0.8200

CLI

# Quick bandit OPE demo
python -m examples.quickstart.bandit_ope

# MDP evaluation
python -m examples.quickstart.mdp_ope

# Run full benchmark suite
python -m experiments.run_benchmarks --suite all --out results/

📊 Sample Outputs

Estimator Comparison: point estimates with uncertainty quantification
Overlap Diagnostics: importance-weight ratio distribution
Sensitivity Analysis: bounds under hidden confounding
Temporal ESS: effective sample size across the horizon


🧠 The Three Pillars

| Pillar      | Why It Matters                                                  | What You Get                                                             |
|-------------|-----------------------------------------------------------------|--------------------------------------------------------------------------|
| Estimands   | Know what quantity you're estimating, not just which estimator  | Explicit estimands with identification assumptions via AssumptionSet     |
| Diagnostics | Know when an estimate is fragile before acting on it            | Overlap checks, ESS, weight tails, shift diagnostics, sensitivity curves |
| Evidence    | Know how results were produced for auditing and reproducibility | Versioned configs, deterministic seeds, structured report bundles        |
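The ESS figure that appears in the Diagnostics row is worth making concrete. The sketch below implements the standard Kish effective-sample-size formula in plain NumPy; it is an independent illustration of the diagnostic, not crl's internal implementation (the `effective_sample_size` name here is just for this example):

```python
import numpy as np

def effective_sample_size(weights) -> float:
    """Kish effective sample size: (sum w)^2 / sum(w^2).

    A value near n means the target and behavior policies overlap well;
    a small value means a few heavily weighted samples dominate the estimate.
    """
    w = np.asarray(weights, dtype=float)
    return float(w.sum() ** 2 / (w ** 2).sum())

rng = np.random.default_rng(0)

# Well-overlapped weights: ESS stays close to n.
good = rng.uniform(0.8, 1.2, size=1000)

# Heavy-tailed weights: five samples carry almost all the mass.
bad = np.ones(1000)
bad[:5] = 200.0

print(effective_sample_size(good))  # close to 1000
print(effective_sample_size(bad))   # collapses to roughly 20
```

When ESS collapses like this, a point estimate can look precise while resting on a handful of trajectories, which is exactly the failure mode the automatic diagnostics are meant to surface.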

📦 Estimator Suite

| Category            | Estimators        | Notes                              |
|---------------------|-------------------|------------------------------------|
| Importance Sampling | IS, WIS, SN-IS    | Propensity-based weighting         |
| Doubly Robust       | DR, WDR           | Combines regression with IS        |
| Model-Assisted      | MAGIC, MRDR       | Variance reduction via modeling    |
| Marginalized        | MIS               | State-marginal importance sampling |
| Value Function      | FQE               | Fitted Q-Evaluation                |
| DICE Family         | DualDICE, GenDICE | Distribution-correction estimation |
| Double RL           | DRL               | Double reinforcement learning      |
| High-Confidence     | HCOPE bounds      | Concentration-based bounds         |
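As a reference point for the first row, here is a minimal NumPy sketch of IS and WIS on a synthetic logged bandit with known propensities. It illustrates the propensity-based weighting itself, not crl's implementation, which layers diagnostics and variance handling on top:

```python
import numpy as np

rng = np.random.default_rng(0)

# Logged bandit data: a uniform behavior policy over 2 arms,
# with known propensities. Arm 1 pays ~1.0, arm 0 pays ~0.0.
n = 5000
actions = rng.integers(0, 2, size=n)
behavior_prop = np.full(n, 0.5)
rewards = np.where(actions == 1,
                   rng.normal(1.0, 0.5, n),
                   rng.normal(0.0, 0.5, n))

# Target policy: always pull arm 1, so its true value is 1.0.
target_prop = (actions == 1).astype(float)

w = target_prop / behavior_prop            # importance weights
is_est = np.mean(w * rewards)              # ordinary importance sampling
wis_est = np.sum(w * rewards) / np.sum(w)  # weighted (self-normalized) IS

print(f"IS:  {is_est:.3f}")   # both estimates land near the true value 1.0
print(f"WIS: {wis_est:.3f}")
```

WIS trades a small bias for lower variance by normalizing the weights, which is why its standard error is typically smaller in the quickstart output above.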

πŸ—οΈ Architecture

┌─────────────┐     ┌─────────────┐     ┌─────────────┐     ┌─────────────┐
│   Dataset   │ ──▶ │  Estimand   │ ──▶ │ Estimators  │ ──▶ │   Report    │
│             │     │ + Assump.   │     │ +Diagnostics│     │ (HTML/JSON) │
└─────────────┘     └─────────────┘     └─────────────┘     └─────────────┘
      │                                        │
      ▼                                        ▼
┌─────────────┐                         ┌─────────────┐
│  Benchmarks │                         │ Sensitivity │
│ (Synth/D4RL)│                         │  Analysis   │
└─────────────┘                         └─────────────┘
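The sensitivity-analysis stage can be sketched in the same spirit. Under a marginal-sensitivity-style assumption that the true importance weight differs from the nominal one by at most a multiplicative factor Λ, each IS term can be perturbed adversarially to yield an interval rather than a point. This is a deliberately simplified illustration over a plain IS estimator; the function name is hypothetical and crl's bounded-confounding curves may be computed differently:

```python
import numpy as np

def is_bounds_under_confounding(weights, rewards, lam: float):
    """Crude IS bounds when the true weight may differ from the nominal
    one by a multiplicative factor in [1/lam, lam] (hidden confounding)."""
    w = np.asarray(weights, dtype=float)
    r = np.asarray(rewards, dtype=float)
    # Scale each term w*r adversarially: shrink positive terms and inflate
    # negative ones for the lower bound, and vice versa for the upper bound.
    lo = np.mean(w * r * np.where(r >= 0, 1.0 / lam, lam))
    hi = np.mean(w * r * np.where(r >= 0, lam, 1.0 / lam))
    return lo, hi

rng = np.random.default_rng(0)
w = rng.uniform(0.5, 2.0, size=2000)
r = rng.normal(1.0, 0.2, size=2000)

# Sweeping lam traces out a bounded-confounding curve: at lam=1 the
# interval collapses to the point estimate, and it widens as the
# allowed confounding grows.
for lam in (1.0, 1.5, 2.0):
    lo, hi = is_bounds_under_confounding(w, r, lam)
    print(f"Lambda={lam}: [{lo:.3f}, {hi:.3f}]")
```

If a policy decision only holds at Λ close to 1, the conclusion is fragile to even mild unobserved confounding; that is the judgment these curves are meant to support.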

📚 Learn the Library

Recommended learning path:

  1. 📖 Installation Guide
  2. 🚀 Quickstart Tutorial
  3. 🔍 Diagnostics Guide
  4. 📈 Sensitivity Analysis
  5. 🧪 Benchmarking Workflow


🤝 Contributing

We welcome contributions!


📄 Citation

If you use CausalRL in academic work, please cite:

@software{causalrl,
  author = {Saco, Gabriel},
  title = {CausalRL: Estimand-first Causal Reinforcement Learning},
  year = {2024},
  url = {https://github.com/gsaco/causalrl}
}

Or use the "Cite this repository" button on GitHub.


📜 License

MIT © Gabriel Saco


Built with ❤️ for the causal inference and reinforcement learning communities
