
Conversation

@NathanielF
Contributor

@NathanielF NathanielF commented Nov 19, 2025

Just a draft for the minute. Working through some ideas.


📚 Documentation preview 📚: https://causalpy--568.org.readthedocs.build/en/568/

Comment on lines 54 to 57
:param vs_prior_type : str or None, default=None
Type of variable selection prior: 'spike_and_slab', 'horseshoe', or None.
If None, uses standard normal priors.
:param vs_hyperparams : dict, optional
Contributor

This is sphinx format and not numpy
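For reference, a sketch of the same block in numpydoc style (content copied from the diff above; the `vs_hyperparams` description is left incomplete there too):

```rst
Parameters
----------
vs_prior_type : str or None, default=None
    Type of variable selection prior: 'spike_and_slab', 'horseshoe', or None.
    If None, uses standard normal priors.
vs_hyperparams : dict, optional
```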

Collaborator

We should add that into AGENTS.md if it's not already there.

Contributor Author

> This is sphinx format and not numpy

You got me. The docstrings were AI generated. Will fix.

Provides continuous shrinkage with heavy tails, allowing strong signals
to escape shrinkage while weak signals are dampened:
β_j = τ · λ̃_j · β_j^raw
Collaborator

We should be able to add maths in here for nice rendering in the API docs
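For example, the shrinkage relation quoted above could be written with a Sphinx math directive (a sketch only, assuming the docstring is otherwise numpydoc-formatted):

```rst
.. math::

    \beta_j = \tau \, \tilde{\lambda}_j \, \beta_j^{\mathrm{raw}}
```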

Signed-off-by: Nathaniel <[email protected]>
@review-notebook-app
Check out this pull request on ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.



Signed-off-by: Nathaniel <[email protected]>
@codecov

codecov bot commented Nov 20, 2025

Codecov Report

❌ Patch coverage is 94.17808% with 17 lines in your changes missing coverage. Please review.
✅ Project coverage is 93.31%. Comparing base (2d6bba7) to head (18da6c4).

Files with missing lines                Patch %   Lines
causalpy/variable_selection_priors.py   89.34%    6 Missing and 7 partials ⚠️
causalpy/pymc_models.py                 90.47%    2 Missing and 2 partials ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #568      +/-   ##
==========================================
+ Coverage   93.27%   93.31%   +0.04%     
==========================================
  Files          37       39       +2     
  Lines        5632     5911     +279     
  Branches      367      386      +19     
==========================================
+ Hits         5253     5516     +263     
- Misses        248      256       +8     
- Partials      131      139       +8     

☔ View full report in Codecov by Sentry.

Signed-off-by: Nathaniel <[email protected]>
@NathanielF NathanielF marked this pull request as ready for review November 22, 2025 08:22
@NathanielF
Contributor Author

Marking this one as ready for review. There is still some work to be done on the notebook illustrating the functionality, but I think there is enough here that it's worth flagging the architecture choices for discussion. I've made the variable selection priors available as a module. Currently it's just integrated with the IV class, but in principle it can be dropped into any regression-based module with coefficients. The pattern simply requires an if-else block in, e.g., the propensity score model, the linear regression model, etc.

What do you guys think?
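For concreteness, here is a rough sketch of the if-else pattern described above; the helper name, arguments, and priors are illustrative assumptions rather than the actual CausalPy API:

```python
import pymc as pm


def make_coefficient_prior(name, k, vs_prior_type=None, vs_hyperparams=None):
    """Return a length-k coefficient vector, optionally with a variable selection prior.

    Hypothetical helper; must be called inside an active pm.Model() context.
    """
    hp = vs_hyperparams or {}
    if vs_prior_type is None:
        # Standard normal priors (the existing default behaviour).
        return pm.Normal(name, mu=0.0, sigma=1.0, shape=k)
    elif vs_prior_type == "horseshoe":
        # Plain (non-regularised) horseshoe: beta = tau * lambda * beta_raw.
        tau = pm.HalfCauchy(f"{name}_tau", beta=hp.get("tau_scale", 1.0))
        lam = pm.HalfCauchy(f"{name}_lam", beta=1.0, shape=k)
        raw = pm.Normal(f"{name}_raw", mu=0.0, sigma=1.0, shape=k)
        return pm.Deterministic(name, tau * lam * raw)
    else:
        # 'spike_and_slab' (or others) would be handled analogously.
        raise NotImplementedError(f"vs_prior_type={vs_prior_type!r} not sketched here")
```

Each regression-based model (propensity score, linear regression, etc.) would then call a helper like this when constructing its coefficient vector.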

Signed-off-by: Nathaniel <[email protected]>
@juanitorduz
Collaborator

Great @NathanielF ! I think the notebooks need a bit more storyline and explanation 🙏

@NathanielF
Contributor Author

Cool, thanks @juanitorduz . I can take another pass at it this weekend.

@drbenvincent drbenvincent added the enhancement New feature or request label Dec 5, 2025
@cursor

cursor bot commented Dec 9, 2025

PR Summary

Introduces reusable variable selection priors and integrates them into instrumental variables (IV) regression, with an option for binary treatments.

  • New causalpy/variable_selection_priors.py: spike-and-slab and horseshoe priors via VariableSelectionPrior, helpers for inclusion probabilities and shrinkage; exported in __init__.py
  • IV experiment/model updates: InstrumentalVariable and InstrumentalVariableRegression accept vs_prior_type, vs_hyperparams, and binary_treatment; outcome/treatment beta can use VS priors; adds binary-treatment likelihood (Bernoulli) with correlated latent errors (rho) and adjusted default priors
  • Propensity score outcome model: adds spline_knots parameter and uses it to size B-splines
  • Tests: new integration tests for IV with binary treatment and VS priors; unit tests for prior factories
  • Docs: adds iv_vs_priors.ipynb to IV toctree; updates interrogate badge

Written by Cursor Bugbot for commit 18da6c4. This will update automatically on new commits.

Signed-off-by: Nathaniel <[email protected]>
the assumption of a simple IV experiment.
The coefficients should be interpreted appropriately."""
We will use the multivariate normal likelihood
for continuous treatment."""

Bug: Validation warning ignores binary_treatment flag setting

The input_validation method checks if the treatment variable has more than 2 unique values and warns that "We will use the multivariate normal likelihood for continuous treatment." However, this warning doesn't account for the new binary_treatment parameter. If a user sets binary_treatment=True while having continuous treatment data, the warning incorrectly suggests MVN will be used, when actually the Bernoulli likelihood will be applied (which would fail on non-binary data). The validation needs to cross-check the actual data against the self.binary_treatment flag.
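A possible shape for that cross-check, written as a standalone helper for illustration (names are assumptions based on the description above, not CausalPy internals):

```python
import warnings

import pandas as pd


def check_treatment_likelihood(data: pd.DataFrame, treatment: str, binary_treatment: bool) -> None:
    """Cross-check the treatment column against the binary_treatment flag."""
    n_unique = data[treatment].nunique()
    if binary_treatment and n_unique > 2:
        raise ValueError(
            "binary_treatment=True, but the treatment column has more than two "
            "unique values; the Bernoulli likelihood requires a binary treatment."
        )
    if not binary_treatment and n_unique > 2:
        warnings.warn(
            "More than two unique treatment values detected; using the multivariate "
            "normal likelihood for continuous treatment."
        )
```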


Signed-off-by: Nathaniel <[email protected]>
@NathanielF NathanielF closed this Dec 9, 2025
@NathanielF NathanielF reopened this Dec 9, 2025
@NathanielF
Contributor Author

> Great @NathanielF ! I think the notebooks need a bit more storyline and explanation 🙏

Took another pass, @juanitorduz. Should be more friendly now.

@drbenvincent
Collaborator

I'll attempt to review this week before I down tools for winter break. FYI, I was not planning on another CausalPy release in 2025. I think there is a lot in the pipeline, so we can start 2026 with some nice major feature releases.

@@ -0,0 +1,2603 @@
{
Collaborator

@juanitorduz juanitorduz Dec 15, 2025

Line #1.    def inv_logit(z):

We could use https://docs.scipy.org/doc/scipy/reference/generated/scipy.special.expit.html right?
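For reference, the suggested swap (the array `z` here is just illustrative):

```python
import numpy as np
from scipy.special import expit

z = np.linspace(-3, 3, 7)
p = expit(z)  # same as 1 / (1 + np.exp(-z)), i.e. the hand-rolled inv_logit
```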



Contributor Author

used expit

@@ -0,0 +1,2603 @@
{
Collaborator

@juanitorduz juanitorduz Dec 15, 2025

Line #82.    data

maybe use data.info() or data.describe()? There are too many columns (or keep both .head() and .info()?)



Contributor Author

swapped to data.info()

@@ -0,0 +1,2603 @@
{
Collaborator

@juanitorduz juanitorduz Dec 15, 2025

Some math formulas can help explain this: after reading this paragraph, I do not understand what temperature actually does (maybe use a tip box for these math formulas?)



Contributor Author

Added a much more extensive write-up of the maths.

@@ -0,0 +1,2603 @@
{
Collaborator

@juanitorduz juanitorduz Dec 15, 2025

can we add titles to each of these subplots?



Contributor Author

done

@@ -0,0 +1,2603 @@
{
Collaborator

@juanitorduz juanitorduz Dec 15, 2025

Line #10.        # "mp_ctx": "spawn",

remove commented code # "mp_ctx": "spawn",?



Contributor Author

removed.

@@ -0,0 +1,2603 @@
{
Collaborator

@juanitorduz juanitorduz Dec 15, 2025

maybe plot these into different axes (one on top of the other?) and use ref_val=3?
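A minimal sketch of that suggestion (using a stand-in ArviZ dataset; the real call would pass the fitted model's InferenceData and variable names):

```python
import arviz as az

idata = az.load_arviz_data("centered_eight")  # stand-in posterior for illustration
# Stack the panels vertically and mark the reference value of 3.
az.plot_posterior(idata, var_names=["mu", "tau"], ref_val=3, grid=(2, 1))
```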



Contributor Author

done

@@ -0,0 +1,2603 @@
{
Collaborator

@juanitorduz juanitorduz Dec 15, 2025

LaTeX typo: tau_0?



Contributor Author

fixed typo

@@ -0,0 +1,2603 @@
{
Collaborator

@juanitorduz juanitorduz Dec 15, 2025

again: add titles to the subplots



Contributor Author

added titles

@@ -0,0 +1,2603 @@
{
Collaborator

@juanitorduz juanitorduz Dec 15, 2025

suggestion: split the plots as suggested in the case above



Contributor Author

Split the plot. The only thing I liked about the overlap plot was that it made it a bit clearer how much tighter the inference was with VS priors, but you're right, they're a bit ugly.

@@ -0,0 +1,2603 @@
{
Collaborator

@juanitorduz juanitorduz Dec 15, 2025

same here (overlapping plots look confusing)



Contributor Author

done.

Collaborator

@juanitorduz juanitorduz left a comment

This is awesome @NathanielF ! Just some minor comments :)

Signed-off-by: Nathaniel <[email protected]>
@NathanielF
Contributor Author

On the Cholesky decomposition @juanitorduz, I re-parameterised again in the binary case. We refactored the joint likelihood into a conditional likelihood.

Key changes:

  • Elimination of the Cholesky decomposition: we no longer draw from a multivariate distribution. Instead, we draw the latent treatment error ($V$) and use it to adjust the mean of the outcome equation, giving a conditional formulation.

  • Introduction of the correction term: we added expected_U = rho * (sigma_U / sigma_V) * V to the outcome mean. This term "soaks up" the variation in the outcome that is correlated with treatment assignment.

  • Logistic error parameterisation: to improve sampling stability and match the invlogit link function, we parameterised $V$ using inverse transform sampling from a Uniform distribution into a standard Logistic distribution.

Cursor etc. was pushing me towards an inverse probit formulation, but I just found it much more brittle than the logit.
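For readers following along, a minimal PyMC sketch of the inverse-transform step and the correction term (variable names and shapes are illustrative, not the actual CausalPy implementation):

```python
import numpy as np
import pymc as pm

n = 100  # illustrative sample size

with pm.Model():
    rho = pm.Uniform("rho", lower=-1.0, upper=1.0)
    sigma_U = pm.HalfNormal("sigma_U", sigma=1.0)
    sigma_V = np.pi / np.sqrt(3.0)  # standard deviation of a standard Logistic error

    # Inverse transform sampling: u ~ Uniform(0, 1)  =>  V = log(u / (1 - u))
    # follows a standard Logistic distribution, matching the invlogit link.
    u = pm.Uniform("u", lower=0.0, upper=1.0, shape=n)
    V = pm.Deterministic("V", pm.math.log(u / (1.0 - u)))

    # Correction term added to the outcome mean: it soaks up the variation in the
    # outcome that is correlated with treatment assignment.
    expected_U = pm.Deterministic("expected_U", rho * (sigma_U / sigma_V) * V)
    # expected_U then enters the outcome equation's mean, and the treatment gets
    # a Bernoulli likelihood (details omitted here).
```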

Signed-off-by: Nathaniel <[email protected]>
