Time-Slice CV Class #2037

timbo112711 · 2025-10-26T02:46:37Z

Description

Currently to implement Time-Slice CV one would need to reference the example notebook and copy / pasta the code blocks. This PR aims to wrap the code inside the current notebook into a TimeSliceCrossValidator class. This new class will be used to streamline how users implement this methodology.

The class functions similar to the scikit-learn / generator approach for returning the spilts.

A simple flow would look the this,

# Initialize cross-validator
cv = TimeSliceCrossValidator(
    n_init=163,
    forecast_horizon=12,
    date_column="date",
    step_size=1,
)

# We can check how many splits we will have
# As a reference, the number of splits is computed as:
# n_iterations = y.size - n_init - forecast_horizon + 1
n_splits = cv.get_n_splits(X, y)

# Run the CV!
results = cv.run(
    X,
    y,
    yaml_path="<your-path-to-saved-model>",
)

Related Issue

Closes Time-Slice CV Class #1964
Related to #

Checklist

Checked that the pre-commit linting/style checks pass. Feel free to comment pre-commit.ci autofix to auto-fix.
Included tests that prove the fix is effective or that the new feature works
Added necessary documentation (docstrings and/or example notebooks) using numpydoc format.
If you are a pro: each commit corresponds to a relevant logical change

📚 Documentation preview 📚: https://pymc-marketing--2037.org.readthedocs.build/en/2037/

…g the date ranges (hence all geo levels for those dates) are kept.

review-notebook-app · 2025-10-26T02:46:42Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

timbo112711 · 2025-10-26T02:48:02Z

add tests
fix bug in CRSP method

codecov · 2025-10-26T02:49:47Z

Codecov Report

❌ Patch coverage is 11.65919% with 197 lines in your changes missing coverage. Please review.
✅ Project coverage is 62.45%. Comparing base (162e1a7) to head (f6c00f6).
⚠️ Report is 5 commits behind head on main.

Files with missing lines	Patch %	Lines
pymc_marketing/mmm/time_slice_cross_validation.py	11.65%	197 Missing ⚠️

❗ There is a different number of reports uploaded between BASE (162e1a7) and HEAD (f6c00f6). Click for more details.

HEAD has 5 uploads less than BASE

Flag BASE (162e1a7) HEAD (f6c00f6)

23 18

Additional details and impacted files

@@             Coverage Diff             @@
##             main    #2037       +/-   ##
===========================================
- Coverage   92.86%   62.45%   -30.42%     
===========================================
  Files          68       69        +1     
  Lines        9213     9596      +383     
===========================================
- Hits         8556     5993     -2563     
- Misses        657     3603     +2946

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

juanitorduz · 2025-10-27T08:29:53Z

Thanks @timbo112711 ! I will look inot this one carefully this week :)

cetagostini · 2025-10-27T10:41:54Z

pymc_marketing/mmm/time_slice_cross_validation.py

+        sampler_config: dict | None = None,
+        yaml_path: str | None = None,
+        mmm: Any | None = None,
+    ):


missing hint.

cetagostini · 2025-10-27T10:47:12Z

Hey, really nice implementation and critical feature we are missing!

A few requests on my side:

I see some test failings, lets try to get those work.
Then let's move the plots to the plot suite, instead of plot_param_stability better to use the suite in the class as property (plot.param_stability). Probably once you add the plots in the plot suite, you need to make sure they are agnostic to several dims (You can follow how other plots works), and plus add an small validator in the plots to see if the idata came from MMM or a cross validation, this relate to my next comment.
You can make the posteriors a single xarray object, instead of a list. In my opinion you have two options, an inference data object with each model as a variable (user can even decide based on the number of splits the name of each variable, we can have default names as in model builder) or a single xarray with model dimension. Any of those would be better and will integrate better with the plot suite API.

This things align a lot more with the vision we are following right now with the class! Could you take a look? Otherwise, I can help you! @timbo112711

Copilot

Pull Request Overview

This PR introduces a new TimeSliceCrossValidator class to standardize time-slice cross-validation for Media Mix Models (MMM). Previously, users had to copy code from example notebooks; now they can use this scikit-learn-style class with methods like split(), get_n_splits(), and run().

Key Changes:

New TimeSliceCrossValidator class with configurable n_init, forecast_horizon, date_column, and step_size parameters
Support for YAML-based model configuration or programmatic MMM instance passing
Comprehensive test suite covering basic functionality, step_size variations, and edge cases

Reviewed Changes

Copilot reviewed 2 out of 3 changed files in this pull request and generated 3 comments.

File	Description
`pymc_marketing/mmm/time_slice_cross_validation.py`	Implements the core `TimeSliceCrossValidator` class with cross-validation logic and visualization methods
`tests/mmm/test_time_slice_cross_validator.py`	Comprehensive test suite validating initialization, splitting logic, model fitting, and edge cases

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

pymc_marketing/mmm/time_slice_cross_validation.py

tests/mmm/test_time_slice_cross_validator.py

pymc_marketing/mmm/time_slice_cross_validation.py

timbo112711 · 2025-10-27T14:07:50Z

Hey, really nice implementation and critical feature we are missing!

A few requests on my side:

I see some test failings, lets try to get those work.

Then let's move the plots to the plot suite, instead of plot_param_stability better to use the suite in the class as property (plot.param_stability). Probably once you add the plots in the plot suite, you need to make sure they are agnostic to several dims (You can follow how other plots works), and plus add an small validator in the plots to see if the idata came from MMM or a cross validation, this relate to my next comment.

You can make the posteriors a single xarray object, instead of a list. In my opinion you have two options, an inference data object with each model as a variable (user can even decide based on the number of splits the name of each variable, we can have default names as in model builder) or a single xarray with model dimension. Any of those would be better and will integrate better with the plot suite API.

This things align a lot more with the vision we are following right now with the class! Could you take a look? Otherwise, I can help you! @timbo112711

@cetagostini thanks for the review 🙌🏻! These make sense and I'll start working towards each of these. Will reach out if I need anything!

…lice-cv-class

timbo112711 added 6 commits October 21, 2025 20:28

adds TimeSliceCrossValidator class

a186c6c

when passing an already built model object, we can use build_model()

dd906fd

update split indexing to boolean masks which ensures all rows matchin…

bf1f9af

…g the date ranges (hence all geo levels for those dates) are kept.

updates plot_predictions() method

3de0418

updates OOS plot

40f481f

Merge branch 'main' into time-slice-cv-class

5237fad

timbo112711 requested review from TeemuSailynoja, juanitorduz and williambdean October 26, 2025 02:46

timbo112711 self-assigned this Oct 26, 2025

github-actions bot added docs Improvements or additions to documentation MMM labels Oct 26, 2025

timbo112711 added 3 commits October 25, 2025 22:51

updates plot_crps() method

c2d51fe

updates CRSP plot!

775e927

adds tests for the new class

a02e38d

github-actions bot added the tests label Oct 27, 2025

Merge branch 'main' into time-slice-cv-class

9c91fd4

cetagostini reviewed Oct 27, 2025

View reviewed changes

juanitorduz requested a review from Copilot October 27, 2025 13:32

Copilot AI reviewed Oct 27, 2025

View reviewed changes

pymc_marketing/mmm/time_slice_cross_validation.py Outdated Show resolved Hide resolved

tests/mmm/test_time_slice_cross_validator.py Outdated Show resolved Hide resolved

pymc_marketing/mmm/time_slice_cross_validation.py Show resolved Hide resolved

timbo112711 added 3 commits October 27, 2025 10:10

removes y_combined since we do not use it

0d415e6

Merge remote-tracking branch 'origin/time-slice-cv-class' into time-s…

04e5213

…lice-cv-class

removes outdated comment in tests

f6c00f6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Time-Slice CV Class #2037

Time-Slice CV Class #2037

timbo112711 commented Oct 26, 2025 •

edited by github-actions bot

Loading

Uh oh!

review-notebook-app bot commented Oct 26, 2025

Uh oh!

timbo112711 commented Oct 26, 2025 •

edited

Loading

Uh oh!

codecov bot commented Oct 26, 2025 •

edited

Loading

Uh oh!

juanitorduz commented Oct 27, 2025

Uh oh!

cetagostini Oct 27, 2025

Uh oh!

cetagostini commented Oct 27, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

timbo112711 commented Oct 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Time-Slice CV Class #2037

Are you sure you want to change the base?

Time-Slice CV Class #2037

Conversation

timbo112711 commented Oct 26, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Related Issue

Checklist

Uh oh!

review-notebook-app bot commented Oct 26, 2025

Uh oh!

timbo112711 commented Oct 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Oct 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

juanitorduz commented Oct 27, 2025

Uh oh!

cetagostini Oct 27, 2025

Choose a reason for hiding this comment

Uh oh!

cetagostini commented Oct 27, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

timbo112711 commented Oct 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

timbo112711 commented Oct 26, 2025 •

edited by github-actions bot

Loading

timbo112711 commented Oct 26, 2025 •

edited

Loading

codecov bot commented Oct 26, 2025 •

edited

Loading