-
Hi - for the CLV element of this model, I reviewed the section of the data requirements, and it looks like it's only concrete numbers related to purchases, customer age, etc. Is is possible to include other customer attributes in this model to help predict CLV (such as age group, geographic location, income range, etc.)? Are there any concerns adding these additional data points within the model? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
Hey @acuson,
You would parametrize the model as shown: ParetoNBDModel(data, model_config = {
"purchase_covariate_cols": ['covariate_col_name1', 'covariate_col_name2', ...],
"dropout_covariate_cols": ['covariate_col_name1', 'covariate_col_name2', ...]
}
) I'm working on a notebook with a covariate example in fact, and will be posting opening that PR within the next few days. Many of the covariate examples you mentioned are categorical. One-hot encoding would make for more interpretable results, but watch out for the dummy variable trap: |
Beta Was this translation helpful? Give feedback.
Hey @acuson,
ParetoNBDModel
currently supports time invariant covariates, and other models will support this in the future.You would parametrize the model as shown:
I'm working on a notebook with a covariate example in fact, and will be posting opening that PR within the next few days.
Many of the covariate examples you mentioned are categorical. One-hot encoding would make for more interpretable results, but watch out for the dummy variable trap:
Encoding Categorical Variables: …