-
Notifications
You must be signed in to change notification settings - Fork 63
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Potential sign of bug in confidence calculations #415
Comments
Assuming you are using a crosscat metamodel, not gpmcc, I happened on a possible explanation for this behavior earlier this week. # TODO: multistate impute doesn't exist yet
# e,confidence = su.impute_and_confidence_multistate(M_c, X_L, X_D, Y, Q, n,
# self.get_next_seed)
Then the confidence estimate makes an entirely new crosscat state from the posterior sample, trains it for 100 iterations, and returns the mean frequency of the maximum-likelihood category over those training iterations. Since that state is being trained on a sample from a gaussian, it's not surprising that the ML category has very high frequency. Essentially, the confidence-estimate code never gets to see the other mode of the posterior sample.
|
Makes sense, thanks! |
If you don't mind, I think it would be good to keep this issue open. It looks like you have brought a serious bug to light. |
@curlette, can you send the bdb file to alx@<the rest of my github name>.com, please?
I misread the code in the link. The first model is used only for the confidence calculation. The imputation sampling is done over all models, so you would expect the two modes to appear in the samples generated by |
During my analysis of the College Scorecard data I came across the following:
The plot below is of 500 simulated scores for a school whose tuition was inferred with 94% confidence (Everest Univ. Jacksonville).
It is noticeably somewhat bimodal, which we would expect to lower the confidence.
@fsaad @vkmvkmvkmvkm @raxraxraxraxrax @gregory-marton
The text was updated successfully, but these errors were encountered: