Conversation
This adds a positive-part James-Stein estimator that doesn't depend on the distribution of the outcome.
Rename from a dumb name to JamesSteinLearner. Also adds a fix to the stochastic bandit, which wasn't working before.
This provides functions for generating samples from the (normal) posteriors of MLELearners and James-Stein learners.
MLELearner was calculating the SD, but we want to calculate the SE.
```diff
 if nᵢ == 1
     learner.oldMs[a] = r
-    learner.Ss[a] = 0.0
+    learner.Ss[a] = learner.σ₀
```
Are these bug fixes?
Not clear that I should really be doing this. We're already abusing the learner.σ₀ notation a bit, since we aren't "really" using a prior as the notation would suggest. This just ensures that the standard error is never actually exactly zero. For a Bernoulli DGP, we may have an estimated standard deviation of zero for a while until we finally observe a success. So this is basically like an Agresti-Coull estimate, with this change.
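To illustrate the problem being worked around, here is a minimal Python sketch (illustrative only, not the PR's Julia code; `sigma0` stands in for `learner.σ₀`): with a Bernoulli-like reward stream, the plug-in standard error is exactly zero until the first success, and seeding the sum of squares with a small positive value keeps it positive, in the spirit of an Agresti-Coull-style adjustment.

```python
import math

def standard_error(rewards, sigma0=0.0):
    # sigma0 seeds the sum of squared deviations, mirroring the
    # `learner.Ss[a] = learner.σ₀` change discussed above
    n = len(rewards)
    mean = sum(rewards) / n
    ss = sigma0 + sum((r - mean) ** 2 for r in rewards)
    if n == 1:
        return math.sqrt(ss)  # degenerate single-observation case
    return math.sqrt(ss / (n - 1) / n)  # standard error of the mean

rewards = [0.0, 0.0, 0.0, 0.0]          # Bernoulli arm, no successes yet
print(standard_error(rewards))           # 0.0 -- degenerate SE
print(standard_error(rewards, 1e-2))     # small but positive SE
```

With the unseeded estimate, an algorithm using the SE for exploration would treat this arm as having zero uncertainty.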
Hmm. This seems like we should potentially be using something like NaN instead. Or at least not calling this MLE anymore.
```diff
 learner.oldMs[a] = learner.newMs[a]
 learner.μs[a] = learner.newMs[a]
-learner.σs[a] = sqrt(learner.Ss[a] / (nᵢ - 1))
+learner.σs[a] = sqrt(learner.Ss[a] / (nᵢ - 1) / nᵢ)
```
Was this a bug as well?
Yes, and it was a really fun one to track down. Doing anything with MLELearner seems like it's been broken for a while (learner.σs was the estimated standard deviation of the data, not the standard error of the mean).
Wait, I think there's some confusion: this field was always supposed to be the standard deviation, not the standard error. So I think other places in the code are the problem rather than this line.
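To make the disputed distinction concrete, a small Python sketch (illustrative only, not the PR's Julia code): under the usual `Ss` bookkeeping, the sample standard deviation divides by `n - 1`, while the standard error of the mean divides by `n` once more, which is exactly the difference between the two sides of the diff above.

```python
import math

data = [1.0, 2.0, 3.0, 4.0, 5.0]
n = len(data)
mean = sum(data) / n
Ss = sum((x - mean) ** 2 for x in data)  # sum of squared deviations

sd = math.sqrt(Ss / (n - 1))        # standard deviation of the data
se = math.sqrt(Ss / (n - 1) / n)    # standard error of the mean
print(sd, se)                        # se == sd / sqrt(n)
```

Which one `σs` should hold is a naming/intent question; the two quantities differ by a factor of `sqrt(n)`.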
```julia
learner.oldMs[a] = learner.newMs[a]
learner.ys[a] = learner.newMs[a]
learner.ss[a] = learner.Ss[a] / (nᵢ - 1) / nᵢ
y̅ = mean(learner.ys)
```
Probably not a problem, but I want to confirm that these steps change the inferred means for all arms, not just the observed arm. I don't think we assume invariance anywhere, but just confirming.
Yeah, that's correct. Every data point changes the predictions (slightly, due to the shrinkage to an updated global mean) for all data points.
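As an illustration of that coupling, here is a Python sketch of the positive-part James-Stein step from the diff (names hypothetical; the pooled sampling variance is taken as a scalar `s2` here for simplicity): observing one arm moves the grand mean and the shrinkage factor, so every arm's estimate shifts.

```python
def js_means(ys, s2, K):
    # positive-part James-Stein: shrink each arm toward the grand mean
    ybar = sum(ys) / K
    denom = sum((y - ybar) ** 2 for y in ys) / (K - 3)
    phi = min(1.0, s2 / denom)  # shrinkage factor, clipped at 1
    return [ybar + (1 - phi) * (y - ybar) for y in ys]

ys = [0.1, 0.5, 0.9, 0.4]
before = js_means(ys, 0.05, len(ys))
ys[0] = 0.2                     # new observation updates arm 0 only
after = js_means(ys, 0.05, len(ys))
# every entry of the estimate vector moves, not just arm 0's
print([b != a for b, a in zip(before, after)])  # [True, True, True, True]
```

This requires `K > 3` for the `K - 3` denominator to be positive, matching the formula in the diff.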
```julia
learner.ss[a] = learner.Ss[a] / (nᵢ - 1) / nᵢ
y̅ = mean(learner.ys)
φs = min(1.0, learner.ss ./ (sumabs2(learner.ys - y̅) ./ (learner.K - 3)))
learner.μs[:] = y̅ + (1 - φs) .* (learner.ys - y̅)
```
Very minor, but these computations allocate memory. If we update this to a newer version of Julia, the following should be allocation-free:

```julia
learner.μs .= y̅ .+ (1 .- φs) .* (learner.ys .- y̅)
```
Ah, I've already been running on 0.5.1, didn't know this existed.
```julia
y̅ = mean(learner.ys)
φs = min(1.0, learner.ss ./ (sumabs2(learner.ys - y̅) ./ (learner.K - 3)))
learner.μs[:] = y̅ + (1 - φs) .* (learner.ys - y̅)
learner.σs[:] = sqrt(
```
Same allocation concern.
Should we chat for five minutes over Messenger or VC to figure out what we should do here? I think the main problem here is a lack of documentation in this code about the intent of various fields.
This doesn't address the concern over learner.σs containing the std dev rather than the std error, but it fixes a problem with the underlying calculation.
This is for generic distributions, not just Beta-Binomial. It calculates an online mean and variance (now the variance of the mean) like MLELearner, but using the James-Stein estimator we have been using.
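As a rough illustration of that combination (a sketch only, with hypothetical names, not the PR's Julia code): Welford-style online updates maintain each arm's running mean and sum of squared deviations, from which the variance of the mean falls out the same way `learner.ss[a]` is computed above.

```python
class OnlineArm:
    """Welford-style online mean/variance for a single arm."""
    def __init__(self):
        self.n = 0
        self.mean = 0.0
        self.Ss = 0.0  # running sum of squared deviations

    def update(self, r):
        self.n += 1
        delta = r - self.mean
        self.mean += delta / self.n
        self.Ss += delta * (r - self.mean)  # numerically stable update

    def var_of_mean(self):
        # variance of the mean, not of the data: divide by n - 1 and by n
        if self.n < 2:
            return float("inf")
        return self.Ss / (self.n - 1) / self.n

arm = OnlineArm()
for r in [1.0, 2.0, 3.0]:
    arm.update(r)
print(arm.mean, arm.var_of_mean())  # 2.0 0.3333333333333333
```

The per-arm `var_of_mean()` values would then feed the James-Stein shrinkage step in place of a distribution-specific posterior variance.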