Make algorithms work with batched updates by eytan · Pull Request #28 · eytan/Bandits.jl

eytan · 2015-02-22T22:34:18Z

Many of the algorithms get stuck on arms, either because they expect
the learner’s counts to be updated from zero, or because it always
selects the first arm with the maximum value (which is just the first
arm, when we initialize an MLELearner object to initially take on some
fixed mean for all arms). This fixes these issues, and makes sure
every arm is played at least once (in cases where this is the expected
behavior), and randomly selects among best arms if there are multiple
best arms. With this change, all algorithms should in expectation
sample from all arms with equal probability during the first batch.

Many of the algorithms get stuck on arms, either because they expect the learner’s counts to be updated from zero, or because it always selects the first arm with the maximum value (which is just the first arm, when we initialize an MLELearner object to initially take on some fixed mean for all arms). This fixes these issues, and makes sure every arm is played at least once (in cases where this is the expected behavior), and randomly selects among best arms if there are multiple best arms. With this change, all algorithms should in expectation sample from all arms with equal probability during the first batch.

Also make plots a little easier to read.

johnmyleswhite · 2015-02-22T23:05:21Z

src/04_algorithms/05_ucb/01_ucb1.jl

I think we should use require this information to be provided by the learner. MLELearner did provide this information and so would StreamStats objects.

The issue is that nobs only gets updated when the learner gets updated, which only happens at the end of the batch.

johnmyleswhite · 2015-02-22T23:07:54Z

Made some comments in-line. Let me know if you want help implementing the strategy I proposed.

eytan added 2 commits February 22, 2015 14:33

Fixed MOSS

a7fd393

Also make plots a little easier to read.

johnmyleswhite reviewed Feb 22, 2015
View reviewed changes

johnmyleswhite mentioned this pull request Feb 22, 2015

Algorithms should randomize max arm #27

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make algorithms work with batched updates#28

Make algorithms work with batched updates#28
eytan wants to merge 2 commits intomasterfrom
eytan/batch_friendly

eytan commented Feb 22, 2015

Uh oh!

johnmyleswhite Feb 22, 2015

Uh oh!

eytan Feb 22, 2015

Uh oh!

johnmyleswhite commented Feb 22, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

eytan commented Feb 22, 2015

Uh oh!

johnmyleswhite Feb 22, 2015

Choose a reason for hiding this comment

Uh oh!

eytan Feb 22, 2015

Choose a reason for hiding this comment

Uh oh!

johnmyleswhite commented Feb 22, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants