Extend sampler tests #200
Conversation
LGTM
Hello @sunggg! Could you check the testing of the sampler?
Thank you @Ailurus1 for the great contribution!
Overall, it looks great; I have a couple of suggestions.
As for the comments about the more complicated tests, I don't think we have to address them in this PR, so we can merge it before the migration to the new repo, but I'd like us to follow up on them there.
serve/tests/unittest/test_sampler.py
Outdated
def test_logit_bias_checker():
    # logit bias values must be [-100, 100]
    get_sampling_state([SamplingParams(logit_bias={1: 100, 3: -100, 2: 2})])
    get_sampling_state([SamplingParams(logit_bias={34: 0, 23: -0.5})])
    # TODO(@team): it seems like the valid range is [1, vocab_size]. Double check.
Can we double-check this and remove the TODO line if it turns out to be true?
I didn't find any descriptive API reference for the indices in logit_bias, but I found that in transformers the similar parameter sequence_bias must be >= 0. However, its error message says the value has to be positive, so whether 0 is allowed is not entirely clear. I assume 0 can be used as well, since these are token indices and there is a token (<unk>) with id 0, so I changed the range to [0, vocab_size) in the last commits.
I see. Can we follow up on this after this PR? This could be a rough edge, so I'd like to address it before we run into issues with customers. Maybe we can do a deep dive into transformers or TGI and match our behavior to theirs.
serve/tests/unittest/test_sampler.py
Outdated
temperature = temperatures[i]
if temperature < SAMPLING_EPS:
    temperature = 1.0
rep_pen = torch.where(
Since this is basically the same computation as in adjust_logits, can we implement another approach that does the same thing, to cross-check?
I tried to do it in a somewhat more naive way in the last commits.
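For illustration, the cross-check pattern could look roughly like this (not the repo's code; the penalty convention of dividing positive logits and multiplying negative ones, and the temperature handling, are assumptions for the example):

```python
import torch

SAMPLING_EPS = 1e-5

# Naive, element-by-element reference for the same adjustment, used only to
# cross-check a vectorized torch.where implementation.
def naive_reference(logits: torch.Tensor, rep_pen: float, temperature: float) -> torch.Tensor:
    out = logits.clone()
    for i in range(out.numel()):
        out[i] = out[i] / rep_pen if out[i] > 0 else out[i] * rep_pen
    if temperature < SAMPLING_EPS:
        temperature = 1.0
    return out / temperature

# Vectorized counterpart of the same computation.
def vectorized(logits: torch.Tensor, rep_pen: float, temperature: float) -> torch.Tensor:
    out = torch.where(logits > 0, logits / rep_pen, logits * rep_pen)
    if temperature < SAMPLING_EPS:
        temperature = 1.0
    return out / temperature

logits = torch.randn(32)
assert torch.allclose(naive_reference(logits, 1.3, 0.8), vectorized(logits, 1.3, 0.8))
```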
    ),
    batch_size
):
    sampling_params = [SamplingParams(top_p=top_p, top_k=top_k) for top_p, top_k in top_pks]
ditto
It seems I don't see a test with temperature=0. Let's add this in the next PR.
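For the record, a sketch of what such a check could verify (a standalone illustration, not the repo's SamplingParams API): with temperature=0, sampling should collapse to greedy argmax.

```python
import torch

# Standalone illustration of the temperature=0 expectation: sampling collapses
# to greedy argmax. The real test would go through SamplingParams / get_sampling_state.
def sample(logits: torch.Tensor, temperature: float) -> int:
    if temperature == 0.0:
        return int(torch.argmax(logits).item())
    probs = torch.softmax(logits / temperature, dim=-1)
    return int(torch.multinomial(probs, num_samples=1).item())

def test_temperature_zero_is_greedy():
    logits = torch.tensor([0.1, 2.5, -1.0, 0.3])
    assert sample(logits, temperature=0.0) == 1
```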
assert isinstance(output.logprob_infos[idx].current_logprob, float)
assert output.logprob_infos[idx].top_token_ids.nelement() == 0
assert output.logprob_infos[idx].top_logprobs.nelement() == 0
Ditto. We also need to test more complicated scenarios, such as:
(1) Requests in the same batch ask for different top-k values, and some of them do not request logprobs.
(2) Logprobs can be used together with other sampling params.
I think this may take some time, so I would like us to follow up on this after this PR.
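A rough sketch of scenario (1), just to pin down the intent (the helper below is a stand-in, not the repo's sampler; the real test would go through SamplingParams):

```python
import pytest
import torch

# Stand-in sampler: per request, return the top-k logprobs when requested,
# or an empty tensor when logprobs are not requested.
def fake_sample_with_logprobs(logits: torch.Tensor, top_logprobs: list):
    logprobs = torch.log_softmax(logits, dim=-1)
    return [
        torch.topk(row, k).values if k > 0 else torch.empty(0)
        for row, k in zip(logprobs, top_logprobs)
    ]

@pytest.mark.parametrize("top_logprobs", [[5, 0, 1], [0, 0], [3]])
def test_mixed_logprob_requests_sketch(top_logprobs):
    logits = torch.randn(len(top_logprobs), 128)
    results = fake_sample_with_logprobs(logits, top_logprobs)
    for k, res in zip(top_logprobs, results):
        assert res.nelement() == k  # requests without logprobs get empty tensors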
    and make sure that _apply_top_p_top_k from sampler.py does not produce too many -inf values
    """)
@pytest.mark.parametrize("batch_size", [1, 4, 8, 12])
def test_mixture_of_requests(batch_size: int):
We should test this more exhaustively. Let's follow up in the next PR.
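One concrete property a follow-up could assert (with an illustrative filter below, not the repo's _apply_top_p_top_k): after top-k filtering, exactly k entries per row should remain finite, so -inf values cannot dominate the distribution.

```python
import torch

# Illustrative top-k filter: mask everything below the k-th largest logit with
# -inf, so exactly k entries per row stay finite (assuming no ties at the
# threshold, which is almost certain with random floats).
def apply_top_k(logits: torch.Tensor, k: int) -> torch.Tensor:
    kth = torch.topk(logits, k, dim=-1).values[..., -1:]
    return torch.where(logits < kth, torch.full_like(logits, float("-inf")), logits)

logits = torch.randn(4, 1000)
filtered = apply_top_k(logits, k=50)
assert torch.isfinite(filtered).sum(dim=-1).eq(50).all()
```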
Hello @sunggg
LGTM, thanks for making the sampler more robust!
Like we discussed, let's address the remaining feedback in the new repo.
@@ -422,6 +547,8 @@ def _test_json_mode(
    _test_stop(sync_engine)
    _test_logprobs(sync_engine)
    _test_logprobs_mixed_requests(sync_engine)
    _test_num_sequences(sync_engine)
    _test_logit_bias(sync_engine)
This test seems to always fail when I run all the tests in test_engine_with_samplers.py, but if I comment out the other tests and run only this one, it works, so there seems to be some strange interaction between the tests. @Ailurus1 Can you take a look and send a fix to the ollm repo?
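For what it's worth, this symptom (a test that fails in the full run but passes in isolation) usually points at shared state between tests. A minimal, hypothetical reproduction of that pattern, not a diagnosis of the actual cause here:

```python
# Hypothetical reproduction of an order-dependent failure: shared module state
# left behind by one test breaks the next one. Illustration only, not the actual
# cause in test_engine_with_samplers.py.
_shared_cache = {}

def test_populates_cache():
    _shared_cache["engine"] = "started"

def test_expects_clean_cache():
    # Passes when run alone, fails when run after test_populates_cache.
    assert "engine" not in _shared_cache
```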
No description provided.