
Disallow direct configuration of max_num_batched_tokens #75

Merged

Conversation

@sunggg (Member) commented Nov 21, 2023

Instead, we use max_num_sequences and max_input_len, since they are more intuitive.

cc. @masahi
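The intent of the change can be sketched as follows. This is an illustration only, not the PR's actual code: the derivation by multiplication and all default values are assumptions.

```python
import argparse

# Hypothetical sketch: instead of exposing --max-num-batched-tokens directly,
# the engine could derive an upper bound from the two more intuitive limits.
parser = argparse.ArgumentParser()
parser.add_argument("--max-num-sequences", type=int, default=8)
parser.add_argument("--max-input-len", type=int, default=512)
args = parser.parse_args([])  # use the (assumed) defaults for this sketch

# Worst case for one batch: every sequence is at its maximum input length.
max_num_batched_tokens = args.max_num_sequences * args.max_input_len
print(max_num_batched_tokens)  # 8 * 512 = 4096
```

Deriving the token bound this way removes one knob the user could misconfigure, which matches the PR title: the value is no longer set directly.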

@masahi (Member) left a comment

Please make sure that the benchmark and tests preserve their behavior after this change.

@@ -179,7 +178,6 @@ def main(args: argparse.Namespace):
parser.add_argument("--local-id", type=str, required=True)
parser.add_argument("--artifact-path", type=str, default="dist")
parser.add_argument("--use-staging-engine", action="store_true")
parser.add_argument("--max-num-batched-tokens", type=int, default=-1)
Review comment (Member):

Need to add max_num_sequences if you remove this

@sunggg (Member Author) replied:

Yep, just figured it out and added it :)

@@ -120,7 +119,6 @@ def test(args: argparse.Namespace):
parser.add_argument("--local-id", type=str, required=True)
parser.add_argument("--artifact-path", type=str, default="dist")
parser.add_argument("--num-shards", type=int, default=1)
parser.add_argument("--max-num-batched-tokens", type=int, default=-1)
Review comment (Member):
Need to add max_num_sequences if you remove this
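The reviewer's point, applied to the argparse block shown in the hunk above, could look like this. The flag spellings follow the PR discussion, but the default values and the example invocation are assumptions for illustration:

```python
import argparse

# Hypothetical sketch: after dropping --max-num-batched-tokens, the script
# adds the two replacement flags so the limit is still configurable indirectly.
parser = argparse.ArgumentParser()
parser.add_argument("--local-id", type=str, required=True)
parser.add_argument("--artifact-path", type=str, default="dist")
parser.add_argument("--num-shards", type=int, default=1)
parser.add_argument("--max-num-sequences", type=int, default=8)  # replaces the removed flag
parser.add_argument("--max-input-len", type=int, default=512)    # defaults are assumptions

# Example invocation for this sketch.
args = parser.parse_args(["--local-id", "test", "--max-num-sequences", "16"])
print(args.max_num_sequences)  # 16
```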

@sunggg sunggg merged commit a5deaed into octoml:batch-serving Nov 21, 2023
5 checks passed