[Sampling] Softmax Predicate patch #3383

babusid · 2025-11-24T20:20:24Z

Fixes softmax assigning probability to reserved tokens, which breaks the sampling kernel. This patch extracts the active vocabulary from the HF tokenizer and uses it to zero out the reserved tokens, so that probabilities are generated only for valid "active" tokens.

The issue is that the config read in the compile phase is a model-specific config object, defined in each model's implementation. Maybe the solution is not to add another field, but rather to overload or overwrite the vocab size parameter.
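For readers skimming the thread, the masking idea described above can be sketched in plain Python. This is only an illustration, not the kernel code from this patch: `masked_softmax` and `active_vocab_size` are hypothetical names, and the sketch assumes the active tokens occupy a contiguous prefix of the token ids, with reserved slots at the tail.

```python
import math


def masked_softmax(logits, active_vocab_size):
    """Softmax over the first `active_vocab_size` logits; the rest get probability 0.

    Assumption (not confirmed by the PR): active token ids form a contiguous
    prefix [0, active_vocab_size) and every later slot is reserved.
    """
    if not 0 < active_vocab_size <= len(logits):
        raise ValueError("active_vocab_size must be in (0, len(logits)]")
    active = logits[:active_vocab_size]
    peak = max(active)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in active]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Reserved slots never receive probability mass, whatever their logits are.
    return probs + [0.0] * (len(logits) - active_vocab_size)


# The last two slots stand in for reserved tokens with large spurious logits.
logits = [2.0, 1.0, 0.5, 9.0, 9.0]
probs = masked_softmax(logits, active_vocab_size=3)
```

Without the predicate, the two reserved slots would dominate the distribution; with it, all of the probability mass stays on the three active tokens.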

babusid · 2025-11-24T20:27:19Z

python/mlc_llm/interface/compile.py

+    logger.info("TOP LEVEL MODEL CONFIG BEFORE OVERRIDES: %s", str(model_config))
+    _kwargs = getattr(model_config, "kwargs", {})
    model_config = args.overrides.apply(model_config)


Just a note here: I noticed that this override wipes out any kwargs in the original model_config. This PR probably isn't the place to address it, but I wanted to call it out.
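A toy reproduction of the behavior noted above, and of the stash-and-restore pattern hinted at by the `_kwargs = getattr(...)` line in the diff. Everything here is hypothetical: `ToyConfig`, `apply_overrides`, and the `active_vocab_size` key are stand-ins, and the sketch assumes the override step rebuilds the config from its declared fields only, which may not match how `args.overrides.apply` is actually implemented.

```python
from dataclasses import dataclass, field, fields
from typing import Any, Dict


@dataclass
class ToyConfig:
    """Hypothetical stand-in for a model-specific config object."""

    vocab_size: int
    context_window_size: int = 4096
    kwargs: Dict[str, Any] = field(default_factory=dict)


def apply_overrides(config, overrides):
    """Hypothetical override step: rebuilds the config from declared fields only."""
    declared = {
        f.name: getattr(config, f.name) for f in fields(config) if f.name != "kwargs"
    }
    declared.update(overrides)
    return type(config)(**declared)  # kwargs falls back to its empty default


def apply_overrides_keeping_kwargs(config, overrides):
    """Stash kwargs before the override and merge them back afterwards."""
    saved = dict(getattr(config, "kwargs", {}))
    updated = apply_overrides(config, overrides)
    updated.kwargs = {**saved, **updated.kwargs}
    return updated


original = ToyConfig(vocab_size=8, kwargs={"active_vocab_size": 5})
lossy = apply_overrides(original, {"context_window_size": 8192})
kept = apply_overrides_keeping_kwargs(original, {"context_window_size": 8192})
```

Under these assumptions `lossy.kwargs` comes back empty, while `kept.kwargs` still carries the extra entry alongside the overridden `context_window_size`.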

MasterJH5574

Thank you @babusid for the enhancement!

babusid and others added 4 commits November 24, 2025 10:35

original hardcoded fix from ruihang for qwen3 model

dd7f77c

Raised the hardcoded value to the _Rewriter level

d05bfff

fixed hack, should be generic now

5243ef2

babusid commented Nov 24, 2025

Formatting fixes

c188b4e

MasterJH5574 approved these changes Nov 25, 2025

MasterJH5574 changed the title ~~Softmax Predicate patch~~ [Sampling] Softmax Predicate patch Nov 25, 2025

MasterJH5574 merged commit 8b2195c into mlc-ai:main Nov 25, 2025
1 check failed
