SpecPrefill labeled 'for MoE models' but code has no MoE gating

The admin UI (`_modal_model_settings.html`) and `model_settings.py` docstrings describe SpecPrefill as "for MoE/hybrid models", but the implementation in `patches/specprefill.py` has zero MoE-specific logic . It uses architecture-agnostic query extractors including `_llama_extract_queries` for standard dense transformers, and no MoE checks exist in the scheduler or engine.

This is misleading for users with dense models who may skip the feature based on the label.

**Fix:** PR #1044 removes the "MoE" qualifier from all 3 locations.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SpecPrefill labeled 'for MoE models' but code has no MoE gating #1045

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

SpecPrefill labeled 'for MoE models' but code has no MoE gating #1045

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions