UPSTREAM PR #21573: autoparser: fix MiniMax handling#1339
UPSTREAM PR #21573: autoparser: fix MiniMax handling#1339
Conversation
OverviewAnalysis of commit f0b2b56 ("parser: fix MiniMax handling") shows negligible performance impact. Out of 125,224 functions analyzed, 84 were modified (0.067%), with no new or removed functions. All changes are compiler optimization artifacts, not source code modifications—the commit only changed MiniMax template parsing in 2 files. Power Consumption: Virtually unchanged across all binaries (max change: -0.039%)
Function AnalysisAll modified functions are STL templates, JSON utilities, HTTP handlers, and template engine code—none affect inference hot paths (matrix operations, attention, KV cache). Changes show balanced improvements and regressions: Improvements (75% of functions):
Regressions (25% of functions):
All absolute changes under 200ns. No performance-critical functions affected. 💬 Questions? Tag @loci-dev |
e800934 to
a024d9c
Compare
7638ab4 to
f1b46d5
Compare
Note
Source pull request: ggml-org/llama.cpp#21573
Overview
There was a problem handling the generation prompt from MiniMax because it shares a trailing newline with the non-generation-prompt line.
Additional information
Added extra tests for Minimax.
Requirements