fix: clear stale inflight UI state by ai-ag2026 · Pull Request #2796 · nesquena/hermes-webui

ai-ag2026 · 2026-05-23T12:54:28Z

Summary

Clears stale composer busy state before starting a new send so blocked UI state does not survive failed/incomplete prior turns.
Drops stale optimistic sidebar rows once canonical session data arrives.
Preserves canonical idle server rows over stale optimistic local rows.
Lets /api/chat/start still run if optional optimistic pre-start UI bookkeeping throws, and keeps those warnings console-only.
Adds regression coverage for the inflight send/start race, stale optimistic sidebar rows, and pre-start UI helper failures.

Closes #2795.
Relates to the state-consistency umbrella #2361.

Test Plan

git diff --check origin/master...HEAD
node --check static/messages.js
node --check static/sessions.js
python3 -m pytest tests/test_inflight_send_start_race.py -q -o addopts= -> 7 passed
Added-line hygiene scans: private/local markers 0, secret-like hits 0

Notes

Prepared from fresh nesquena/hermes-webui:master at e091e65d56fba42f350a95f3308d6d43b3627a87; latest head d2f5c9069120a9ac56585eaf44e9621ec84e93a6 adds the pre-start UI survival follow-up.

nesquena-hermes · 2026-05-23T14:55:33Z

Summary

Read the diff at cron-pr-2796 against origin/master and traced through static/sessions.js:2012-2086 (_isOptimisticFirstTurnSessionRow / _mergeOptimisticFirstTurnSessions) and static/messages.js:228-318 (send() entry path). Two related issues:

A stale client-only S.busy=true (without a runtime-side confirmation) was diverting new user turns into the invisible queue path instead of starting a fresh /api/chat/start.
After a failed first turn, the locally optimistic sidebar row could persist forever because the merge unconditionally appended local-only rows back into merged and unconditionally Math.max'd the local message_count over the server's confirmed-idle count.

Code reference

The new busy reconciliation guard at static/messages.js:193-216:

function _clearStaleBusyStateBeforeSend({compressionRunning=false}={}){
  if(!S||!S.busy||compressionRunning) return false;
  const session=S.session||{};
  const sid=session.session_id||'';
  const hasRuntimeConfirmation=Boolean(
    S.activeStreamId||
    session.active_stream_id||
    session.pending_user_message||
    session.pending_started_at
  );
  if(hasRuntimeConfirmation) return false;
  ...
  if(typeof setBusy==='function') setBusy(false);
  ...
}

The four hasRuntimeConfirmation conditions are the canonical "the server actually owns a turn" signals, so the guard only clears state that the runtime never confirmed. Good. Compression carve-out is right: isCompressionUiRunning() (called at line 290) is a client-only state machine and would always fail hasRuntimeConfirmation, so an explicit early-return is required.
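To make the guard's decision easier to reason about in isolation, here is a minimal side-effect-free sketch of the same check. The `state` argument stands in for the global `S`, and the function names `hasRuntimeConfirmation` and `shouldClearStaleBusy` are hypothetical; only the four confirmation fields and the compression carve-out come from the excerpt above.

```javascript
// Hypothetical pure version of the stale-busy check; `state` stands in for S.
function hasRuntimeConfirmation(state) {
  const session = state.session || {};
  // Any one of these four signals means the runtime owns a turn.
  return Boolean(
    state.activeStreamId ||
    session.active_stream_id ||
    session.pending_user_message ||
    session.pending_started_at
  );
}

function shouldClearStaleBusy(state, { compressionRunning = false } = {}) {
  // Nothing to clear when idle; compression is client-only, so leave it alone.
  if (!state || !state.busy || compressionRunning) return false;
  return !hasRuntimeConfirmation(state);
}

// Busy flag left over from a failed turn: no runtime-side signal backs it.
const stale = { busy: true, session: { session_id: 'a' } };
// Busy flag backed by an active stream.
const live = { busy: true, activeStreamId: 'stream-1', session: { session_id: 'a' } };

console.log(shouldClearStaleBusy(stale)); // true
console.log(shouldClearStaleBusy(live)); // false
console.log(shouldClearStaleBusy(stale, { compressionRunning: true })); // false
```

Splitting the decision from the side effects (`setBusy(false)` and the elided cleanup) is only a presentation choice for this sketch; the PR keeps both in one function.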

The sidebar-merge fix at static/sessions.js:2026-2055:

const fetched=merged[idx]||{};
const fetchedIsServerIdle=_isServerIdleSessionRow(fetched);
const keepLocalOptimistic=fetchedIsServerIdle?false:_shouldKeepLocalOnlyOptimisticSessionRow(local);
...
if(!keepLocalOptimistic&&typeof _dropStaleOptimisticSessionRow==='function') _dropStaleOptimisticSessionRow(sid);
merged[idx]={
  ...local,
  ...fetched,
  title:keepLocalOptimistic?(local.title||fetched.title):fetched.title,
  message_count:keepLocalOptimistic?Math.max(localCount,fetchedCount):fetchedCount,
  ...
};

The new _shouldKeepLocalOnlyOptimisticSessionRow gate at line 2026 requires one of three conditions: an in-flight send (_sendInProgress && sid===_sendInProgressSid), an active session with runtime confirmation, or an active and busy session within 5s of last_message_at. Outside that window the local row is dropped from the merge, and _dropStaleOptimisticSessionRow(sid) clears INFLIGHT and _sessionStreamingById and calls clearInflightState (defined at static/ui.js:4186).

Diagnosis

This is the correct shape: the merge previously could not distinguish "user just hit send, server hasn't echoed yet" from "send failed silently, optimistic row is now stuck." The fix encodes that distinction via:

_sendInProgress+_sendInProgressSid covers the < ~500ms window before the /api/chat/start response lands;
S.busy && hasRuntimeConfirmation covers an active stream;
5-second ageMs covers the "send is mid-flight, no runtime signal yet" tail.

After 5s with no runtime confirmation, the row should drop — and it does. The is_streaming field is also gated on keepLocalOptimistic, so a stale row won't keep the sidebar dot spinning.
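The three keep conditions and the 5-second cutoff can be sketched as a single predicate. This is an illustration under stated assumptions, not the code in static/sessions.js: the `ctx` object, its field names, `lastMessageAtMs`, and `shouldKeepOptimisticRow` are all hypothetical stand-ins for values the real helper reads from globals.

```javascript
// Hypothetical sketch of the keep/drop decision for a local-only optimistic row.
const OPTIMISTIC_GRACE_MS = 5000; // the 5-second tail described in the review

function shouldKeepOptimisticRow(row, ctx) {
  const sid = row.session_id;
  // 1. A send for this exact session is still in progress.
  if (ctx.sendInProgress && sid === ctx.sendInProgressSid) return true;
  // The remaining conditions only apply to the active session.
  if (sid !== ctx.activeSid) return false;
  // 2. The runtime has confirmed it owns a turn for the active session.
  if (ctx.hasRuntimeConfirmation) return true;
  // 3. Active and busy, with a last message younger than the grace window.
  const ageMs = ctx.now - row.lastMessageAtMs;
  return Boolean(ctx.busy) && ageMs >= 0 && ageMs < OPTIMISTIC_GRACE_MS;
}

const now = 1000000;
const recentRow = { session_id: 's1', lastMessageAtMs: now - 2000 };
const oldRow = { session_id: 's1', lastMessageAtMs: now - 6000 };

console.log(shouldKeepOptimisticRow(oldRow, { sendInProgress: true, sendInProgressSid: 's1', now })); // true
console.log(shouldKeepOptimisticRow(recentRow, { activeSid: 's1', busy: true, now })); // true
console.log(shouldKeepOptimisticRow(oldRow, { activeSid: 's1', busy: true, now })); // false
```

Passing `now` in explicitly keeps the sketch deterministic; whether the real helper takes a clock argument is not stated in the review.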

One subtle correctness question worth confirming: the else branch at line 2079 now drops local rows that aren't in bySid (i.e. server-absent) unless _shouldKeepLocalOnlyOptimisticSessionRow(local) returns true. The pre-fix behavior was to always push them. If a session was just created via /api/session/new but the listing fetch raced and didn't see it yet, the new branch could in principle drop it. In practice the gate returns true for that case, because _sendInProgress && sid===_sendInProgressSid is still true during the same paint cycle, so the row is kept. Good.

Test plan

tests/test_inflight_send_start_race.py:51-98 is the right shape:

test_send_clears_stale_busy_state_before_queue_branch asserts _clearStaleBusyStateBeforeSend runs before the if(S.busy||compressionRunning) branch and before api('/api/chat/start') — i.e. the reconciliation can't be moved underneath the queue gate without breaking the test.
test_server_absent_optimistic_first_turn_rows_are_not_kept_forever asserts the keep/drop gate exists, the merged.push(...) is inside the keep branch, and _dropStaleOptimisticSessionRow clears INFLIGHT.
test_server_idle_row_wins_over_stale_optimistic_count asserts a specific ternary shape for message_count and title — these are guard tests against future regressions where someone might restore the unconditional Math.max.
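The ordering assertion described for the first test is a source-level structural check. The PR's tests are Python (pytest); the sketch below only mirrors the idea in JavaScript. The `source` string is a made-up stand-in for the body of send(), and `assertAppearsInOrder` is a hypothetical helper, not something from the repository.

```javascript
// Illustrative stand-in for the relevant part of send(); not the real file contents.
const source = `
  _clearStaleBusyStateBeforeSend({compressionRunning});
  if(S.busy||compressionRunning){ /* queue path */ }
  await api('/api/chat/start', payload);
`;

// Throws unless every marker occurs in the text, each after the previous one.
function assertAppearsInOrder(text, markers) {
  let cursor = -1;
  for (const marker of markers) {
    const idx = text.indexOf(marker, cursor + 1);
    if (idx === -1) throw new Error(`marker missing or out of order: ${marker}`);
    cursor = idx;
  }
  return true;
}

console.log(assertAppearsInOrder(source, [
  '_clearStaleBusyStateBeforeSend(',
  'if(S.busy||compressionRunning)',
  "api('/api/chat/start'",
])); // true
```

A check like this fails as soon as the reconciliation call is moved below the queue branch, which is the regression the review says the test is meant to catch.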

CI green 3.11/3.12/3.13. LGTM.

nesquena-hermes · 2026-05-23T19:17:30Z

Follow-up review — three new commits since the first pass

Re-reading the latest HEAD (d2f5c906) since my earlier review at 14:55Z. Three new commits landed, all of which address concrete edge cases worth highlighting:

`46c3b902` — preserve server idle rows during optimistic merge

static/sessions.js:2062-2078: The previous merge expression keepLocalOptimistic = !fetchedIsServerIdle || _shouldKeepLocalOnlyOptimisticSessionRow(local) had a logic bug — when the server returned an idle row but the local row was still considered "keepable" by the helper (e.g. recent send-in-progress), the OR kept keepLocalOptimistic=true and the merged row inherited the local-only active_stream_id/pending_user_message/is_streaming flags. So a server-confirmed-idle session could still render as busy in the sidebar.

The fix flips to fetchedIsServerIdle ? false : _shouldKeepLocalOnlyOptimisticSessionRow(local) — when the server says idle, idle wins, period. The follow-up changes to the merged-object fields encode that hard precedence:

active_stream_id: fetchedIsServerIdle ? null : (keepLocalOptimistic ? (fetched.active_stream_id || local.active_stream_id || null) : null),
pending_user_message: fetchedIsServerIdle ? null : (keepLocalOptimistic ? (fetched.pending_user_message || local.pending_user_message || null) : null),
pending_started_at: fetchedIsServerIdle ? null : (keepLocalOptimistic ? (fetched.pending_started_at || local.pending_started_at || null) : null),
is_streaming: fetchedIsServerIdle ? false : (keepLocalOptimistic && Boolean(...)),

This is the correct shape. The server is the source of truth for "is this session active?" and the merge should never let a stale local optimistic flag override that signal.
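The difference between the old OR expression and the new ternary is easiest to see as a truth table. The sketch below evaluates both forms exactly as quoted in this review; `helperSaysKeep` stands in for the result of _shouldKeepLocalOnlyOptimisticSessionRow(local), and the names `oldKeep` and `newKeep` are hypothetical.

```javascript
// Both forms of keepLocalOptimistic, as quoted in the review.
const oldKeep = (fetchedIsServerIdle, helperSaysKeep) => !fetchedIsServerIdle || helperSaysKeep;
const newKeep = (fetchedIsServerIdle, helperSaysKeep) => (fetchedIsServerIdle ? false : helperSaysKeep);

const rows = [];
for (const idle of [false, true]) {
  for (const keep of [false, true]) {
    rows.push({
      fetchedIsServerIdle: idle,
      helperSaysKeep: keep,
      before: oldKeep(idle, keep),
      after: newKeep(idle, keep),
    });
  }
}

// Prints all four combinations side by side.
console.table(rows);

// The server-idle case the review describes: the helper still says "keep".
console.log(oldKeep(true, true), newKeep(true, true)); // true false
```

The two expressions disagree on two of the four rows: the server-idle row with a keepable local row, which is the bug described above, and the non-idle row where the helper says drop, which the old form kept anyway.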

`de51d271` — let chat start survive pre-start UI errors

static/messages.js:436-503: This is the genuinely interesting one. The pre-start optimistic UI block (sidebar upsert, INFLIGHT save, title flash, polling kick-off) now lives inside a try { ... } catch(preStartError) { ... } where the catch path guarantees the user message reaches S.messages and INFLIGHT, sets busy state, and falls through to /api/chat/start.

The motivation, from the catch comment:

// The user turn must reach /api/chat/start even if local optimistic UI
// bookkeeping (render cache, storage quota, sidebar reconciliation, etc.)
// throws. Otherwise the pane can show a user bubble + spinner while the
// backend never receives the turn.

This is exactly the failure mode that the inflight-recovery storage-quota fix (b2477974, v0.51.117) was bumping into. If saveInflightState or upsertActiveSessionForLocalTurn raises (quota exceeded, schema mismatch, etc.), the user expects the message to still send — the optimistic UI bits are decoration, not load-bearing. The wrapper _runOptionalPreStartUiStep at static/messages.js:219-227 swallows individual helper errors silently, and the outer try/catch(preStartError) is the belt-and-suspenders backstop.

One nit: the setBusy(true) inside the catch falls back to a raw S.busy=true assignment if setBusy itself throws (try{setBusy(true);}catch(_){S.busy=true;}). That's reasonable, but means if setBusy was throwing because of a deeper state inconsistency, you're now operating with S.busy=true and no inflight-send button state update. Probably fine for a recovery edge case, but worth knowing about.
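The two-layer fault tolerance described here (a per-step wrapper plus an outer backstop) can be sketched as follows. This is a simplified synchronous illustration, not the code in static/messages.js: `runOptionalStep`, `sendTurn`, the `steps` list, and the injectable `warn` parameter are all hypothetical, and the real send path is asynchronous.

```javascript
// Hypothetical per-step wrapper: a failing optional step is logged, never rethrown.
function runOptionalStep(label, fn, warn = console.warn) {
  try {
    return fn();
  } catch (err) {
    // Nonfatal: report to the console only, never to the user.
    warn(`optional pre-start step failed: ${label}`, err);
    return undefined;
  }
}

// Hypothetical send path: optimistic bookkeeping first, then the real request.
function sendTurn({ steps, startChat, warn = console.warn }) {
  try {
    for (const [label, fn] of steps) runOptionalStep(label, fn, warn);
  } catch (preStartError) {
    // Backstop for anything that throws outside the per-step wrapper.
    warn('pre-start UI block failed', preStartError);
  }
  // The request goes out regardless of what happened above.
  return startChat();
}

const warnings = [];
const result = sendTurn({
  steps: [
    ['saveInflightState', () => { throw new Error('QuotaExceededError'); }],
    ['sidebarUpsert', () => 'ok'],
  ],
  startChat: () => 'started',
  warn: (message) => warnings.push(message),
});

console.log(result, warnings.length); // started 1
```

The later steps still run after an earlier one fails, and the start call is reached either way, which matches the "decoration, not load-bearing" framing above.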

`d2f5c906` — hide nonfatal pre-start send warnings

static/messages.js:219-227: Adds _runOptionalPreStartUiStep(label, fn) and a guard test asserting the helper does NOT call setStatus('UI warning before send: ...'). The intent is right — flashing a status banner for a recoverable internal warning would scare users into thinking the message failed when /api/chat/start is about to succeed. Routing it through console.warn only is the right call.

Test coverage assessment

tests/test_inflight_send_start_race.py now has 5+ test functions pinning structural invariants:

test_pre_start_optimistic_ui_helpers_cannot_block_chat_start — confirms the _runOptionalPreStartUiStep wrapper exists and is used before /api/chat/start.
test_pre_start_optimistic_block_cannot_prevent_chat_start — pins the outer try { ... } catch(preStartError) { ... } and the catch-position-before-chat-start ordering.

The negative-case test asserting that the string setStatus(`UI warning before send: is not present in the helper body is a smart regression guard: without it, someone could re-add a status flash and silently reintroduce the "scary error before successful send" UX bug.

Verdict

LGTM on all three follow-up commits. The fault-tolerance shape is right: server-idle wins in sidebar merge, optimistic UI is decoration not load-bearing, and helper failures stay in the console. CI green on 3.11/3.12/3.13 per the PR description.

The compound fix here addresses a meaningfully larger class of "stuck inflight" failures than the original PR scope — between this and PR #2802, the v0.51.119 cycle should resolve a chunk of the long-tail session-state regressions reported over the last sprint.

nesquena-hermes · 2026-05-24T04:18:07Z

Shipped in v0.51.122 via release/stage-batch4 (#2815). All 5 sub-fixes squashed into one commit with authorship preserved. Closes #2795.

@ai-ag2026

…om 5 commits)

Cherry-pick of PR nesquena#2796 by @ai-ag2026, squashed from 5 author commits onto current master:

- dcee056 fix: drop stale optimistic sidebar rows
- 3a73400 fix: clear stale busy state before send
- 46c3b90 fix: preserve server idle rows during optimistic merge
- de51d27 fix: let chat start survive pre-start UI errors
- d2f5c90 fix: hide nonfatal pre-start send warnings

Authorship preserved via --author. Code-only squash (no CHANGELOG).

…isk batch)

Cherry-picked PRs:

- nesquena#2802 (ai-ag2026): drop stale cached user tail (supersedes held nesquena#2733)
- nesquena#2796 (ai-ag2026): clear stale inflight UI state (5-commit squash)
- nesquena#2777 (b3nw): flush pending render at segment boundaries
- nesquena#2778 (b3nw): reset reasoning accumulator per turn + prefer reasoning_content

…➔ 0.51.124) (#634) This PR contains the following updates: | Package | Update | Change | |---|---|---| | [ghcr.io/nesquena/hermes-webui](https://github.com/nesquena/hermes-webui) | patch | `0.51.108` → `0.51.124` | --- ### Release Notes <details> <summary>nesquena/hermes-webui (ghcr.io/nesquena/hermes-webui)</summary> ### [`v0.51.124`](https://github.com/nesquena/hermes-webui/blob/HEAD/CHANGELOG.md#v051124--2026-05-24--Release-CV-stage-batch6--3-PR-Windows-only-stack--agent-paths--docs--port-hardening) [Compare Source](nesquena/hermes-webui@v0.51.123...v0.51.124) ##### Added - **PR [#2805](nesquena/hermes-webui#2805 by [@Koraji95-coder](https://github.com/Koraji95-coder) — `start.ps1`: expand hermes-agent candidate paths for Windows installers. The launcher now searches `$env:USERPROFILE\.hermes\hermes-agent`, the dev-checkout sibling, and the Windows installer roots (`$env:LOCALAPPDATA\hermes\hermes-agent`, `${env:ProgramW6432}\hermes\hermes-agent`, `${env:ProgramFiles}\hermes\hermes-agent`, `${env:ProgramFiles(x86)}\hermes\hermes-agent`) with `Select-Object -Unique` to collapse WOW64 ProgramFiles redirection collisions on 32-bit PowerShell processes. Adds `-PathType Container` to the `HERMES_WEBUI_AGENT_DIR` guard so a file named `hermes_cli` doesn't false-positive. Null-guards `${env:ProgramFiles(x86)}` for constrained environments where it's missing. Zero impact on Linux/macOS — file is `start.ps1`, never loaded by `start.sh` or `bootstrap.py`. ##### Documentation - **PR [#2806](nesquena/hermes-webui#2806 by [@Koraji95-coder](https://github.com/Koraji95-coder) — Native Windows venv path corrected in `start.ps1` doc-comment and `README.md`. The previous text suggested "run bootstrap.py inside WSL2 once to create the venv, then this script can use that venv" — but a WSL2-created venv is `venv/bin/python` (ELF) and cannot be invoked by native Windows Python. 
The corrected guidance is to create a Windows venv natively (`python -m venv venv` from PowerShell), then `start.ps1` auto-discovers `venv\Scripts\python.exe`. WSL2 remains useful as a parallel install for the full `bootstrap.py` + Linux runtime path. ##### Hardened - **PR [#2807](nesquena/hermes-webui#2807 by [@Koraji95-coder](https://github.com/Koraji95-coder) — `start.ps1`: `HERMES_WEBUI_PORT` env-var parsing uses `[int]::TryParse` + range guard (1-65535) instead of a bare `[int]` cast that threw `InvalidCastException` with no context on typos or accidental shell expansion. Server-process exit code is captured into `$script:serverExitCode` and emitted via `exit` AFTER the `try/finally` cleanup, so `Pop-Location` always runs (avoids leaving the caller stuck at `$RepoRoot` in interactive or dot-sourced sessions). Also drops a non-functional `@args` splat that PowerShell doesn't populate under `[CmdletBinding()]` — the launcher's existing use case is env-var-driven, no pass-through args needed. ### [`v0.51.123`](https://github.com/nesquena/hermes-webui/blob/HEAD/CHANGELOG.md#v051123--2026-05-24--Release-CU-stage-batch5--2-PR-low-risk-batch--gzipETag-static-caching--Open-in-VS-Code) [Compare Source](nesquena/hermes-webui@v0.51.122...v0.51.123) ##### Performance - **PR [#2779](nesquena/hermes-webui#2779 by [@v2psv](https://github.com/v2psv) — Static asset serving negotiates gzip, emits ETags, and uses `immutable` cache headers for fingerprinted URLs. `_serve_static()` in `api/routes.py` previously sent every `/static/*` response with `Cache-Control: no-store` and no `Content-Encoding`, so a page reload over a slow link re-downloaded the full \~2.4 MB JS+CSS shell on every visit. 
The fix layers three changes inside the same function: (1) gzip the body when the client opts in via `Accept-Encoding`, gated to compressible MIME types and files >1 KB; (2) emit a weak ETag derived from `(size, mtime_ns)` and short-circuit conditional GETs to `304 Not Modified`; (3) send `Cache-Control: public, max-age=31536000, immutable` when the URL carries a non-empty `?v=…` fingerprint (the `__WEBUI_VERSION__` token already substituted by the index template and referenced from `static/sw.js`'s `SHELL_ASSETS`), falling back to `public, max-age=300` otherwise. Raw bytes, compressed bytes, and ETags are cached in-process keyed by `(size, mtime_ns)` so a redeploy is picked up without a restart, while missing/random paths never enter the cache and image/font types skip gzip to avoid wasted CPU on already-compressed payloads. Measured against an asyncio TCP proxy that injects RTT + bandwidth caps for representative VPN scenarios: cold loads improve 2.7-3.1× (e.g. 80 ms RTT / 10 Mbps WireGuard goes from 4.0 s to 1.3 s), warm reloads improve 3.3-4.0× via 304 responses, and bytes-on-the-wire drop 74% on cold loads. Loopback (already fast) still benefits 2.4×. Scope is strictly `/static/*`: `/api/*`, `/stream`, `/`, `/index.html`, `/session/*`, and login/auth routes are served by independent handlers and continue to send `no-store` exactly as before — no change to CSRF, session payloads, SSE buffering, or login flows. 11 regression tests pin gzip negotiation, ETag/304 round-trip including `Vary: Accept-Encoding`, fingerprint-driven cache policy including empty `?v=`, image/tiny-file skip rules, redeploy invalidation, and the existing path-traversal sandbox. ##### Added - **PR [#2787](nesquena/hermes-webui#2787 by [@munim](https://github.com/munim) — "Open in VS Code" action in workspace file browser (resolves [#2735](nesquena/hermes-webui#2735)). 
Right-clicking any file, folder, or the workspace root now shows an **Open in VS Code** menu item alongside the existing Reveal in File Manager action. The action calls a new `POST /api/file/open-vscode` endpoint which resolves the workspace-relative path via the existing `safe_resolve` traversal guard, then launches VS Code via `subprocess.Popen` (fire-and-forget, consistent with `_handle_file_reveal`). The endpoint resolves the executable via `shutil.which()` first, then falls back to a hardcoded list of common install locations (macOS: `/usr/local/bin/code` and the app-bundle CLI; Linux: `/usr/bin/code`, `/snap/bin/code`; Windows: `%LOCALAPPDATA%\Programs\Microsoft VS Code\bin\code.cmd` and the `%PROGRAMFILES%` variants) so the action works even when the server process inherits a minimal PATH. Configurable via a new optional `vscode` block in `config.yaml`: `command` overrides the default `code` executable; `host_path_prefix` + `container_path_prefix` enable Docker/container host-path translation. If the command cannot be found anywhere, a descriptive error is returned instead of a bare OS error. i18n keys `open_in_vscode` and `open_in_vscode_failed` added with full translations in all 10 locales. 26 new tests in `tests/test_2735_open_in_vscode.py` pin source wiring, command-resolution logic, i18n completeness, translated strings, and live endpoint error paths. ### [`v0.51.122`](https://github.com/nesquena/hermes-webui/blob/HEAD/CHANGELOG.md#v051122--2026-05-24--Release-CT-stage-batch4--4-PR-low-risk-batch--stale-cache-tail--inflight-UI--segment-flush--reasoning-accumulator) [Compare Source](nesquena/hermes-webui@v0.51.121...v0.51.122) ##### Fixed - **PR [#2802](nesquena/hermes-webui#2802 by [@ai-ag2026](https://github.com/ai-ag2026) — Drop stale inactive cached user tails when `/api/session` reloads a conversation whose saved sidecar already ends on an assistant answer. 
Supersedes [#2733](nesquena/hermes-webui#2733) (held due to async-compression interaction): the new guard adds a `len(cached_messages) <= len(disk_messages)` filter so it never fires when the cache has genuine new concurrent edits beyond the disk state — only when the cache has an unsaved user row past the saved assistant tail. Adds `api/models._inactive_cache_tail_needs_disk_check()` + `_cache_has_stale_unsaved_user_tail()` helpers and 5 new tests in `tests/test_webui_state_db_reconciliation.py`. Previously-held test `test_session_compress_async_reports_stale_session_guard` now passes (verified). Closes umbrella [#2361](nesquena/hermes-webui#2361) partially. - **PR [#2796](nesquena/hermes-webui#2796 by [@ai-ag2026](https://github.com/ai-ag2026) — Clear stale inflight UI state before starting a new send so blocked composer busy-state from failed/incomplete prior turns doesn't divert new turns into the invisible queue. Five-commit squashed fix: (1) drop stale optimistic sidebar rows once canonical session data arrives, (2) clear stale busy state before send via `_clearStaleBusyStateBeforeSend()`, (3) preserve server idle rows over stale optimistic local rows, (4) let `/api/chat/start` survive non-fatal pre-start UI errors via `_runOptionalPreStartUiStep()`, (5) keep those warnings console-only instead of throwing. Adds `_shouldKeepLocalOnlyOptimisticSessionRow()` in `static/sessions.js` and 8 new tests in `tests/test_inflight_send_start_race.py`. Closes [#2795](nesquena/hermes-webui#2795). Authorship preserved via `--author`. - **PR [#2777](nesquena/hermes-webui#2777 by [@b3nw](https://github.com/b3nw) — Flush pending render before segment reset at tool/interim\_assistant boundaries so live tokens that arrived in the 66ms rAF throttle window don't get lost from the DOM when `_resetAssistantSegment()` clears `assistantBody`. 
New `_flushPendingSegmentRender()` helper writes via `smd`, `renderMd`, or `esc` fallback (same paths as `_doRender`) only when `_renderPending` is true. Completed transcripts were never affected — `renderMessages` rebuilds from the full `assistantText` accumulator on `done`. Adds `tests/test_issue2713_streaming_segment_flush.py`. Closes [#2713](nesquena/hermes-webui#2713). - **PR [#2778](nesquena/hermes-webui#2778 by [@b3nw](https://github.com/b3nw) — Reset reasoning accumulator per turn and prefer `reasoning_content` over `reasoning` on read. Two related bugs: (1) `reasoningText` was initialized once when the SSE stream opened and never reset between turns, so the `done` event would assign the union of every turn's reasoning to the last assistant message in multi-turn agent sessions; now reset at both turn boundaries (`tool` + `interim_assistant`). (2) `static/ui.js renderMessages` preferred `m.reasoning` (potentially corrupted by bug 1) over `m.reasoning_content` (the clean per-turn backend value); the fallback now reads `m.reasoning_content || m.reasoning`. Updates `tests/test_streaming_race_fix.py` to scope the reconnect-accumulator guard to the `_wireSSE` preamble only (turn-boundary resets inside event listeners are intentional). Adds `tests/test_issue2565_reasoning_accumulation.py`. Closes [#2565](nesquena/hermes-webui#2565). ### [`v0.51.121`](https://github.com/nesquena/hermes-webui/blob/HEAD/CHANGELOG.md#v051121--2026-05-24--Release-CS-stage-batch3--4-PR-low-risk-batch--statedb-merge--display-counts--compression-marker--Windows-launcher) [Compare Source](nesquena/hermes-webui@v0.51.120...v0.51.121) ##### Fixed - **PR [#2788](nesquena/hermes-webui#2788 by [@Carry00](https://github.com/Carry00) — Prevent `state.db` messages being silently dropped during sidecar merge. 
Two related bugs were combining to discard historical messages: (1) `get_state_db_session_messages()` was selecting `role, content, timestamp` but NOT `id`, so every row was assigned a `("legacy", ...)` merge key instead of `("message_id", ...)`; (2) when a WebUI-origin session was continued via another Hermes surface (Gateway, CLI), the reader was always hitting the *active* profile's `state.db` rather than the session's own profile. Symptom: a 189-message session showed only 50 in the WebUI. Fix: include `id` in the SELECT when the column exists, and accept an optional `profile=` arg so cross-profile reads use the right database. Both callers in `api/routes.py handle_get` now thread `profile=getattr(s, 'profile', None)` through. - **PR [#2797](nesquena/hermes-webui#2797 by [@ai-ag2026](https://github.com/ai-ag2026) — Align messaging session display counts with deduped display messages. The `message_count` returned by `/api/session` is the display coordinate space used for pagination and the header badge. Messaging-thread `state.db` metadata can carry raw duplicate transport rows (blank assistant separators between Discord/Slack thread turns) that `_merged_session_messages_for_display()` intentionally dedupes for rendering. The advertised count was the raw row count, so the frontend expected phantom messages after dedupe — `len(display_msgs) < message_count` triggered "load older" UI states that immediately returned nothing. Fix: `raw["message_count"] = _merged_message_count` for messaging sessions, computed from the same merge that produced the displayed messages. Adds `tests/test_gateway_sync.py::test_messaging_session_message_count_matches_deduped_display_messages` covering the regression. - **PR [#2803](nesquena/hermes-webui#2803 by [@simjak](https://github.com/simjak) — Compression-summary cards no longer use ordinary tool output that merely mentions context compression. 
The streaming auto-compression path was using a local broad substring matcher that fired on any message containing the strings "context compaction" / "context compression" / "context was auto-compressed" / "active task list was preserved across context compression", including skill/tool JSON output and ordinary user discussion about compaction. The strict predicate at `api/compression_anchor._is_context_compression_marker()` was already correctly scoped to synthetic marker prefixes on non-tool messages. Fix: expose the strict predicate as `is_context_compression_marker()` (public name) and route `api/streaming._is_context_compression_marker` through it as a backward-compatible alias. Tool/skill output that mentions compression no longer seeds `compression_anchor_summary` cards. ##### Added - **PR [#2783](nesquena/hermes-webui#2783 by [@Koraji95-coder](https://github.com/Koraji95-coder) — Native Windows launcher and community-guide README link (squashed from 3 commits). `start.ps1` is a PowerShell equivalent of `start.sh` that bypasses `bootstrap.py`'s `ensure_supported_platform()` refusal and invokes `server.py` directly on native Windows. It mirrors `start.sh`'s discovery (load optional `.env` with the same readonly-var filter for `UID`/`GID`/`EUID`/`EGID`/`PPID`, find Python via `HERMES_WEBUI_PYTHON` env → `python3` → `python` → `py`, validate `HERMES_WEBUI_AGENT_DIR` on disk before use, prefer the agent's `venv\Scripts\python.exe`, set `HERMES_WEBUI_HOST` / `HERMES_WEBUI_PORT` / `HERMES_WEBUI_STATE_DIR` / `HERMES_HOME` defaults). The README adds a community-maintained native Windows setup section pointing to [@markwang2658](https://github.com/markwang2658)'s `hermes-windows-native-guide` and `hermes-windows-native` repos with the documented memory delta (\~330 MB native vs \~1080 MB WSL2+Docker). Closes both halves of [#1952](nesquena/hermes-webui#1952). 
Assumes Python + agent venv are already set up — first-time setup still needs WSL2 once to create the venv (`bootstrap.py` still refuses on native Windows). ### [`v0.51.120`](https://github.com/nesquena/hermes-webui/blob/HEAD/CHANGELOG.md#v051120--2026-05-24--Release-CR-stage-batch2--3-PR-low-risk-batch--Bedrock-provider--update-check-past-tag--CORS-preflight) [Compare Source](nesquena/hermes-webui@v0.51.119...v0.51.120) ##### Added - **PR [#2786](nesquena/hermes-webui#2786 by [@munim](https://github.com/munim) — Surface AWS Bedrock as a configurable provider in the WebUI model picker. `api/config.py` registers `"bedrock": "AWS Bedrock"` in `PROVIDER_LABELS`, adds 6 default Bedrock model IDs (Claude Opus 4.7 / 4.6 / 4.5, Sonnet 4.6 / 4.5, Haiku 4.5) to `DEFAULT_MODELS["bedrock"]`, and teaches `_build_configured_model_badges()` to detect Bedrock when both `AWS_ACCESS_KEY_ID` and `AWS_SECRET_ACCESS_KEY` are present (IAM-style auth, not single-API-key). Static fallback list is overridden at runtime by `hermes_cli.models.provider_model_ids("bedrock")` when the live AWS model list is reachable. Adds `tests/test_issue2720_bedrock_model_picker.py` with 11 test cases covering registry, defaults, env-detection, and runtime override. Resolves [#2720](nesquena/hermes-webui#2720). ##### Fixed - **PR [#2789](nesquena/hermes-webui#2789 by [@munim](https://github.com/munim) — Update check no longer falsely reports "Up to date" when HEAD has moved hundreds of commits past the latest tag. The hermes-agent repository keeps committing to master between tagged releases, and the old `_check_repo_release()` returned `behind=0` (since `current_tag == latest_tag`) and stopped — so the user saw "Up to date" while the working tree was hundreds of commits behind. The fix: when `behind == 0`, run `git describe --tags --always`; if the result contains the `-N-gSHA` suffix (HEAD past tag), return `None` so `_check_repo_branch()` runs and reports the real commit gap. 
Adds 8 new test cases in `tests/test_updates.py` covering past-tag detection, equal-tag-and-HEAD pass-through, untagged-repo behavior, and the agent-cadence [#2653](nesquena/hermes-webui#2653) scenario. Resolves [#2653](nesquena/hermes-webui#2653). - **PR [#2790](nesquena/hermes-webui#2790 by [@weidzhou](https://github.com/weidzhou) — Add `do_OPTIONS()` handler in `server.py` so CORS preflight requests return `200 OK` with appropriate `Access-Control-Allow-*` headers instead of `501 Not Implemented`. Browsers sending a preflight OPTIONS for cross-origin API calls previously hit the BaseHTTPRequestHandler default and the entire CORS exchange was blocked. The handler narrowly responds only to OPTIONS — no broader CORS posture change to other endpoints. Resubmit of closed [#2750](nesquena/hermes-webui#2750) (which bundled unrelated session-index changes); this PR is the minimal preflight-only split that [@nesquena-hermes](https://github.com/nesquena-hermes) and [@AJV20](https://github.com/AJV20) requested. ### [`v0.51.119`](https://github.com/nesquena/hermes-webui/blob/HEAD/CHANGELOG.md#v051119--2026-05-24--Release-CQ-stage-batch1--3-PR-low-risk-batch--tool-cards--404-recovery--Hepburn-skin) [Compare Source](nesquena/hermes-webui@v0.51.118...v0.51.119) ##### Fixed - **PR [#2801](nesquena/hermes-webui#2801 by [@ai-ag2026](https://github.com/ai-ag2026) — Preserve settled tool cards across stream completion. The streaming `done` handler now derives anchored settled tool cards from message-level tool metadata (`message.tool_calls`, `message._partial_tool_calls`, or `content[].type === 'tool_use'`) when present, instead of unconditionally falling back to session-level `d.session.tool_calls`. The fallback could overwrite the per-message anchors after pagination/windowing because session-level coordinates may not line up with the active message array, causing tool cards to disappear on the final `done` render. 
Fixes [#2613](nesquena/hermes-webui#2613), complements [#2777](nesquena/hermes-webui#2777) (which covers pending-segment flushes at tool/interim boundaries). Adds `tests/test_streaming_markdown.py::test_done_handler_prefers_message_tool_metadata_for_settled_render` to lock the precedence. - **PR [#2808](nesquena/hermes-webui#2808 by [@chouzz](https://github.com/chouzz) — Recover deterministically from boot-time `/session/{id}` 404s (Option A for [#2798](nesquena/hermes-webui#2798)). When `loadSession()` hits a 404 during boot-time restore (`!currentSid`), `static/sessions.js` now always clears `localStorage['hermes-webui-session']`, strips the stale URL with `history.replaceState(null, '', '/')`, and rethrows so boot falls through to empty-state recovery. The previous condition required the stale id to match `localStorage`, so a stale `/session/{id}` URL with empty `localStorage` (post state-reset) could leave the UI stuck on "Session not available in web UI." Fixes [#2798](nesquena/hermes-webui#2798). ##### Added - **PR [#2799](nesquena/hermes-webui#2799 by [@gavinssr](https://github.com/gavinssr) — Add Hepburn skin (magenta-rose palette derived from the Hepburn TUI theme). Full light + dark palette under `:root[data-skin="hepburn"]` / `:root.dark[data-skin="hepburn"]`, registered in `static/boot.js` `_SKINS` and whitelisted in `static/index.html`'s inline skin gate. As part of this PR `loadSettingsPanel()` in `static/panels.js` now prefers `localStorage.getItem('hermes-skin')` over `settings.skin` when populating the skin picker (DOM truth → settings fallback), so the picker matches what the user actually sees after the inline gate has already resolved legacy aliases. 
### [`v0.51.118`](https://github.com/nesquena/hermes-webui/blob/HEAD/CHANGELOG.md#v051118--2026-05-22--Release-CP-stage-pr2773--1-PR-hotfix--v051117-brick-fix-chat-input-restored)

[Compare Source](nesquena/hermes-webui@v0.51.117...v0.51.118)

##### Fixed

- **PR [#2773](nesquena/hermes-webui#2773)** by [@nesquena-hermes](https://github.com/nesquena-hermes): fix(chat): rename `_inflightStateLimits()` in `static/ui.js` to `_getInflightStateLimits()` so it no longer collides with the `window._inflightStateLimits` config object set in `static/boot.js`. Closes [#2771](nesquena/hermes-webui#2771). The v0.51.117 in-flight-recovery quota fix ([#2766](nesquena/hermes-webui#2766)) declared a top-level helper with the same name as a window-attached config object; because top-level `function foo(){…}` declarations in classic (non-module) scripts attach to `window`, boot.js's `window._inflightStateLimits = {…}` assignment overwrote the function reference before any session could send. Every new chat broke on first `send()` with `TypeError: _inflightStateLimits is not a function`, leaving v0.51.117 effectively unusable. Renamed the function only (the public-ish window key is unchanged) and updated all 4 call sites. New regression test `tests/test_window_function_collision.py` scans every static JS file for top-level `function NAME()` declarations whose name is also the target of `window.NAME = {…}` / `= <number>`, the exact shape that broke [#2715](nesquena/hermes-webui#2715) (`_pinnedSessionsLimit` in v0.51.106) and [#2771](nesquena/hermes-webui#2771) (`_inflightStateLimits` in v0.51.117). The test fails loudly with a precise file:name diagnostic if the bug class returns. Verified end-to-end against the live browser before merge: `_getInflightStateLimits()` returns the limits object and `saveInflightState()` persists to localStorage without throwing.
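
The collision scan described in the #2773 entry can be sketched roughly as follows. This is not the code in `tests/test_window_function_collision.py`; the regexes, the `find_collisions` helper, and the sample sources (including the `maxBytes` field) are assumptions made up for illustration.

```python
import re

# Column-0 `function NAME(` declarations, i.e. top-level in a classic script.
FUNC_DECL = re.compile(r"^function\s+([A-Za-z_$][\w$]*)\s*\(", re.MULTILINE)
# `window.NAME = {` or `window.NAME = <digit>` assignments.
WINDOW_ASSIGN = re.compile(r"\bwindow\.([A-Za-z_$][\w$]*)\s*=\s*(?:\{|\d)")


def find_collisions(sources):
    """Return sorted (filename, name) pairs where a top-level function
    shares its name with a window-attached object or number.

    `sources` maps a filename to the JavaScript text of that file.
    """
    assigned = set()
    for text in sources.values():
        assigned.update(WINDOW_ASSIGN.findall(text))
    collisions = []
    for filename, text in sources.items():
        for name in FUNC_DECL.findall(text):
            if name in assigned:
                collisions.append((filename, name))
    return sorted(collisions)


# Hypothetical inputs that mimic the shape of the v0.51.117 breakage.
SAMPLE = {
    "static/ui.js": "function _inflightStateLimits(){ return window._inflightStateLimits; }\n",
    "static/boot.js": "window._inflightStateLimits = {maxBytes: 1024};\n",
}
# The same inputs after the rename described in the entry.
FIXED = {
    **SAMPLE,
    "static/ui.js": "function _getInflightStateLimits(){ return window._inflightStateLimits; }\n",
}

print(find_collisions(SAMPLE))
print(find_collisions(FIXED))
```

A text scan like this only catches the declaration shape named in the entry; functions assigned through other forms (for example `window.foo = function(){}`) would need their own pattern.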
### `v0.51.117`

[Compare Source](nesquena/hermes-webui@v0.51.116...v0.51.117)

In-flight-recovery quota fix ([#2766](nesquena/hermes-webui#2766)). This release introduced the `_inflightStateLimits` name collision that the v0.51.118 hotfix ([#2773](nesquena/hermes-webui#2773)) resolves.
### [`v0.51.116`](https://github.com/nesquena/hermes-webui/blob/HEAD/CHANGELOG.md#v051116--2026-05-22--Release-CN-stage-pr2676--1-PR--per-skill-enabledisable-toggle-in-Skills-panel-CLI-parity-with-hermes-skills-config)

[Compare Source](nesquena/hermes-webui@v0.51.115...v0.51.116)

##### Added

- **PR [#2676](nesquena/hermes-webui#2676)** by [@lucasrc](https://github.com/lucasrc): Each skill in the Skills panel now has a toggle pill (enabled/disabled) so users can turn individual skills on or off directly from the WebUI without editing `config.yaml`. Achieves parity with the existing `hermes skills config` CLI subcommand (interactive TUI that toggles `skills.disabled` in config). The disabled state is mirrored through to `skills.platform_disabled.webui` when that key is present. Disabled skills remain visible in the panel (muted via `opacity: .45`) instead of being filtered out, so users can re-enable them later. New endpoint: `POST /api/skills/toggle` validates the skill exists in the filesystem before mutating config, wraps the YAML read-modify-write under the existing `_cfg_lock` for thread safety, and calls `reload_config()` so the change takes effect immediately. Toggle pill uses theme variables (`--accent-bg-strong`, `--accent`, `--border`, `--muted`, `--accent-text`) so it adapts automatically to each skin: gold for default, red for ares, blue for poseidon, purple for sisyphus, grey for mono (verified empirically across light + dark variants). i18n keys (`skill_enabled`, `skill_disabled`, `skill_toggle_failed`) translated across all 10 locales. Default-state safety: fresh installs (no `skills.disabled` key in config) return `disabled: False` for every skill, so there is no regression risk for new users.
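
The locked read-modify-write described in the #2676 entry might look something like the sketch below. It works on an in-memory dict rather than `config.yaml`; `toggle_skill`, `_apply`, and the local `_cfg_lock` are stand-ins invented for illustration, not the endpoint's actual code.

```python
import threading

# Stand-in for the `_cfg_lock` named in the entry (assumption: a plain Lock).
_cfg_lock = threading.Lock()


def _apply(names, name, enabled):
    """Return a sorted list with `name` removed (enabled) or added (disabled)."""
    current = set(names or [])
    if enabled:
        current.discard(name)
    else:
        current.add(name)
    return sorted(current)


def toggle_skill(config, known_skills, name, enabled):
    """Flip one skill in `config` and return its new state.

    `known_skills` stands in for the filesystem existence check.
    """
    if name not in known_skills:
        raise KeyError(f"unknown skill: {name}")
    with _cfg_lock:  # serialize concurrent read-modify-write cycles
        skills = config.setdefault("skills", {})
        skills["disabled"] = _apply(skills.get("disabled"), name, enabled)
        platform = skills.get("platform_disabled")
        # Mirror only when the webui key already exists, as the entry describes.
        if isinstance(platform, dict) and "webui" in platform:
            platform["webui"] = _apply(platform["webui"], name, enabled)
    return {"name": name, "disabled": not enabled}


# Hypothetical skill names, used only to exercise the sketch.
KNOWN = {"search", "notes"}
fresh = {}
mirrored = {"skills": {"disabled": [], "platform_disabled": {"webui": []}}}
print(toggle_skill(fresh, KNOWN, "search", enabled=False), fresh)
print(toggle_skill(mirrored, KNOWN, "search", enabled=False), mirrored)
```

Validating the name before taking the lock keeps the critical section short; the real endpoint also persists the YAML and calls `reload_config()`, which this sketch leaves out.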
### [`v0.51.115`](https://github.com/nesquena/hermes-webui/blob/HEAD/CHANGELOG.md#v051115--2026-05-22--Release-CM-stage-pr2731--1-PR--clarify-prompt-collapseexpand-with-chevron-icon-polish)

[Compare Source](nesquena/hermes-webui@v0.51.114...v0.51.115)

##### Added

- **PR [#2731](nesquena/hermes-webui#2731)** by [@Michaelyklam](https://github.com/Michaelyklam): Clarification prompts now include a compact Collapse/Expand control so users can temporarily shrink a blocking decision card and reread the chat context behind it before responding. The toggle uses Lucide chevron icons (chevron-down expanded → click to collapse, chevron-up collapsed → click to expand) and a small circular pill matching the existing composer-button design language. The collapsed card sits cleanly above the composer at every tested viewport (desktop 1920×1080, mobile iPhone 14 390×844) without edge clipping. New clarification prompts still open expanded so users notice them.

### [`v0.51.114`](https://github.com/nesquena/hermes-webui/blob/HEAD/CHANGELOG.md#v051114--2026-05-22--Release-CL-stage-407--1-PR--update-check-recovery-from-remote-re-tags)

[Compare Source](nesquena/hermes-webui@v0.51.113...v0.51.114)

##### Fixed

- **PR [#2758](nesquena/hermes-webui#2758)** by [@nesquena-hermes](https://github.com/nesquena-hermes): fix(updates): pass `--force` to `git fetch --tags` in `api/updates.py` so the WebUI's release-tracking update check can recover from a remote re-tag (e.g. a release tag that was force-pushed to a new commit after a squash-merge). Without `--force`, plain `git fetch origin --tags` returns `! [rejected] vX.Y.Z (would clobber existing tag)` and the entire update path (check, force-apply, normal-apply) jams indefinitely: neither the periodic check nor manual "Check now" nor the Update button can recover.
Three fetch call sites were patched (`_check_repo`, `apply_force_update`, `apply_update`) to use `--tags --force`; the WebUI never pushes tags, so deferring to the remote's view is the right contract. Closes [#2756](nesquena/hermes-webui#2756).

### [`v0.51.113`](https://github.com/nesquena/hermes-webui/blob/HEAD/CHANGELOG.md#v051113--2026-05-22--Release-CK-stage-406--1-PR--composer-model-picker-lag-fix--hard-refresh-recovery)

[Compare Source](nesquena/hermes-webui@v0.51.112...v0.51.113)

##### Fixed

- **PR [#2743](nesquena/hermes-webui#2743)** by [@franksong2702](https://github.com/franksong2702): Composer model picker now opens immediately from the existing static option list while the dynamic `/api/models` catalog hydrates in the background, instead of blocking the click on the catalog request. A just-selected session model also survives a hard refresh that interrupts the async `/api/session/update` POST: the selection is staged into `sessionStorage` (keyed by `session_id`, 10-minute TTL) before the async update flies, and `loadSession()` re-applies the pending pick on next session restore and retries the persistence call. Tests pin the new ordering: visible picker render before `await`, pending-state save before `await api('/api/session/update')`, and pending-state replay before the first `syncTopbar()` projects server metadata.

### [`v0.51.112`](https://github.com/nesquena/hermes-webui/blob/HEAD/CHANGELOG.md#v051112--2026-05-22--Release-CJ-stage-405--1-PR--session-model-authoritative-across-restore)

[Compare Source](nesquena/hermes-webui@v0.51.111...v0.51.112)

##### Fixed

- **PR [#2737](nesquena/hermes-webui#2737)** by [@ai-ag2026](https://github.com/ai-ag2026): Keep the session model authoritative when a restored session is reactivated.
Previously, stale browser-cached picker state could override an active conversation's model in four scenarios: (1) on initial boot when `localStorage` had a different model preference than the active session, (2) on hard refresh when `S._bootReady` revealed the composer chip before the live catalog hydrated, (3) when the session's model wasn't in the current provider catalog (the static/default fallback silently rewrote `S.session.model`), (4) when starting a new session whose model wasn't in the static HTML dropdown. The fix: `loadSession()` now requests `resolve_model=1` so backend normalization happens synchronously with metadata; boot model hydration prefers the active session over `localStorage`; hard refresh re-runs the model dropdown hydration before `_bootReady`; a new `_ensureModelOptionInDropdown()` helper injects a `data-custom='1'` option for models not in the catalog instead of silently rewriting `S.session.model` to the default. 100 LOC of new pytest regression coverage pinning each behavior.

### [`v0.51.111`](https://github.com/nesquena/hermes-webui/blob/HEAD/CHANGELOG.md#v051111--2026-05-22--Release-CI-stage-404--1-PR--keep-statedb-replays-out-of-sidecar-tail)

[Compare Source](nesquena/hermes-webui@v0.51.110...v0.51.111)

##### Fixed

- **PR [#2746](nesquena/hermes-webui#2746)** by [@ai-ag2026](https://github.com/ai-ag2026): Prevent replayed state.db rows from being appended after an already-correct sidecar transcript tail. `merge_session_messages_append_only()` previously tried to skip state.db rows replaying the sidecar, but two edge cases leaked through: (1) the final row of a replayed sidecar prefix was not skipped because the replay index had reached the sidecar sequence length, and (2) a replayed middle segment was not considered prefix replay, so old state.db rows could be appended after the saved assistant tail. That made `/api/session` appear to end on an old user prompt even when the saved sidecar already ended on the real assistant answer.
The fix tracks per-(role, content) visible-occurrence counts in the sidecar and uses that as a replay budget when comparing state.db rows; legitimate repeated messages from state.db are still preserved. `_has_visible_duplicate()` is kept as a thin wrapper around the new `_matching_visible_duplicate()` for backwards compatibility. Regression test covers both full-replay and middle-segment replay shapes.

### [`v0.51.110`](https://github.com/nesquena/hermes-webui/blob/HEAD/CHANGELOG.md#v051110--2026-05-22--Release-CH-stage-403--2-PR-batch--default-personality-from-config--sort-configured-providers-to-top)

[Compare Source](nesquena/hermes-webui@v0.51.109...v0.51.110)

##### Added

- **PR [#2747](nesquena/hermes-webui#2747)** by [@s010mn](https://github.com/s010mn): `new_session()` now reads `display.personality` from `config.yaml` as the default for new conversations. Previously every new session started with `personality=None` and required an explicit `/personality <name>` slash command. Values `'none'`, `'default'`, `'neutral'`, and empty string are treated as no-personality. Case-insensitive: `personality: Taleb` normalizes to `taleb`. Config-read is wrapped in try/except so malformed config falls back to the prior behavior rather than crashing session creation. The `/personality` slash command still works for per-session overrides.
- **PR [#2683](nesquena/hermes-webui#2683)** by [@jasonjcwu](https://github.com/jasonjcwu): Sort providers so configured/custom entries appear first in both the model picker dropdown (`api/config.py::get_available_models`) and the Settings providers panel (`api/providers.py::get_providers`). Priority order: (1) the active provider, (2) `custom:*` providers from `custom_providers` config, (3) providers with configured API keys (credential pool or `config.yaml`), (4) all others alphabetical. Eliminates scrolling past 25+ unconfigured providers to find the one in active use.
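
The four-tier provider ordering in the #2683 entry can be expressed as a sort key. A minimal sketch, assuming providers are dicts with an `id` field and that ties within a tier fall back to alphabetical order (the entry only states alphabetical order for the last tier); `provider_sort_key` and the provider names are hypothetical.

```python
def provider_sort_key(provider, active_id, configured_ids):
    """Rank a provider: active, then custom:*, then configured, then the rest."""
    pid = provider["id"]
    if pid == active_id:
        rank = 0
    elif pid.startswith("custom:"):
        rank = 1
    elif pid in configured_ids:
        rank = 2
    else:
        rank = 3
    # Assumption: alphabetical tie-break inside every tier, not only the last.
    return (rank, pid.lower())


# Hypothetical provider ids, used only to exercise the ordering.
providers = [
    {"id": "zeta"},
    {"id": "alpha"},
    {"id": "custom:lab"},
    {"id": "mu"},
    {"id": "kappa"},
]
ordered = sorted(
    providers,
    key=lambda p: provider_sort_key(p, active_id="kappa", configured_ids={"mu"}),
)
print([p["id"] for p in ordered])
```

Keeping the rule in a single key function would let the dropdown and the Settings panel share one ordering, though whether the two call sites actually share code is not stated in the entry.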
### [`v0.51.109`](https://github.com/nesquena/hermes-webui/blob/HEAD/CHANGELOG.md#v051109--2026-05-22--Release-CG-stage-402--2-PR-batch--sidebar-action-menu-click-stability--chat-panel-sidebar-resync-after-navigation)

[Compare Source](nesquena/hermes-webui@v0.51.108...v0.51.109)

##### Fixed

- **PR [#2741](nesquena/hermes-webui#2741)** by [@ai-ag2026](https://github.com/ai-ag2026): Keep the sidebar conversation actions menu open while session-list refreshes, stream updates, or panel-resync repairs arrive. Previously the three-dot menu beside chat titles could be torn down before the user finished clicking it because `renderSessionListFromCache()` rebuilt the row DOM (and the fixed-position menu's anchor) without checking whether the menu was open. The new early-return at the top of the refresh keeps the menu stable; destructive menu actions explicitly close the menu before they fire, so dismissal still works as expected.
- **PR [#2736](nesquena/hermes-webui#2736)** by [@ai-ag2026](https://github.com/ai-ag2026): Resync the chat sidebar after returning from Settings/Logs/other panels. The session list is virtualized, and the browser can clamp the preserved scrollTop during a panel transition; without a render after the chat view is visible again, stale virtual spacer/header DOM remained until the next manual scroll. The new `_resyncChatSidebarAfterPanelSwitch()` helper runs one guarded `requestAnimationFrame` after the panel becomes visible, bails if a rename input or action menu is open, and uses no polling.

</details>

---

### Configuration

📅 **Schedule**: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).

🚦 **Automerge**: Disabled by config. Please merge this manually once you are satisfied.

♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 **Ignore**: Close this PR and you won't be reminded about these updates again.
---

- [ ] If you want to rebase/retry this PR, check this box

---

This PR has been generated by [Renovate Bot](https://github.com/renovatebot/renovate).

Reviewed-on: https://git.erwanleboucher.dev/eleboucher/homelab/pulls/634

ai-ag2026 added 3 commits May 23, 2026 11:57

fix: drop stale optimistic sidebar rows

dcee056

fix: clear stale busy state before send

3a73400

fix: preserve server idle rows during optimistic merge

46c3b90

ai-ag2026 added 2 commits May 23, 2026 20:01

fix: let chat start survive pre-start UI errors

de51d27

fix: hide nonfatal pre-start send warnings

d2f5c90

ai-ag2026 mentioned this pull request May 23, 2026

Define WebUI run state consistency across transcript, context, streams, and replay #2361

Open

nesquena-hermes closed this May 24, 2026

nesquena-hermes mentioned this pull request May 24, 2026

Performance optimizations #2716

Closed

fix: clear stale inflight UI state #2796

ai-ag2026 wants to merge 5 commits into nesquena:master from ai-ag2026:fix/inflight-optimistic-state

nesquena-hermes commented May 23, 2026

Follow-up review — three new commits since the first pass

46c3b902 — preserve server idle rows during optimistic merge

de51d271 — let chat start survive pre-start UI errors

d2f5c906 — hide nonfatal pre-start send warnings

Test coverage assessment

Verdict
