diff --git a/CHANGELOG.md b/CHANGELOG.md
index 93fb8cdde4..bbe6646c85 100644
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -2,6 +2,29 @@
 
 ## [Unreleased]
 
+## [v0.51.83] — 2026-05-17 — Release BG (stage-376 — 12-PR contributor batch — chat-start adapter parity + populated-core journal recovery + thinking card dedup + context metadata refresh + model cache fingerprint + stream fade cap + manual cron delivery + active-session spinner + email gateway label + thinking copy button + /theme i18n + compact activity semantics)
+
+### Added
+
+- **PR #2460** by @Michaelyklam (closes #2449) — Add a copy button to Thinking card headers so users can copy the card's reasoning text without selecting the `<pre>` manually. The button stops header-toggle propagation and shows the same short checkmark feedback pattern used by existing copy actions.
+
+### Fixed
+
+- **PR #2438** by @franksong2702 (fixes #2435) — Keep the default-off `HERMES_WEBUI_RUNTIME_ADAPTER=legacy-journal` chat-start path response-compatible with the legacy-direct path by not adding adapter-internal `run_id`, `status`, or `active_controls` fields to `/api/chat/start` responses. The adapter facade for the future #1925 runtime split stays an internal protocol-translator seam instead of expanding the public chat-start contract.
+- **PR #2439** by @franksong2702 (fixes #2434) — Recover already-journaled visible assistant text and tool cards even when restart repair first syncs a populated Hermes core transcript into an otherwise empty WebUI sidecar. The core-sync branch now merges non-duplicate run-journal output before clearing stale stream state, closing the carve-out from PR #2427 where recoverable partial output could be silently skipped. Adds `_append_journaled_partial_output(..., dedupe_existing=True)` plus helpers `_run_journal_has_visible_output`, `_find_existing_assistant_for_journal_content`, and `_journal_tool_already_present`.
+- **PR #2441** by @Michaelyklam (fixes #2440) — Compact live Thinking cards now reuse the same timeline card across sequential tool calls within a single assistant turn. `finalizeThinkingCard` clears the `data-thinking-active` marker by searching the entire assistant turn instead of only the tool group, and `appendThinking` reuses the most recent Thinking card when no active marker is set, preventing repeated Thinking cards from stacking as reasoning resumes between tool calls.
+- **PR #2444** by @franksong2702 (fixes #2442) — Refresh session context-window metadata when a session's resolved model changes during deferred hydration or when the user switches models, so high-context models do not stay stuck on a stale prior window and trigger premature compression. Adds a shared `_resolve_context_length_for_session_model` helper, updates `GET /api/session?resolve_model=1` to refresh non-zero persisted windows from current model metadata, resets context metadata on `/api/session/update` model/provider changes, and applies returned `context_length`/`threshold_tokens`/`last_prompt_tokens` in the deferred client-side resolution path with an immediate context-indicator resync.
+- **PR #2445** by @Michaelyklam (fixes #2443) — `/api/models` now fingerprints the in-module provider catalog plus the local Codex `models_cache.json` as part of its persisted cache metadata, so server-side catalog additions and local Codex catalog refreshes invalidate the WebUI's own persisted `models_cache.json` on the next restart instead of waiting for the 24-hour TTL to expire or requiring manual cache deletion.
+- **PR #2450** by @Michaelyklam (fixes #2447) — Cap the optional streaming word-fade drain after the final `done` SSE event so very large or bursty completed responses render from the canonical session promptly instead of keeping the chat in a live/working state until Stop is pressed. The existing caught-up path and per-token animation wait are preserved for normal responses.
+- **PR #2452** by @Michaelyklam (fixes #2451) — Manual WebUI cron triggers now deliver the same final response or failure notice as scheduled cron runs. The manual-run wrapper reuses the scheduler delivery contract (`[SILENT]` skipping, separate `last_delivery_error` metadata, error-notice fallback) with a `TypeError` shim for legacy `mark_job_run` signatures used by older WebUI test doubles.
+- **PR #2455** by @franksong2702 (fixes #2454) — Keep the sidebar spinner in sync with server session metadata when the currently open session has finished but the browser still has stale local busy state. A new `_reconcileActiveSessionIdleStateFromList` helper clears `S.busy`, `S.activeStreamId`, the `INFLIGHT` cache, and active-session stream metadata before optimistic merging can re-mark the row as streaming.
+- **PR #2457** by @Michaelyklam (closes #2456) — Email gateway sessions imported from Hermes Agent `state.db` now normalize as messaging sessions and show an `Email` source label in the WebUI sidebar instead of falling through as unlabelled generic agent sessions. Keeps the Python source-normalization contract (`MESSAGING_SOURCES`, `SOURCE_LABELS`), gateway status platform labels, and frontend `static/sessions.js` whitelist in sync.
+- **PR #2463** by @Michaelyklam (closes #2462) — Align `/theme` command help strings in Russian, German, Simplified Chinese, Traditional Chinese, and French with the current Theme × Skin contract. The localized command descriptions now mention `system/dark/light` plus the full skin list through `nous`, and French invalid-usage text now uses the actual `/theme ` slash command prefix instead of `/thème`. Supersedes the parallel-discovery duplicate at #2464 (closed in favor of this PR).
+
+### Changed
+
+- **PR #2466** by @franksong2702 (closes #2465) — Clarify `Compact tool activity` semantics in Preferences: the setting is now described as compact inline activity that preserves the agent timeline, rather than as one collapsed top-of-turn block. This matches the current long-running-turn behavior, where thinking cards, visible progress notes, and tool Activity bursts stay in chronological order. Renderer behavior is unchanged; this is a description-only correction, plus updates to the `simplified_tool_calling` default comment and regression-test wording.
+
 ## [v0.51.82] — 2026-05-17 — Release BF (stage-375 — 2-PR batch — table renderer pipe protection + Catppuccin appearance skin)
 
 ### Added
diff --git a/api/agent_sessions.py b/api/agent_sessions.py
index 41556fade7..641e3a22dd 100644
--- a/api/agent_sessions.py
+++ b/api/agent_sessions.py
@@ -9,6 +9,7 @@
 
 MESSAGING_SOURCES = {
     'discord',
+    'email',
     'slack',
     'telegram',
     'weixin',
@@ -22,6 +23,7 @@
     'cli': 'CLI',
     'cron': 'Cron',
     'discord': 'Discord',
+    'email': 'Email',
     'slack': 'Slack',
     'telegram': 'Telegram',
     'tool': 'Tool',
diff --git a/api/config.py b/api/config.py
index ea890e6366..4351f997aa 100644
--- a/api/config.py
+++ b/api/config.py
@@ -11,6 +11,7 @@
 
 import collections
 import copy
+import hashlib
 import json
 import logging
 import os
@@ -2205,11 +2206,45 @@ def _models_cache_file_fingerprint(path: Path) -> dict:
     return fingerprint
 
 
+def _models_cache_catalog_fingerprint() -> dict:
+    """Return non-secret model-catalog identity metadata for cache invalidation.
+
+    The /api/models payload is not only a function of user config/auth files.
+    It also depends on the provider/model catalog baked into this module and on
+    small local catalogs such as Codex's models_cache.json. Keep this cheap and
+    deterministic so a server restart after catalog changes does not keep
+    serving an otherwise-valid persisted models_cache.json until the 24h TTL
+    expires (#2443).
+    """
+    catalog_payload = {
+        "provider_models": _PROVIDER_MODELS,
+        "provider_display": _PROVIDER_DISPLAY,
+    }
+    try:
+        encoded = json.dumps(
+            catalog_payload,
+            sort_keys=True,
+            separators=(",", ":"),
+            ensure_ascii=True,
+            default=str,
+        ).encode("utf-8")
+        provider_catalog_sha = hashlib.sha256(encoded).hexdigest()
+    except Exception:
+        provider_catalog_sha = "unavailable"
+
+    codex_home = Path(os.getenv("CODEX_HOME", "").strip() or (HOME / ".codex")).expanduser()
+    return {
+        "provider_catalog_sha256": provider_catalog_sha,
+        "codex_models_cache": _models_cache_file_fingerprint(codex_home / "models_cache.json"),
+    }
+
+
 def _models_cache_source_fingerprint() -> dict:
-    """Return the current config/auth-store fingerprint for /api/models cache."""
+    """Return the current config/auth/catalog fingerprint for /api/models cache."""
     return {
         "config_yaml": _models_cache_file_fingerprint(_get_config_path()),
         "auth_json": _models_cache_file_fingerprint(_get_auth_store_path()),
+        "catalog": _models_cache_catalog_fingerprint(),
     }
 
 
@@ -4057,7 +4092,7 @@ def _get_session_agent_lock(session_id: str) -> threading.Lock:
     "rtl": False,  # right-to-left chat layout (chat messages + composer only)
     "notifications_enabled": False,  # browser notification when tab is in background
     "show_thinking": True,  # show/hide thinking/reasoning blocks in chat view
-    "simplified_tool_calling": True,  # group tools/thinking into one quiet activity disclosure
+    "simplified_tool_calling": True,  # render tools/thinking as compact inline timeline activity
     "api_redact_enabled": True,  # redact sensitive data (API keys, secrets) from API responses
     "sidebar_density": "compact",  # compact | detailed
     "auto_title_refresh_every": "0",  # adaptive title refresh: 0=off, 5/10/20=every N exchanges
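Note on the catalog fingerprint in api/config.py: it depends on canonical JSON serialization, so that an unchanged catalog always hashes to the same digest while any catalog change produces a different one. A minimal standalone sketch of that idea, assuming a hypothetical `catalog_sha256` helper and made-up catalog data (the real `_PROVIDER_MODELS` and `_PROVIDER_DISPLAY` structures are not shown in this diff):

```python
import hashlib
import json


def catalog_sha256(catalog: dict) -> str:
    """Hash a catalog via canonical JSON (hypothetical helper, for illustration)."""
    try:
        encoded = json.dumps(
            catalog,
            sort_keys=True,          # key order must not affect the digest
            separators=(",", ":"),   # no incidental whitespace
            ensure_ascii=True,
            default=str,             # stringify values json cannot encode natively
        ).encode("utf-8")
    except Exception:
        return "unavailable"
    return hashlib.sha256(encoded).hexdigest()


# Made-up catalogs: a and b hold the same content in a different insertion order.
a = {"provider_models": {"acme": ["m1", "m2"]}, "provider_display": {"acme": "Acme"}}
b = {"provider_display": {"acme": "Acme"}, "provider_models": {"acme": ["m1", "m2"]}}
c = {"provider_models": {"acme": ["m1", "m2", "m3"]}, "provider_display": {"acme": "Acme"}}

same_digest = catalog_sha256(a) == catalog_sha256(b)      # reordering keys: unchanged
changed_digest = catalog_sha256(a) != catalog_sha256(c)   # adding a model: changed
```

One caveat of `default=str`: values such as sets stringify in iteration order, which is not guaranteed to be stable across processes, so a catalog containing them could trigger spurious invalidations.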
diff --git a/api/models.py b/api/models.py
index c45e33c0d6..0518b227b7 100644
--- a/api/models.py
+++ b/api/models.py
@@ -718,7 +718,76 @@ def _truncate_journal_tool_args(args, limit: int = 4) -> dict:
     return out
 
 
-def _append_journaled_partial_output(session, stream_id: str | None) -> bool:
+def _normalize_journal_recovery_text(value) -> str:
+    return " ".join(str(value or "").split())
+
+
+def _find_existing_assistant_for_journal_content(session, content: str) -> int | None:
+    candidate = _normalize_journal_recovery_text(content)
+    if not candidate:
+        return None
+    for idx, message in enumerate(session.messages or []):
+        if not isinstance(message, dict) or message.get('role') != 'assistant':
+            continue
+        if message.get('_error'):
+            continue
+        existing = _normalize_journal_recovery_text(message.get('content'))
+        if not existing:
+            continue
+        if existing == candidate:
+            return idx
+        if len(candidate) >= 24 and candidate in existing:
+            return idx
+    return None
+
+
+def _journal_tool_already_present(session, name: str, preview: str) -> bool:
+    candidate_name = str(name or '')
+    candidate_preview = _normalize_journal_recovery_text(preview)
+    for tool_call in session.tool_calls or []:
+        if not isinstance(tool_call, dict):
+            continue
+        if str(tool_call.get('name') or '') != candidate_name:
+            continue
+        existing_preview = _normalize_journal_recovery_text(
+            tool_call.get('preview') or tool_call.get('snippet') or ''
+        )
+        if existing_preview == candidate_preview:
+            return True
+    return False
+
+
+def _run_journal_has_visible_output(session, stream_id: str | None) -> bool:
+    if not stream_id:
+        return False
+    try:
+        from api.run_journal import read_run_events
+        journal = read_run_events(session.session_id, stream_id)
+    except Exception:
+        return False
+    for event in journal.get('events') or []:
+        if not isinstance(event, dict):
+            continue
+        event_name = str(event.get('event') or event.get('type') or '')
+        payload = event.get('payload') if isinstance(event.get('payload'), dict) else {}
+        if event_name == 'token' and str(payload.get('text') or ''):
+            return True
+        if event_name == 'interim_assistant':
+            if payload.get('already_streamed'):
+                continue
+            if str(payload.get('text') or '').strip():
+                return True
+        if event_name == 'tool':
+            return True
+    return False
+
+
+def _append_journaled_partial_output(
+    session,
+    stream_id: str | None,
+    *,
+    dedupe_existing: bool = False,
+) -> bool:
     """Recover already-emitted visible output from a dead stream journal.
 
     This repair path is intentionally conservative: it restores user-visible
@@ -757,6 +826,12 @@ def flush_assistant() -> int | None:
         assistant_parts = []
         if not content:
             return current_assistant_idx
+        if dedupe_existing:
+            existing_idx = _find_existing_assistant_for_journal_content(session, content)
+            if existing_idx is not None:
+                current_assistant_idx = existing_idx
+                assistant_started_at = None
+                return existing_idx
         timestamp = int(assistant_started_at or time.time())
         session.messages.append({
             'role': 'assistant',
@@ -821,6 +896,9 @@ def ensure_assistant_anchor(created_at: float | None = None) -> int:
                 anchor_idx = ensure_assistant_anchor(created_at)
             name = str(payload.get('name') or 'tool')
             preview = str(payload.get('preview') or '')
+            if dedupe_existing and _journal_tool_already_present(session, name, preview):
+                current_assistant_idx = anchor_idx
+                continue
             recovered_tool_calls.append({
                 'name': name,
                 'preview': preview,
@@ -946,19 +1024,48 @@ def _apply_core_sync_or_error_marker(
             core = json.load(f)
         core_messages = core.get('messages', [])
         if core_messages:
+            _stream_id = stream_id_for_recheck or session.active_stream_id
             session.messages = core_messages
             session.tool_calls = core.get('tool_calls', [])
             for field in ('input_tokens', 'output_tokens', 'estimated_cost'):
                 if core.get(field) is not None:
                     setattr(session, field, core[field])
+            _pending_text = _normalize_journal_recovery_text(session.pending_user_message)
+            _already_checkpointed = False
+            if _pending_text and session.messages:
+                for _last_msg in reversed(session.messages):
+                    if isinstance(_last_msg, dict) and _last_msg.get('role') == 'user':
+                        _last_text = _normalize_journal_recovery_text(_last_msg.get('content'))
+                        _already_checkpointed = _last_text == _pending_text
+                        break
+            if (
+                _pending_text
+                and not _already_checkpointed
+                and _run_journal_has_visible_output(session, _stream_id)
+            ):
+                _recovered_ts = int(time.time())
+                if isinstance(session.pending_started_at, (int, float)) and session.pending_started_at > 0:
+                    _recovered_ts = int(session.pending_started_at)
+                _append_recovered_pending_turn(session, timestamp=_recovered_ts)
+            recovered_output = _append_journaled_partial_output(
+                session,
+                _stream_id,
+                dedupe_existing=True,
+            )
             session.active_stream_id = None
             session.pending_user_message = None
             session.pending_attachments = []
             session.pending_started_at = None
+            if recovered_output:
+                session.messages.append(
+                    _interrupted_recovery_marker(recovered_output=True)
+                )
             session.save(touch_updated_at=touch_updated_at)
             logger.info(
-                "Session %s: synced %d messages from core transcript",
-                sid, len(core_messages),
+                "Session %s: synced %d messages from core transcript%s",
+                sid,
+                len(core_messages),
+                " and recovered journaled output" if recovered_output else "",
             )
             return True
 
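Note on the dedupe path in api/models.py: journal text is compared against existing assistant messages after whitespace normalization, accepting either an exact match or, for candidates of at least 24 characters, a substring match. A minimal standalone sketch using plain dicts in place of the session object; `find_existing_assistant` and the sample messages are hypothetical:

```python
def normalize(value) -> str:
    """Collapse all whitespace runs to single spaces; None becomes ''."""
    return " ".join(str(value or "").split())


def find_existing_assistant(messages, content, min_substring_len=24):
    """Return the index of an assistant message that already holds `content`."""
    candidate = normalize(content)
    if not candidate:
        return None
    for idx, message in enumerate(messages or []):
        if not isinstance(message, dict) or message.get("role") != "assistant":
            continue
        if message.get("_error"):
            continue  # error markers never count as existing output
        existing = normalize(message.get("content"))
        if not existing:
            continue
        if existing == candidate:
            return idx
        # Short fragments match too easily, so substring matching is gated.
        if len(candidate) >= min_substring_len and candidate in existing:
            return idx
    return None


messages = [
    {"role": "user", "content": "Summarize the build log"},
    {"role": "assistant", "content": "The build failed in step 3\nbecause a dependency was missing."},
]

exact = find_existing_assistant(
    messages, "The build failed in step 3 because a dependency was missing."
)
partial = find_existing_assistant(messages, "failed in step 3 because a dependency")
too_short = find_existing_assistant(messages, "The build failed")
```

Here `exact` and `partial` both resolve to index 1, while `too_short` is `None` because a 16-character fragment falls below the substring threshold and is not an exact match.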
diff --git a/api/routes.py b/api/routes.py
index 677a667765..93900d76b0 100644
--- a/api/routes.py
+++ b/api/routes.py
@@ -754,8 +754,15 @@ def _run_cron_tracked(job, profile_home=None, execution_profile_home=None):
     agent config/.env while running. When no job profile is selected, both homes
     are the same and legacy server-default behavior is preserved.
     """
+    import importlib
+
     from cron.jobs import mark_job_run, save_job_output
 
+    _cron_scheduler = importlib.import_module("cron.scheduler")
+
+    _silent_marker = getattr(_cron_scheduler, "SILENT_MARKER", "[SILENT]")
+    _deliver_result = getattr(_cron_scheduler, "_deliver_result", None)
+
     job_id = job.get("id", "")
     execution_profile_home = execution_profile_home or profile_home
 
@@ -772,11 +779,29 @@ def _with_cron_home(home, fn):
             job, execution_profile_home
         )
 
-        # Persist output and run metadata back to the job's owning cron store,
-        # even when the selected execution profile is different.
+        # Persist output, deliver the same content the scheduled cron path would
+        # send, and write run metadata back to the job's owning cron store even
+        # when the selected execution profile is different.
         def _persist_success():
             save_job_output(job_id, output)
 
+            deliver_content = (
+                final_response
+                if success
+                else f"⚠️ Cron job '{job.get('name', job_id)}' failed:\n{error}"
+            )
+            should_deliver = bool(deliver_content)
+            if should_deliver and success and _silent_marker in deliver_content.strip().upper():
+                should_deliver = False
+
+            delivery_error = None
+            if should_deliver and _deliver_result is not None:
+                try:
+                    delivery_error = _deliver_result(job, deliver_content)
+                except Exception as de:
+                    delivery_error = str(de)
+                    logger.error("Delivery failed for manual cron job %s: %s", job_id, de)
+
             # Match the scheduled cron path: an apparently successful run with no
             # final response should not leave the job looking healthy.
             _success, _error = success, error
@@ -784,7 +809,14 @@ def _persist_success():
                 _success = False
                 _error = "Agent completed but produced empty response (model error, timeout, or misconfiguration)"
 
-            mark_job_run(job_id, _success, _error)
+            try:
+                mark_job_run(job_id, _success, _error, delivery_error=delivery_error)
+            except TypeError:
+                # Older/fake cron.jobs modules used by focused WebUI tests may
+                # not expose the newer delivery_error parameter. Real Hermes
+                # scheduler builds do, so this is only a compatibility shim for
+                # legacy test doubles and deployments.
+                mark_job_run(job_id, _success, _error)
 
         _with_cron_home(profile_home, _persist_success)
     except Exception as e:
@@ -1630,6 +1662,57 @@ def _resolve_effective_session_model_provider_for_display(session) -> str | None
     return provider
 
 
+def _resolve_context_length_for_session_model(
+    model: str | None,
+    provider: str | None = None,
+) -> int:
+    """Best-effort current context window for a session model.
+
+    Persisted session context metadata is a snapshot from a prior model call.
+    During session hydration/model switching, the current model metadata should
+    be allowed to replace that stale snapshot.
+    """
+    model_for_lookup = str(model or "").strip()
+    if not model_for_lookup:
+        return 0
+    try:
+        from agent.model_metadata import get_model_context_length as _get_cl
+        from api.config import get_config as _get_config_for_cl
+
+        _cfg_for_cl = _get_config_for_cl()
+        _cfg_ctx_len_load = None
+        _cfg_custom_providers_load = None
+        try:
+            _model_cfg_load = _cfg_for_cl.get('model', {}) if isinstance(_cfg_for_cl, dict) else {}
+            if isinstance(_model_cfg_load, dict):
+                _raw_cfg_ctx_load = _model_cfg_load.get('context_length')
+                if _raw_cfg_ctx_load is not None:
+                    try:
+                        _parsed_load = int(_raw_cfg_ctx_load)
+                        if _parsed_load > 0:
+                            _cfg_ctx_len_load = _parsed_load
+                    except (TypeError, ValueError):
+                        pass
+            _raw_cp_load = _cfg_for_cl.get('custom_providers') if isinstance(_cfg_for_cl, dict) else None
+            if isinstance(_raw_cp_load, list):
+                _cfg_custom_providers_load = _raw_cp_load
+        except Exception:
+            pass
+        try:
+            return _get_cl(
+                model_for_lookup,
+                "",
+                config_context_length=_cfg_ctx_len_load,
+                provider=provider or "",
+                custom_providers=_cfg_custom_providers_load,
+            ) or 0
+        except TypeError:
+            # Older hermes-agent builds: legacy 2-arg form.
+            return _get_cl(model_for_lookup, "") or 0
+    except Exception:
+        return 0
+
+
 def _session_model_state_from_request(
     model: str | None,
     requested_provider: str | None,
@@ -3553,48 +3636,22 @@ def handle_get(handler, parsed) -> bool:
             # /api/session/get response — the same wrong-window display this
             # fix addresses on the streaming side.
             _persisted_cl = getattr(s, "context_length", 0) or 0
-            if not _persisted_cl:
+            _threshold_tokens = getattr(s, "threshold_tokens", 0) or 0
+            if (not _persisted_cl) or resolve_model:
                 _model_for_lookup = (
-                    getattr(s, "model", "") or effective_model or ""
+                    effective_model or getattr(s, "model", "") or ""
                 ).strip()
-                if _model_for_lookup:
-                    try:
-                        from agent.model_metadata import get_model_context_length as _get_cl
-                        from api.config import get_config as _get_config_for_cl
-                        _cfg_for_cl = _get_config_for_cl()
-                        _cfg_ctx_len_load = None
-                        _cfg_custom_providers_load = None
-                        try:
-                            _model_cfg_load = _cfg_for_cl.get('model', {}) if isinstance(_cfg_for_cl, dict) else {}
-                            if isinstance(_model_cfg_load, dict):
-                                _raw_cfg_ctx_load = _model_cfg_load.get('context_length')
-                                if _raw_cfg_ctx_load is not None:
-                                    try:
-                                        _parsed_load = int(_raw_cfg_ctx_load)
-                                        if _parsed_load > 0:
-                                            _cfg_ctx_len_load = _parsed_load
-                                    except (TypeError, ValueError):
-                                        pass
-                            _raw_cp_load = _cfg_for_cl.get('custom_providers') if isinstance(_cfg_for_cl, dict) else None
-                            if isinstance(_raw_cp_load, list):
-                                _cfg_custom_providers_load = _raw_cp_load
-                        except Exception:
-                            pass
-                        try:
-                            _fb_cl = _get_cl(
-                                _model_for_lookup,
-                                "",
-                                config_context_length=_cfg_ctx_len_load,
-                                provider=effective_provider or "",
-                                custom_providers=_cfg_custom_providers_load,
-                            ) or 0
-                        except TypeError:
-                            # Older hermes-agent builds: legacy 2-arg form.
-                            _fb_cl = _get_cl(_model_for_lookup, "") or 0
-                        if _fb_cl:
-                            _persisted_cl = _fb_cl
-                    except Exception:
-                        pass
+                _fb_cl = _resolve_context_length_for_session_model(
+                    _model_for_lookup,
+                    effective_provider or getattr(s, "model_provider", None) or "",
+                )
+                if _fb_cl:
+                    if _persisted_cl and _fb_cl != _persisted_cl:
+                        # The old threshold belongs to the old window. Hiding it
+                        # is less misleading than rendering a stale compression
+                        # threshold against a freshly resolved context length.
+                        _threshold_tokens = 0
+                    _persisted_cl = _fb_cl
             _session_tool_calls = getattr(s, "tool_calls", []) if load_messages else []
             if (
                 load_messages
@@ -3613,7 +3670,7 @@ def handle_get(handler, parsed) -> bool:
                 "pending_attachments": getattr(s, "pending_attachments", []) if load_messages else [],
                 "pending_started_at": getattr(s, "pending_started_at", None),
                 "context_length": _persisted_cl,
-                "threshold_tokens": getattr(s, "threshold_tokens", 0) or 0,
+                "threshold_tokens": _threshold_tokens,
                 "last_prompt_tokens": getattr(s, "last_prompt_tokens", 0) or 0,
             }
             if original_stream_id:
@@ -4183,6 +4240,7 @@ def handle_get(handler, parsed) -> bool:
             "telegram": "Telegram",
             "discord": "Discord",
             "slack": "Slack",
+            "email": "Email",
             "web": "Web",
             "api": "API",
         }
@@ -4638,6 +4696,8 @@ def handle_post(handler, parsed) -> bool:
         except KeyError:
             return bad(handler, "Session not found", 404)
         old_ws = getattr(s, "workspace", "")
+        old_model = getattr(s, "model", None)
+        old_provider = getattr(s, "model_provider", None)
         try:
             new_ws = str(resolve_trusted_workspace(body.get("workspace", s.workspace)))
         except ValueError as e:
@@ -4653,6 +4713,16 @@ def handle_post(handler, parsed) -> bool:
                 if model is not None:
                     s.model = model
                 s.model_provider = provider
+                if (
+                    str(old_model or "") != str(getattr(s, "model", "") or "")
+                    or str(old_provider or "") != str(getattr(s, "model_provider", "") or "")
+                ):
+                    s.context_length = _resolve_context_length_for_session_model(
+                        getattr(s, "model", None),
+                        getattr(s, "model_provider", None),
+                    )
+                    s.threshold_tokens = 0
+                    s.last_prompt_tokens = 0
             s.save()
         if str(old_ws or "") != str(new_ws or ""):
             try:
@@ -7800,9 +7870,6 @@ def _legacy_start_run(request: StartRunRequest) -> dict:
             response = dict(result.payload)
             response.setdefault("stream_id", result.stream_id)
             response.setdefault("session_id", result.session_id)
-            response.setdefault("run_id", result.run_id)
-            response.setdefault("status", result.status)
-            response.setdefault("active_controls", result.active_controls)
         else:
             response = _start_chat_stream_for_session(
                 s,
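Note on the `mark_job_run` shim in api/routes.py: the manual cron path passes a `delivery_error` keyword and falls back to the three-argument form on `TypeError`. One hazard of that pattern is that a `TypeError` raised inside a newer `mark_job_run` could also trigger the fallback and run it a second time. A possible alternative, sketched here with hypothetical stand-in functions, checks the signature up front with `inspect.signature`:

```python
import inspect


def call_mark_job_run(mark_job_run, job_id, success, error, delivery_error=None):
    """Pass delivery_error only when the callee accepts it (illustrative helper)."""
    try:
        params = inspect.signature(mark_job_run).parameters
    except (TypeError, ValueError):
        params = {}  # signature unavailable: assume the legacy form
    accepts_kwarg = "delivery_error" in params or any(
        p.kind is inspect.Parameter.VAR_KEYWORD for p in params.values()
    )
    if accepts_kwarg:
        return mark_job_run(job_id, success, error, delivery_error=delivery_error)
    return mark_job_run(job_id, success, error)


calls = []


def legacy_mark(job_id, success, error):  # stand-in for an older test double
    calls.append(("legacy", job_id, success, error))


def modern_mark(job_id, success, error, delivery_error=None):  # stand-in for the newer form
    calls.append(("modern", job_id, success, error, delivery_error))


call_mark_job_run(legacy_mark, "job-1", True, None, delivery_error="smtp down")
call_mark_job_run(modern_mark, "job-1", True, None, delivery_error="smtp down")
```

The trade-off is that callables whose signature cannot be introspected are treated as legacy, so the delivery error is dropped for them.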
diff --git a/docs/pr-media/2449/after-thinking-copy.png b/docs/pr-media/2449/after-thinking-copy.png
new file mode 100644
index 0000000000..789f13d231
Binary files /dev/null and b/docs/pr-media/2449/after-thinking-copy.png differ
diff --git a/docs/pr-media/2449/before-thinking-copy.png b/docs/pr-media/2449/before-thinking-copy.png
new file mode 100644
index 0000000000..a3d5315dc1
Binary files /dev/null and b/docs/pr-media/2449/before-thinking-copy.png differ
diff --git a/static/boot.js b/static/boot.js
index d48123e11e..9464f1670d 100644
--- a/static/boot.js
+++ b/static/boot.js
@@ -907,6 +907,25 @@ function clearPreview(opts={}){
 }
 $('btnClearPreview').onclick=handleWorkspaceClose;
 // workspacePath click handler removed -- use topbar workspace chip dropdown instead
+function _applySessionContextMetadataUpdate(data){
+  if(!S.session||!data||!data.session)return;
+  S.session.context_length=data.session.context_length||0;
+  S.session.threshold_tokens=data.session.threshold_tokens||0;
+  S.session.last_prompt_tokens=data.session.last_prompt_tokens||0;
+  if(typeof _syncCtxIndicator==='function'){
+    const u=S.lastUsage||{};
+    const _pick=(latest,stored,dflt=0)=>latest!=null?latest:(stored!=null?stored:dflt);
+    _syncCtxIndicator({
+      input_tokens:_pick(u.input_tokens,S.session.input_tokens),
+      output_tokens:_pick(u.output_tokens,S.session.output_tokens),
+      estimated_cost:_pick(u.estimated_cost,S.session.estimated_cost),
+      context_length:S.session.context_length||0,
+      last_prompt_tokens:_pick(u.last_prompt_tokens,S.session.last_prompt_tokens),
+      threshold_tokens:S.session.threshold_tokens||0,
+    });
+  }
+}
+
 $('modelSelect').onchange=async()=>{
   if(!S.session)return;
   const selectedModel=$('modelSelect').value;
@@ -916,7 +935,11 @@ $('modelSelect').onchange=async()=>{
   if(typeof closeModelDropdown==='function') closeModelDropdown();
   if(typeof _writePersistedModelState==='function') _writePersistedModelState(modelState.model,modelState.model_provider);
   else try{localStorage.setItem('hermes-webui-model',modelState.model)}catch{}
-  await api('/api/session/update',{method:'POST',body:JSON.stringify({
+  // Clarify scope: composer model changes are session-local, not the global default.
+  if(typeof showToast==='function'){
+    showToast(t('model_scope_toast')||'Applies to this conversation from your next message.', 3000);
+  }
+  const data=await api('/api/session/update',{method:'POST',body:JSON.stringify({
     session_id:S.session.session_id,
     workspace:S.session.workspace,
     model:modelState.model,
@@ -926,10 +949,7 @@ $('modelSelect').onchange=async()=>{
   S.session.model_provider=modelState.model_provider||null;
   if(typeof syncModelChip==='function') syncModelChip();
   syncTopbar();
-  // Clarify scope: composer model changes are session-local, not the global default.
-  if(typeof showToast==='function'){
-    showToast(t('model_scope_toast')||'Applies to this conversation from your next message.', 3000);
-  }
+  _applySessionContextMetadataUpdate(data);
   // Warn if selected model belongs to a different provider than what Hermes is configured for
   if(typeof _checkProviderMismatch==='function'){
     const warn=_checkProviderMismatch(selectedModel);
diff --git a/static/i18n.js b/static/i18n.js
index d108fc205c..084c6ce8e4 100644
--- a/static/i18n.js
+++ b/static/i18n.js
@@ -3762,7 +3762,7 @@ const LOCALES = {
     cmd_terminal: 'Открыть терминал рабочей области',
     cmd_new: 'Начать новую сессию чата',
     cmd_usage: 'Показать или скрыть использование токенов',
-    cmd_theme: 'Переключить тему (dark/light/slate/solarized/monokai/nord/oled)',
+    cmd_theme: 'Переключить внешний вид (тема: system/dark/light, скин: default/ares/mono/slate/poseidon/sisyphus/charizard/sienna/catppuccin/nous)',
     cmd_personality: 'Переключить личность агента',
     cmd_skills: 'Показать доступные навыки Hermes',
     available_commands: 'Доступные команды:',
@@ -6042,7 +6042,7 @@ const LOCALES = {
     cmd_terminal: 'Workspace-Terminal öffnen',
     cmd_new: 'Neue Chat-Sitzung starten',
     cmd_usage: 'Token-Verbrauchsanzeige umschalten',
-    cmd_theme: 'Darstellung wechseln (Theme: system/dark/light, Skin: default/ares/mono/slate/poseidon/sisyphus/charizard)',
+    cmd_theme: 'Darstellung wechseln (Theme: system/dark/light, Skin: default/ares/mono/slate/poseidon/sisyphus/charizard/sienna/catppuccin/nous)',
     cmd_personality: 'Agenten-Persönlichkeit wechseln',
     cmd_skills: 'Verfügbare Hermes-Skills auflisten',
     available_commands: 'Verfügbare Befehle:',
@@ -7206,7 +7206,7 @@ const LOCALES = {
     cmd_terminal: '打开工作区 Terminal',
     cmd_new: '新建聊天会话',
     cmd_usage: '切换 token 用量显示',
-    cmd_theme: '切换外观（主题：system/dark/light，皮肤：default/ares/mono/slate/poseidon/sisyphus/charizard）',
+    cmd_theme: '切换外观（主题：system/dark/light，皮肤：default/ares/mono/slate/poseidon/sisyphus/charizard/sienna/catppuccin/nous）',
     cmd_personality: '切换 Agent 人设',
     cmd_skills: '列出可用的 Hermes 技能',
     available_commands: '可用命令：',
@@ -8307,7 +8307,7 @@ const LOCALES = {
     cmd_terminal: '\u6253\u958b\u5de5\u4f5c\u5340 Terminal',
     cmd_new: '\u65b0\u5efa\u804a\u5929\u6703\u8a71',
     cmd_usage: '\u5207\u63db token \u7528\u91cf\u986f\u793a',
-    cmd_theme: '\u5207\u63db\u5916\u89c0\uff08\u4e3b\u984c\uff1asystem/dark/light\uff0c\u76ae\u819a\uff1adefault/ares/mono/slate/poseidon/sisyphus/charizard\uff09',
+    cmd_theme: '\u5207\u63db\u5916\u89c0\uff08\u4e3b\u984c\uff1asystem/dark/light\uff0c\u76ae\u819a\uff1adefault/ares/mono/slate/poseidon/sisyphus/charizard/sienna/catppuccin/nous\uff09',
     cmd_personality: '\u5207\u63db Agent \u4eba\u8a2d',
     cmd_skills: '\u5217\u51fa\u53ef\u7528\u7684 Hermes \u6280\u80fd',
     available_commands: '\u53ef\u7528\u547d\u4ee4\uff1a',
@@ -11779,7 +11779,7 @@ const LOCALES = {
     cmd_terminal: 'Ouvrez le terminal de l\'espace de travail',
     cmd_new: 'Démarrer une nouvelle session de discussion',
     cmd_usage: 'Activer/désactiver l\'affichage de l\'utilisation du jeton',
-    cmd_theme: 'Changer d\'apparence (thème : système/dark/light, skin : default/ares/mono/slate/poseidon/sisyphus/charizard)',
+    cmd_theme: 'Changer d\'apparence (thème : system/dark/light, skin : default/ares/mono/slate/poseidon/sisyphus/charizard/sienna/catppuccin/nous)',
     cmd_personality: 'Personnalité de l\'agent de commutation',
     cmd_skills: 'Lister les compétences Hermès disponibles',
     available_commands: 'Commandes disponibles :',
@@ -11805,7 +11805,7 @@ const LOCALES = {
     focus_label: 'Se concentrer',
     token_usage_on: 'Utilisation du jeton sur',
     token_usage_off: 'Utilisation des jetons désactivée',
-    theme_usage: 'Utilisation : /thème',
+    theme_usage: 'Utilisation : /theme ',
     theme_set: 'Thème:',
     no_active_session: 'Aucune session active',
     cmd_queue: 'Mettre un message en file d\'attente pour le prochain tour',
diff --git a/static/index.html b/static/index.html
index b56dda19c6..271cc33d93 100644
--- a/static/index.html
+++ b/static/index.html
@@ -1047,7 +1047,7 @@ <h2 data-i18n="empty_title">What can I help with?</h2>
                 <input type="checkbox" id="settingsSimplifiedToolCalling" style="width:15px;height:15px;accent-color:var(--accent)">
                 <span>Compact tool activity</span>
               </label>
-              <div style="font-size:11px;color:var(--muted);margin-top:4px">Group thinking and tool calls into one collapsed activity section per assistant turn.</div>
+              <div style="font-size:11px;color:var(--muted);margin-top:4px">Show thinking and tool calls as compact inline activity while preserving the agent timeline.</div>
             </div>
             <div class="settings-field">
               <label style="display:flex;align-items:center;gap:8px;cursor:pointer">
diff --git a/static/messages.js b/static/messages.js
index 7d40e2f91b..0c77e89e98 100644
--- a/static/messages.js
+++ b/static/messages.js
@@ -697,6 +697,7 @@ function attachLiveStream(activeSid, streamId, uploaded=[], options={}){
   const _STREAM_FADE_MAX_MS=350;
   const _STREAM_FADE_STAGGER_MS=16;
   const _STREAM_FADE_DONE_MAX_MS=320;
+  const _STREAM_FADE_DONE_DRAIN_MAX_MS=900;
   const _streamFadeEnabledForStream=window._fadeTextEffect===true;
 
   // rAF-throttled rendering: buffer tokens, render at most once per frame
@@ -1086,6 +1087,8 @@ function attachLiveStream(activeSid, streamId, uploaded=[], options={}){
       : _stripXmlToolCalls(assistantText.slice(segmentStart));
   }
   function _drainStreamFadeBeforeDone(onDone){
+    const drainStartedAt=performance.now();
+    let forcedDone=false;
     const step=()=>{
       if(!assistantBody){onDone();return;}
       const target=_streamFadeCurrentDisplayText();
@@ -1101,6 +1104,15 @@ function attachLiveStream(activeSid, streamId, uploaded=[], options={}){
         setTimeout(onDone, Math.min(remainingAnimationMs, _STREAM_FADE_DONE_MAX_MS));
         return;
       }
+      // Final SSE `done` means the canonical completed session is available.
+      // The optional word-fade playout must not keep that completed answer
+      // hidden behind the live Thinking state for large/bursty responses.
+      if(!forcedDone&&performance.now()-drainStartedAt>=_STREAM_FADE_DONE_DRAIN_MAX_MS){
+        forcedDone=true;
+        if(_smdParser) _smdEndParser();
+        onDone();
+        return;
+      }
       setTimeout(()=>requestAnimationFrame(step), 33);
     };
     step();
diff --git a/static/sessions.js b/static/sessions.js
index 1665ee8f5c..a431581698 100644
--- a/static/sessions.js
+++ b/static/sessions.js
@@ -256,6 +256,31 @@ function _isSessionEffectivelyStreaming(s) {
   return Boolean(s && (s.is_streaming || _isSessionLocallyStreaming(s)));
 }
 
+function _reconcileActiveSessionIdleStateFromList(serverRows) {
+  if (!S || !S.session || !S.session.session_id) return false;
+  if (typeof _sendInProgress !== 'undefined' && _sendInProgress) return false;
+  if (!Array.isArray(serverRows)) return false;
+  const sid=S.session.session_id;
+  const serverRow=serverRows.find(s=>s&&s.session_id===sid);
+  if (!serverRow) return false;
+  const serverRowIsIdle=!serverRow.is_streaming&&!serverRow.active_stream_id&&!serverRow.pending_user_message;
+  if (!serverRowIsIdle) return false;
+  let changed=false;
+  if (S.busy) { S.busy=false; changed=true; }
+  if (S.activeStreamId) { S.activeStreamId=null; changed=true; }
+  if (INFLIGHT&&INFLIGHT[sid]) {
+    delete INFLIGHT[sid];
+    if (typeof clearInflightState==='function') clearInflightState(sid);
+    changed=true;
+  }
+  if (S.session) {
+    S.session.active_stream_id=null;
+    S.session.pending_user_message=null;
+  }
+  if (changed&&typeof updateSendBtn==='function') updateSendBtn();
+  return changed;
+}
+
 function _purgeStaleInflightEntries() {
   // Clean up INFLIGHT entries for sessions the server confirms are NOT
   // streaming. This prevents the in-memory cache from growing unbounded
@@ -707,9 +732,9 @@ async function loadSession(sid){
       input_tokens:      _pick(u.input_tokens,      _s.input_tokens),
       output_tokens:     _pick(u.output_tokens,     _s.output_tokens),
       estimated_cost:    _pick(u.estimated_cost,    _s.estimated_cost),
-      context_length:    _pick(u.context_length,    _s.context_length),
+      context_length:    _pick(_s.context_length,    u.context_length),
       last_prompt_tokens:_pick(u.last_prompt_tokens,_s.last_prompt_tokens),
-      threshold_tokens:  _pick(u.threshold_tokens,  _s.threshold_tokens),
+      threshold_tokens:  _pick(_s.threshold_tokens,  u.threshold_tokens),
     });
   }
   if(typeof _renderPendingPromptsForActiveSession==='function') _renderPendingPromptsForActiveSession();
@@ -742,12 +767,13 @@ const _HANDOFF_THRESHOLD = 10;  // conversation rounds
 const _HANDOFF_STORAGE_PREFIX = 'handoff:';
 const _HANDOFF_SUFFIX_DISMISSED_AT = 'dismissed_at';
 const _HANDOFF_SUFFIX_SUMMARY_HANDLED_AT = 'summary_handled_at';
-const _MESSAGING_RAW_SOURCES = new Set(['weixin', 'telegram', 'discord', 'slack']);
+const _MESSAGING_RAW_SOURCES = new Set(['weixin', 'telegram', 'discord', 'slack', 'email']);
 const _MESSAGING_SOURCE_LABELS = {
   weixin: 'WeChat',
   telegram: 'Telegram',
   discord: 'Discord',
   slack: 'Slack',
+  email: 'Email',
 };
 
 function _isMessagingSession(session) {
@@ -1102,8 +1128,23 @@ function _resolveSessionModelForDisplaySoon(sid){
       if(!model||!S.session||S.session.session_id!==sid) return;
       S.session.model=model;
       S.session.model_provider=provider||null;
+      S.session.context_length=data.session.context_length||0;
+      S.session.threshold_tokens=data.session.threshold_tokens||0;
+      S.session.last_prompt_tokens=data.session.last_prompt_tokens||0;
       S.session._modelResolutionDeferred=false;
       syncTopbar();
+      if(typeof _syncCtxIndicator==='function'){
+        const u=S.lastUsage||{};
+        const _pick=(latest,stored,dflt=0)=>latest!=null?latest:(stored!=null?stored:dflt);
+        _syncCtxIndicator({
+          input_tokens:_pick(u.input_tokens,S.session.input_tokens),
+          output_tokens:_pick(u.output_tokens,S.session.output_tokens),
+          estimated_cost:_pick(u.estimated_cost,S.session.estimated_cost),
+          context_length:data.session.context_length||0,
+          last_prompt_tokens:_pick(u.last_prompt_tokens,S.session.last_prompt_tokens),
+          threshold_tokens:data.session.threshold_tokens||0,
+        });
+      }
     }catch(_){
       // Keep session switching non-blocking; the next load can try again.
     }
@@ -1878,9 +1919,6 @@ function _applySessionListPayload(sessData, projData){
   // active profile so the "Show N from other profiles" toggle can render
   // without a second round-trip. Stashed on the module for renderSessionListFromCache.
   _otherProfileCount = sessData.other_profile_count || 0;
-  _allSessions = _mergeOptimisticFirstTurnSessions(sessData.sessions||[]);
-  _clearLineageReportCache();
-  _allProjects = projData.projects||[];
   // Capture server clock for clock-skew compensation (issue #1144).
   // server_time is epoch seconds from the server's time.time().
   // _serverTimeDelta = client - server, so (Date.now() - _serverTimeDelta)
@@ -1891,6 +1929,10 @@ function _applySessionListPayload(sessData, projData){
   if (typeof sessData.server_tz === 'string') {
     _serverTz = sessData.server_tz;
   }
+  _reconcileActiveSessionIdleStateFromList(sessData.sessions||[]);
+  _allSessions = _mergeOptimisticFirstTurnSessions(sessData.sessions||[]);
+  _clearLineageReportCache();
+  _allProjects = projData.projects||[];
   _markPollingCompletionUnreadTransitions(_allSessions);
   const isStreaming = _allSessions.some(s => Boolean(s && s.is_streaming));
   if (isStreaming) {
diff --git a/static/style.css b/static/style.css
index c49ae0d9a3..ce1c85aa2c 100644
--- a/static/style.css
+++ b/static/style.css
@@ -3057,7 +3057,10 @@ main.main.showing-logs > #mainLogs{display:flex;}
    Only sub-element selectors that the consolidated block doesn't cover
    (label, toggle, rotate animation) are kept here. ── */
 .thinking-card-label{font-weight:600;letter-spacing:.02em;}
-.thinking-card-toggle{margin-left:auto;font-size:10px;display:inline-flex;align-items:center;justify-content:center;transform-origin:center;transition:transform .18s ease;will-change:transform;}
+.thinking-card-btn-row{margin-left:auto;display:inline-flex;align-items:center;gap:6px;}
+.thinking-copy-btn{display:inline-flex;align-items:center;justify-content:center;width:22px;height:22px;padding:0;border:0;border-radius:6px;background:transparent;color:var(--accent-text);opacity:.72;cursor:pointer;transition:background .15s,color .15s,opacity .15s;}
+.thinking-copy-btn:hover,.thinking-copy-btn:focus-visible{background:var(--accent-bg-strong);opacity:1;outline:none;}
+.thinking-card-toggle{font-size:10px;display:inline-flex;align-items:center;justify-content:center;transform-origin:center;transition:transform .18s ease;will-change:transform;}
 .thinking-card.open .thinking-card-toggle{transform:rotate(90deg);}
 
 .bg-error-banner{background:rgba(229,62,62,.15);border:1px solid rgba(229,62,62,.3);color:#fca5a5;padding:8px 16px;font-size:12px;display:flex;align-items:center;justify-content:space-between;gap:12px;border-radius:0;}
diff --git a/static/ui.js b/static/ui.js
index 5a33723dad..43e938e084 100644
--- a/static/ui.js
+++ b/static/ui.js
@@ -3682,6 +3682,19 @@ function copyMsg(btn){
     setTimeout(()=>{btn.innerHTML=orig;btn.style.color='';},1500);
   }).catch(()=>showToast(t('copy_failed')));
 }
+function _copyThinkingText(btn){
+  const card=btn&&btn.closest?btn.closest('.thinking-card'):null;
+  if(!card)return;
+  const pre=card.querySelector('.thinking-card-body pre');
+  const text=pre?pre.textContent:'';
+  if(!text)return;
+  _copyText(text).then(()=>{
+    const orig=btn.innerHTML;
+    btn.innerHTML=li('check',12);
+    btn.style.color='var(--accent)';
+    setTimeout(()=>{btn.innerHTML=orig;btn.style.color='';},1500);
+  }).catch(()=>showToast(t('copy_failed')));
+}
 
 // ── TTS: Text-to-Speech via Web Speech API (#499) ──
 // Strips markdown, code blocks, and MEDIA: paths for clean speech output.
@@ -4732,9 +4745,9 @@ function _assistantTurnBlocks(turn){
 }
 function _thinkingCardHtml(text, open){
   const clean=_sanitizeThinkingDisplayText(text);
-  return open
-    ? `<div class="thinking-card open"><div class="thinking-card-header" onclick="this.parentElement.classList.toggle('open')"><span class="thinking-card-icon">${li('lightbulb',14)}</span><span class="thinking-card-label">${t('thinking')}</span><span class="thinking-card-toggle">${li('chevron-right',12)}</span></div><div class="thinking-card-body"><pre>${esc(clean)}</pre></div></div>`
-    : `<div class="thinking-card"><div class="thinking-card-header" onclick="this.parentElement.classList.toggle('open')"><span class="thinking-card-icon">${li('lightbulb',14)}</span><span class="thinking-card-label">${t('thinking')}</span><span class="thinking-card-toggle">${li('chevron-right',12)}</span></div><div class="thinking-card-body"><pre>${esc(clean)}</pre></div></div>`;
+  const copyBtn=`<button class="thinking-copy-btn" onclick="event.stopPropagation();_copyThinkingText(this)" title="${t('copy')}" aria-label="${t('copy')}">${li('copy',12)}</button>`;
+  const classes=`thinking-card${open?' open':''}`;
+  return `<div class="${classes}"><div class="thinking-card-header" onclick="this.parentElement.classList.toggle('open')"><span class="thinking-card-icon">${li('lightbulb',14)}</span><span class="thinking-card-label">${t('thinking')}</span><span class="thinking-card-btn-row">${copyBtn}<span class="thinking-card-toggle">${li('chevron-right',12)}</span></span></div><div class="thinking-card-body"><pre>${esc(clean)}</pre></div></div>`;
 }
 function isSimplifiedToolCalling(){
   return window._simplifiedToolCalling!==false;
@@ -7040,7 +7053,7 @@ function finalizeThinkingCard(){
       const summary=group.querySelector('.tool-call-group-summary');
       if(summary) summary.setAttribute('aria-expanded','false');
     }
-    const active=group.querySelector('.agent-activity-thinking[data-thinking-active="1"]');
+    const active=turn.querySelector('.agent-activity-thinking[data-thinking-active="1"]');
     if(active) active.removeAttribute('data-thinking-active');
     _syncToolCallGroupSummary(group);
   }
@@ -7095,6 +7108,11 @@ function appendThinking(text='', options){
   }
   const thinkingText=String(text||'').trim()||'Thinking…';
   let row=blocks.querySelector('.agent-activity-thinking[data-thinking-active="1"]');
+  if(!row){
+    const thinkingCards=Array.from(blocks.querySelectorAll('.agent-activity-thinking'));
+    row=thinkingCards.filter(el=>el.closest('.assistant-turn-blocks')===blocks).pop()||null;
+    if(row) row.setAttribute('data-thinking-active','1');
+  }
   if(!row){
     row=_thinkingActivityNode(thinkingText, false);
     row.setAttribute('data-thinking-active','1');
diff --git a/tests/test_cron_manual_run_persistence.py b/tests/test_cron_manual_run_persistence.py
index 49943b63e9..7d28159214 100644
--- a/tests/test_cron_manual_run_persistence.py
+++ b/tests/test_cron_manual_run_persistence.py
@@ -1,21 +1,32 @@
 """Regression tests for manual WebUI cron runs."""
 
 
-
-def test_manual_cron_run_saves_output_and_marks_job(monkeypatch):
-    import api.routes as routes
-
-    calls = []
-
+def _install_cron_fakes(monkeypatch, calls, deliver_result=None, silent_marker="[SILENT]"):
     cron_jobs = type("CronJobs", (), {})()
     cron_jobs.save_job_output = lambda job_id, output: calls.append(
         ("save", job_id, output)
     )
-    cron_jobs.mark_job_run = lambda job_id, success, error=None: calls.append(
-        ("mark", job_id, success, error)
+    cron_jobs.mark_job_run = lambda job_id, success, error=None, delivery_error=None: calls.append(
+        ("mark", job_id, success, error, delivery_error)
     )
 
+    cron_scheduler = type("CronScheduler", (), {})()
+    cron_scheduler.SILENT_MARKER = silent_marker
+    if deliver_result is None:
+        deliver_result = lambda job, content: calls.append(
+            ("deliver", job["id"], content)
+        ) or None
+    cron_scheduler._deliver_result = deliver_result
+
     monkeypatch.setitem(__import__("sys").modules, "cron.jobs", cron_jobs)
+    monkeypatch.setitem(__import__("sys").modules, "cron.scheduler", cron_scheduler)
+
+
+def test_manual_cron_run_saves_output_delivers_and_marks_job(monkeypatch):
+    import api.routes as routes
+
+    calls = []
+    _install_cron_fakes(monkeypatch, calls)
     monkeypatch.setattr(
         routes,
         "_run_cron_job_in_profile_subprocess",
@@ -27,25 +38,17 @@ def test_manual_cron_run_saves_output_and_marks_job(monkeypatch):
 
     assert calls == [
         ("save", "job123", "manual output"),
-        ("mark", "job123", True, None),
+        ("deliver", "job123", "done"),
+        ("mark", "job123", True, None, None),
     ]
     assert routes._is_cron_running("job123") == (False, 0.0)
 
 
-def test_manual_cron_run_marks_empty_response_as_failure(monkeypatch):
+def test_manual_cron_run_marks_empty_response_as_failure_without_delivery(monkeypatch):
     import api.routes as routes
 
     calls = []
-
-    cron_jobs = type("CronJobs", (), {})()
-    cron_jobs.save_job_output = lambda job_id, output: calls.append(
-        ("save", job_id, output)
-    )
-    cron_jobs.mark_job_run = lambda job_id, success, error=None: calls.append(
-        ("mark", job_id, success, error)
-    )
-
-    monkeypatch.setitem(__import__("sys").modules, "cron.jobs", cron_jobs)
+    _install_cron_fakes(monkeypatch, calls)
     monkeypatch.setattr(
         routes,
         "_run_cron_job_in_profile_subprocess",
@@ -58,4 +61,75 @@ def test_manual_cron_run_marks_empty_response_as_failure(monkeypatch):
     assert calls[0] == ("save", "job-empty", "manual output")
     assert calls[1][0:3] == ("mark", "job-empty", False)
     assert "empty response" in calls[1][3]
+    assert calls[1][4] is None
     assert routes._is_cron_running("job-empty") == (False, 0.0)
+
+
+def test_manual_cron_run_records_delivery_errors_separately(monkeypatch):
+    import api.routes as routes
+
+    calls = []
+
+    def fail_delivery(job, content):
+        calls.append(("deliver", job["id"], content))
+        return "discord not configured"
+
+    _install_cron_fakes(monkeypatch, calls, deliver_result=fail_delivery)
+    monkeypatch.setattr(
+        routes,
+        "_run_cron_job_in_profile_subprocess",
+        lambda job, execution_profile_home: (True, "manual output", "done", None),
+    )
+
+    routes._mark_cron_running("job-delivery-error")
+    routes._run_cron_tracked({"id": "job-delivery-error"})
+
+    assert calls == [
+        ("save", "job-delivery-error", "manual output"),
+        ("deliver", "job-delivery-error", "done"),
+        ("mark", "job-delivery-error", True, None, "discord not configured"),
+    ]
+    assert routes._is_cron_running("job-delivery-error") == (False, 0.0)
+
+
+def test_manual_cron_run_skips_silent_success_delivery(monkeypatch):
+    import api.routes as routes
+
+    calls = []
+    _install_cron_fakes(monkeypatch, calls)
+    monkeypatch.setattr(
+        routes,
+        "_run_cron_job_in_profile_subprocess",
+        lambda job, execution_profile_home: (True, "manual output", "[SILENT]", None),
+    )
+
+    routes._mark_cron_running("job-silent")
+    routes._run_cron_tracked({"id": "job-silent"})
+
+    assert calls == [
+        ("save", "job-silent", "manual output"),
+        ("mark", "job-silent", True, None, None),
+    ]
+    assert routes._is_cron_running("job-silent") == (False, 0.0)
+
+
+def test_manual_cron_run_delivers_failure_notice(monkeypatch):
+    import api.routes as routes
+
+    calls = []
+    _install_cron_fakes(monkeypatch, calls)
+    monkeypatch.setattr(
+        routes,
+        "_run_cron_job_in_profile_subprocess",
+        lambda job, execution_profile_home: (False, "manual output", "", "boom"),
+    )
+
+    routes._mark_cron_running("job-failed")
+    routes._run_cron_tracked({"id": "job-failed", "name": "Nightly check"})
+
+    assert calls[0] == ("save", "job-failed", "manual output")
+    assert calls[1][0:2] == ("deliver", "job-failed")
+    assert "Nightly check" in calls[1][2]
+    assert "boom" in calls[1][2]
+    assert calls[2] == ("mark", "job-failed", False, "boom", None)
+    assert routes._is_cron_running("job-failed") == (False, 0.0)
diff --git a/tests/test_gateway_sync.py b/tests/test_gateway_sync.py
index d364606e88..75fedb2b25 100644
--- a/tests/test_gateway_sync.py
+++ b/tests/test_gateway_sync.py
@@ -797,6 +797,7 @@ def test_agent_session_source_normalization_contract():
 
     cases = {
         'cli': ('cli', 'CLI'),
+        'email': ('messaging', 'Email'),
         'weixin': ('messaging', 'Weixin'),
         'telegram': ('messaging', 'Telegram'),
         'discord': ('messaging', 'Discord'),
@@ -818,6 +819,14 @@ def test_agent_session_source_normalization_contract():
             assert normalized['raw_source'] is None
 
 
+def test_sessions_js_treats_email_as_messaging_source():
+    """Email gateway sessions should receive the same sidebar metadata as other messaging channels."""
+    src = (REPO_ROOT / "static" / "sessions.js").read_text(encoding="utf-8")
+
+    assert "'email'" in src[src.find("_MESSAGING_RAW_SOURCES"):src.find("function _isMessagingSession")]
+    assert "email: 'Email'" in src[src.find("_MESSAGING_SOURCE_LABELS"):src.find("function _isMessagingSession")]
+
+
 def test_cross_source_parent_child_is_not_collapsed_into_root_metadata(cleanup_test_sessions):
     """A WebUI continuation from a messaging parent must keep WebUI metadata.
 
diff --git a/tests/test_issue1436_context_indicator_load_path.py b/tests/test_issue1436_context_indicator_load_path.py
index 59a11b0eb0..608f616e24 100644
--- a/tests/test_issue1436_context_indicator_load_path.py
+++ b/tests/test_issue1436_context_indicator_load_path.py
@@ -103,7 +103,7 @@ def fake_j(h, data, status=200):
         return captured
 
     def test_persisted_context_length_passed_through_unchanged(self):
-        """When Session.context_length is non-zero, return it as-is (no fallback)."""
+        """Fast metadata loads keep the persisted value to avoid catalog work."""
         s = self._stub_session(context_length=200_000, model="claude-sonnet-4.6")
         result = self._invoke_get_session(s, fallback_returns=999_999)
         body = result["data"]["session"]
@@ -112,6 +112,94 @@ def test_persisted_context_length_passed_through_unchanged(self):
             f"got {body['context_length']}"
         )
 
+    def test_resolved_model_load_refreshes_stale_persisted_context_length(self):
+        """The deferred resolve_model=1 load must refresh stale context metadata.
+
+        Session switching first asks for messages=0&resolve_model=0 for speed,
+        then follows with messages=0&resolve_model=1 to hydrate the final
+        model/provider display.  That second path is also where a stale
+        context_length from a prior model must be corrected; otherwise a
+        resumed DeepSeek 1M session can stay stuck on an old 200k window until
+        the user manually toggles models.
+        """
+        import api.routes as routes
+
+        captured = {}
+
+        def fake_j(h, data, status=200):
+            captured["data"] = data
+
+        fake_module = MagicMock()
+        fake_module.get_model_context_length = MagicMock(return_value=1_000_000)
+
+        s = self._stub_session(context_length=200_000, model="deepseek-v4-pro")
+        handler = MagicMock()
+        parsed = urlparse(
+            "/api/session?session_id=test-1436&messages=0&resolve_model=1"
+        )
+
+        with patch("api.routes.get_session", return_value=s), \
+             patch("api.routes.j", side_effect=fake_j), \
+             patch.dict("sys.modules", {"agent.model_metadata": fake_module}):
+            routes.handle_get(handler, parsed)
+
+        body = captured["data"]["session"]
+        assert body["context_length"] == 1_000_000, (
+            "resolve_model=1 must refresh stale persisted context_length from "
+            "current model metadata"
+        )
+
+    def test_session_model_update_refreshes_context_metadata(self):
+        """Changing the session model must not keep the prior model's window."""
+        import api.routes as routes
+
+        captured = {}
+
+        def fake_j(h, data, status=200):
+            captured["data"] = data
+
+        fake_module = MagicMock()
+        fake_module.get_model_context_length = MagicMock(return_value=1_000_000)
+
+        s = self._stub_session(context_length=200_000, model="old-model")
+        s.model_provider = "old-provider"
+        s.workspace = "/tmp"
+        s.threshold_tokens = 100_000
+        s.last_prompt_tokens = 80_000
+        s.save = MagicMock()
+        s.compact.return_value = {
+            **s.compact.return_value,
+            "model": "deepseek-v4-pro",
+            "model_provider": "deepseek",
+            "context_length": 1_000_000,
+            "threshold_tokens": 0,
+            "last_prompt_tokens": 0,
+        }
+        handler = MagicMock()
+        parsed = urlparse("/api/session/update")
+
+        body = {
+            "session_id": "test-1436",
+            "workspace": "/tmp",
+            "model": "deepseek-v4-pro",
+            "model_provider": "deepseek",
+        }
+        with patch("api.routes._check_csrf", return_value=True), \
+             patch("api.routes.read_body", return_value=body), \
+             patch("api.routes.get_session", return_value=s), \
+             patch("api.routes.resolve_trusted_workspace", return_value="/tmp"), \
+             patch("api.routes.j", side_effect=fake_j), \
+             patch.dict("sys.modules", {"agent.model_metadata": fake_module}):
+            routes.handle_post(handler, parsed)
+
+        assert s.model == "deepseek-v4-pro"
+        assert s.model_provider == "deepseek"
+        assert s.context_length == 1_000_000
+        assert s.threshold_tokens == 0
+        assert s.last_prompt_tokens == 0
+        s.save.assert_called_once()
+        assert captured["data"]["session"]["context_length"] == 1_000_000
+
     def test_zero_context_length_falls_back_to_model_metadata(self):
         """Pre-#1318 sessions with context_length=0 must resolve via model_metadata."""
         s = self._stub_session(context_length=0, model="claude-opus-4-7",
@@ -276,13 +364,19 @@ class TestIssue1436SourceMarkers:
 
     def test_routes_load_path_imports_get_model_context_length(self):
         src = ROUTES.read_text(encoding="utf-8")
-        # The import must appear inside the GET /api/session load-path block.
+        # The session load path can call a helper, but the lazy import must
+        # remain in routes.py so WebUI still works with older/missing agent
+        # bundles by swallowing metadata-resolution failures.
         start = src.find('if parsed.path == "/api/session":')
         end = src.find('if parsed.path == "/api/projects":', start)
         block = src[start:end]
-        assert "from agent.model_metadata import get_model_context_length" in block, (
-            "GET /api/session load-path block must lazy-import "
-            "get_model_context_length for the context_length=0 fallback (#1436)"
+        assert "_resolve_context_length_for_session_model" in block, (
+            "GET /api/session load-path block must resolve model context "
+            "metadata for the context_length fallback (#1436)"
+        )
+        assert "from agent.model_metadata import get_model_context_length" in src, (
+            "routes.py must lazy-import get_model_context_length for the "
+            "context_length fallback (#1436)"
         )
 
     def test_routes_load_path_marks_fix_with_issue_number(self):
diff --git a/tests/test_issue1699_model_cache_source_fingerprint.py b/tests/test_issue1699_model_cache_source_fingerprint.py
index 30500eb5cd..0a06d3b836 100644
--- a/tests/test_issue1699_model_cache_source_fingerprint.py
+++ b/tests/test_issue1699_model_cache_source_fingerprint.py
@@ -142,3 +142,39 @@ def test_disk_models_cache_still_loads_when_auth_and_config_sources_are_unchange
     result = config.get_available_models()
 
     assert result == fresh_opencode
+
+
+def test_memory_models_cache_invalidates_when_static_catalog_changes(tmp_path, monkeypatch):
+    _configure_isolated_sources(tmp_path, monkeypatch, "opencode-go")
+    stale_opencode = _valid_models_cache("opencode-go", "glm-5.1")
+    with config._available_models_cache_lock:
+        config._available_models_cache = stale_opencode
+        config._available_models_cache_ts = time.monotonic()
+        config._available_models_cache_source_fingerprint = config._models_cache_source_fingerprint()
+
+    updated_models = list(config._PROVIDER_MODELS["opencode-go"])
+    updated_models.append({"id": "new-catalog-model", "label": "New Catalog Model"})
+    monkeypatch.setitem(config._PROVIDER_MODELS, "opencode-go", updated_models)
+
+    result = config.get_available_models()
+
+    opencode_group = next(g for g in result["groups"] if g.get("provider_id") == "opencode-go")
+    assert any(m.get("id") == "new-catalog-model" for m in opencode_group["models"])
+
+
+def test_disk_models_cache_invalidates_when_static_catalog_changes(tmp_path, monkeypatch):
+    _configure_isolated_sources(tmp_path, monkeypatch, "opencode-go")
+    stale_opencode = _valid_models_cache("opencode-go", "glm-5.1")
+    config._save_models_cache_to_disk(stale_opencode)
+    assert config._models_cache_path.exists()
+
+    updated_models = list(config._PROVIDER_MODELS["opencode-go"])
+    updated_models.append({"id": "new-disk-catalog-model", "label": "New Disk Catalog Model"})
+    monkeypatch.setitem(config._PROVIDER_MODELS, "opencode-go", updated_models)
+    _reset_memory_cache()
+
+    result = config.get_available_models()
+
+    assert result != stale_opencode
+    opencode_group = next(g for g in result["groups"] if g.get("provider_id") == "opencode-go")
+    assert any(m.get("id") == "new-disk-catalog-model" for m in opencode_group["models"])
diff --git a/tests/test_issue1896_context_length_fallback_args.py b/tests/test_issue1896_context_length_fallback_args.py
index 9e2af74f51..d22ba1e5be 100644
--- a/tests/test_issue1896_context_length_fallback_args.py
+++ b/tests/test_issue1896_context_length_fallback_args.py
@@ -202,25 +202,32 @@ def test_routes_session_load_fallback_passes_config_overrides():
     anchor = "older sessions (pre-#1318) that have context_length=0 persisted"
     idx = ROUTES_PY.find(anchor)
     assert idx != -1, "session-load fallback comment moved/removed"
-    # Find the resolver callsite that follows.
+    # The route block may delegate the resolver details to a helper, but the
+    # session-load path must still call the helper and that helper must preserve
+    # the same kwargs as the streaming.py fix.
     block_end = ROUTES_PY.find("if _fb_cl:", idx)
     assert block_end != -1, "_fb_cl assignment not found after fallback comment"
     block = ROUTES_PY[idx:block_end]
+    helper_start = ROUTES_PY.find("def _resolve_context_length_for_session_model")
+    assert helper_start != -1, "context-length resolver helper not found"
+    helper_end = ROUTES_PY.find("\ndef ", helper_start + 1)
+    helper = ROUTES_PY[helper_start:helper_end if helper_end != -1 else len(ROUTES_PY)]
+    assert "_resolve_context_length_for_session_model" in block
     # Same kwargs as the streaming.py fix.
-    assert "config_context_length=" in block, (
+    assert "config_context_length=" in helper, (
         "session-load fallback in api/routes.py must pass config_context_length= "
         "so user-set model.context_length wins over the 256K default. See #1896."
     )
-    assert "provider=effective_provider" in block, (
-        "session-load fallback in api/routes.py must pass provider=effective_provider "
+    assert "provider=provider or" in helper, (
+        "session-load fallback in api/routes.py must pass provider= "
         "so the registry lookup is provider-aware. See #1896."
     )
-    assert "custom_providers=" in block, (
+    assert "custom_providers=" in helper, (
         "session-load fallback in api/routes.py must pass custom_providers= "
         "so the per-model override path applies. See #1896."
     )
     # Legacy fallback for older hermes-agent builds that pre-date the kwargs.
-    assert "except TypeError:" in block, (
+    assert "except TypeError:" in helper, (
         "session-load fallback must catch TypeError to support older "
         "hermes-agent builds without the new kwargs."
     )
diff --git a/tests/test_issue2454_active_session_spinner.py b/tests/test_issue2454_active_session_spinner.py
new file mode 100644
index 0000000000..7db2d3900a
--- /dev/null
+++ b/tests/test_issue2454_active_session_spinner.py
@@ -0,0 +1,62 @@
+"""Regression coverage for #2454 active-session stale sidebar spinner.
+
+The backend can already reconcile stale stream state and return `/api/sessions`
+rows with `is_streaming: false`, `active_stream_id: null`, and
+`pending_user_message: null`. The remaining bug is frontend-local: the current
+open session can keep `S.busy = true`, so `_isSessionLocallyStreaming()` still
+makes the sidebar row render as streaming even after the server says idle.
+"""
+
+from pathlib import Path
+
+REPO = Path(__file__).resolve().parents[1]
+SESSIONS_SRC = (REPO / "static" / "sessions.js").read_text(encoding="utf-8")
+
+
+def _function_body(src: str, signature: str) -> str:
+    start = src.find(signature)
+    assert start != -1, f"missing {signature}"
+    brace = src.find("{", start)
+    assert brace != -1, f"missing opening brace for {signature}"
+    depth = 0
+    for i in range(brace, len(src)):
+        ch = src[i]
+        if ch == "{":
+            depth += 1
+        elif ch == "}":
+            depth -= 1
+            if depth == 0:
+                return src[brace + 1 : i]
+    raise AssertionError(f"could not extract function body for {signature}")
+
+
+def test_active_session_idle_reconcile_clears_stale_busy_and_inflight_state():
+    body = _function_body(SESSIONS_SRC, "function _reconcileActiveSessionIdleStateFromList(")
+
+    assert "serverRows" in body, "reconcile must inspect raw /api/sessions rows before optimistic merging"
+    assert "S.session.session_id" in body, "reconcile must target the currently active session"
+    assert "_sendInProgress" in body, "cleanup must not interrupt a send that has not received stream_id yet"
+    assert "!serverRow.is_streaming" in body, "server idle metadata must gate the cleanup"
+    assert "!serverRow.active_stream_id" in body, "active stream id must be absent before cleanup"
+    assert "!serverRow.pending_user_message" in body, "pending user text must be absent before cleanup"
+    assert "S.busy=false" in body, "stale local busy state must be cleared"
+    assert "S.activeStreamId=null" in body, "stale active stream id must be cleared"
+    assert "delete INFLIGHT[sid]" in body, "stale active-session inflight cache must be purged"
+    assert "clearInflightState(sid)" in body, "persisted inflight cache must be cleared too"
+    assert "updateSendBtn()" in body, "composer controls must reflect the idle state after cleanup"
+
+
+def test_session_list_payload_reconciles_active_idle_state_before_optimistic_merge_and_render():
+    body = _function_body(SESSIONS_SRC, "function _applySessionListPayload(")
+
+    reconcile_pos = body.find("_reconcileActiveSessionIdleStateFromList(sessData.sessions||[])")
+    merge_pos = body.find("_allSessions = _mergeOptimisticFirstTurnSessions")
+    render_pos = body.find("renderSessionListFromCache()")
+
+    assert reconcile_pos != -1, "active-session idle reconciliation must run for refreshed rows"
+    assert merge_pos != -1, "session rows must still be applied from /api/sessions"
+    assert render_pos != -1, "payload application must still render from cache"
+    assert reconcile_pos < merge_pos < render_pos, (
+        "local S.busy/INFLIGHT state must be reconciled against raw server rows "
+        "before optimistic merging can re-label a stale active session as streaming"
+    )
diff --git a/tests/test_issue2462_theme_i18n.py b/tests/test_issue2462_theme_i18n.py
new file mode 100644
index 0000000000..c34bea37f6
--- /dev/null
+++ b/tests/test_issue2462_theme_i18n.py
@@ -0,0 +1,43 @@
+"""Regression coverage for #2462 stale /theme i18n help strings."""
+
+from pathlib import Path
+import re
+
+ROOT = Path(__file__).resolve().parents[1]
+I18N_JS = (ROOT / "static" / "i18n.js").read_text(encoding="utf-8")
+
+
+def _locale_block(locale: str) -> str:
+    # Locale keys are mostly bare identifiers, but zh-Hant is quoted. Match the
+    # requested block up to the next top-level locale block or the LOCALES close.
+    match = re.search(
+        rf"\n\s*['\"]?{re.escape(locale)}['\"]?:\s*\{{(?P<body>.*?)(?=\n\s*['\"]?[a-z][\w-]*['\"]?:\s*\{{|\n\}};)",
+        I18N_JS,
+        re.S,
+    )
+    assert match, f"locale block {locale!r} not found"
+    return match.group("body")
+
+
+def _literal_value(block: str, key: str) -> str:
+    match = re.search(rf"\n\s*{re.escape(key)}:\s*'(?P<value>(?:\\'|[^'])*)',", block)
+    assert match, f"{key!r} not found in locale block"
+    return match.group("value")
+
+
+def test_theme_command_help_mentions_current_theme_and_skin_values():
+    """Every /theme help string should describe the current Theme × Skin contract."""
+    required_fragments = (
+        "system/dark/light",
+        "default/ares/mono/slate/poseidon/sisyphus/charizard/sienna/catppuccin/nous",
+    )
+    for locale in ("en", "it", "ja", "ru", "es", "de", "zh", "zh-Hant", "pt", "ko", "fr"):
+        value = _literal_value(_locale_block(locale), "cmd_theme")
+        for fragment in required_fragments:
+            assert fragment in value, f"{locale} cmd_theme missing {fragment!r}: {value!r}"
+
+
+def test_french_theme_usage_uses_actual_slash_command_with_space():
+    fr_theme_usage = _literal_value(_locale_block("fr"), "theme_usage")
+    assert fr_theme_usage == "Utilisation : /theme "
+    assert "/thème" not in fr_theme_usage
diff --git a/tests/test_regressions.py b/tests/test_regressions.py
index 624043a621..eb0025690a 100644
--- a/tests/test_regressions.py
+++ b/tests/test_regressions.py
@@ -794,7 +794,7 @@ def test_ui_js_keeps_reasoning_only_assistant_messages_visible(cleanup_test_sess
 
 def test_ui_js_does_not_hide_anchor_segments_that_contain_thinking(cleanup_test_sessions):
     """R19c2/R19c3: reasoning-only messages must remain visible through the
-    shared collapsed activity dropdown, even when the anchor segment has no prose.
+    shared compact timeline activity UI, even when the anchor segment has no prose.
     """
     src = (REPO_ROOT / "static" / "ui.js").read_text()
     compact = src.replace(' ', '').replace('\n', '')
diff --git a/tests/test_runtime_adapter_seam.py b/tests/test_runtime_adapter_seam.py
index 296abd642e..c261d54db9 100644
--- a/tests/test_runtime_adapter_seam.py
+++ b/tests/test_runtime_adapter_seam.py
@@ -119,3 +119,22 @@ def test_chat_start_route_selects_adapter_only_when_flag_enabled():
     assert "LegacyJournalRuntimeAdapter" in start_body
     assert "_start_chat_stream_for_session(" in start_body
     assert "HERMES_WEBUI_RUNTIME_ADAPTER" not in start_body, "route should use runtime_adapter_enabled(), not inline env checks"
+
+
+def test_chat_start_adapter_path_preserves_legacy_response_shape():
+    """The RuntimeAdapter seam must be invisible to /api/chat/start callers.
+
+    The adapter can use run_id/status/controls internally, but the flagged
+    route must not add fields that the legacy-direct response does not expose.
+    """
+    routes = importlib.import_module("api.routes")
+    src = (routes.Path(__file__).parent.parent / "api" / "routes.py").read_text(encoding="utf-8")
+    branch_start = src.index("if runtime_adapter_enabled():")
+    branch_end = src.index("else:", branch_start)
+    adapter_branch = src[branch_start:branch_end]
+
+    assert 'response.setdefault("stream_id", result.stream_id)' in adapter_branch
+    assert 'response.setdefault("session_id", result.session_id)' in adapter_branch
+    assert 'response.setdefault("run_id", result.run_id)' not in adapter_branch
+    assert 'response.setdefault("status", result.status)' not in adapter_branch
+    assert 'response.setdefault("active_controls", result.active_controls)' not in adapter_branch
diff --git a/tests/test_session_metadata_fast_path.py b/tests/test_session_metadata_fast_path.py
index c4e5e71915..b3c3bacc74 100644
--- a/tests/test_session_metadata_fast_path.py
+++ b/tests/test_session_metadata_fast_path.py
@@ -42,6 +42,21 @@ def test_session_switch_defers_model_resolution_without_blocking():
     assert "if(fallback&&!deferModelCorrection)" in ui
 
 
+def test_deferred_model_resolution_refreshes_context_metadata():
+    src = (ROOT / "static" / "sessions.js").read_text(encoding="utf-8")
+    start = src.index("function _resolveSessionModelForDisplaySoon")
+    end = src.index("const _INITIAL_MSG_LIMIT", start)
+    block = src[start:end]
+
+    assert "S.session.context_length" in block, (
+        "deferred model resolution must also hydrate context_length so a "
+        "resumed high-context session does not keep the old model's limit"
+    )
+    assert "S.session.threshold_tokens" in block
+    assert "_syncCtxIndicator" in block
+    assert "context_length:data.session.context_length||0" in block.replace(" ", "")
+
+
 def test_boot_does_not_block_session_restore_on_model_catalog():
     src = (ROOT / "static" / "boot.js").read_text(encoding="utf-8")
 
diff --git a/tests/test_session_sidecar_repair.py b/tests/test_session_sidecar_repair.py
index d6f17938ae..900990b3db 100644
--- a/tests/test_session_sidecar_repair.py
+++ b/tests/test_session_sidecar_repair.py
@@ -741,6 +741,134 @@ def test_journal_recovery_keeps_consecutive_tools_on_one_anchor(self, hermes_hom
         assert len(s.tool_calls) == 2
         assert s.tool_calls[0]["assistant_msg_idx"] == s.tool_calls[1]["assistant_msg_idx"]
 
+    def test_core_sync_branch_recovers_visible_journal_output(self, hermes_home, monkeypatch):
+        """The empty-sidecar + populated-core repair branch should still restore
+        already-journaled visible output from the interrupted stream."""
+        s = _make_session(messages=[])
+        s.pending_user_message = "Check maintainer activity"
+        s.pending_started_at = time.time() - 120
+        s.active_stream_id = "core_journal_stream"
+        s.save()
+
+        core_messages = [
+            {"role": "user", "content": "Earlier question"},
+            {"role": "assistant", "content": "Earlier answer"},
+        ]
+        core_path = _write_core_transcript(hermes_home, s.session_id, core_messages)
+
+        append_run_event(
+            s.session_id,
+            "core_journal_stream",
+            "token",
+            {"text": "I will check GitHub first."},
+        )
+        append_run_event(
+            s.session_id,
+            "core_journal_stream",
+            "tool",
+            {
+                "name": "terminal",
+                "preview": "gh pr list --repo nesquena/hermes-webui",
+                "args": {"command": "gh pr list --repo nesquena/hermes-webui"},
+            },
+        )
+        append_run_event(
+            s.session_id,
+            "core_journal_stream",
+            "tool_complete",
+            {"name": "terminal", "duration": 1.2, "is_error": False},
+        )
+        append_run_event(
+            s.session_id,
+            "core_journal_stream",
+            "token",
+            {"text": "The first check finished before the restart."},
+        )
+
+        result = _apply_core_sync_or_error_marker(
+            s,
+            core_path,
+            stream_id_for_recheck="core_journal_stream",
+        )
+
+        assert result is True
+        contents = [m.get("content", "") for m in s.messages]
+        assert contents[:2] == [m["content"] for m in core_messages]
+        recovered_users = [m for m in s.messages if m.get("_recovered")]
+        assert len(recovered_users) == 1
+        assert recovered_users[0]["role"] == "user"
+        assert recovered_users[0]["content"] == "Check maintainer activity"
+        assert any("I will check GitHub first." in c for c in contents)
+        assert any("The first check finished before the restart." in c for c in contents)
+        assert s.tool_calls, "journaled tool starts should become visible settled tool cards"
+        assert s.tool_calls[0]["name"] == "terminal"
+        error_msgs = [m for m in s.messages if m.get("_error")]
+        assert len(error_msgs) == 1
+        assert "partial output above was recovered" in error_msgs[0]["content"]
+        assert s.pending_user_message is None
+        assert s.active_stream_id is None
+
+    def test_core_sync_branch_does_not_duplicate_journal_output_already_in_core(
+        self, hermes_home, monkeypatch
+    ):
+        """If the core transcript already contains the same visible output, the
+        journal repair must not append a second copy."""
+        s = _make_session(messages=[])
+        s.pending_user_message = "Check maintainer activity"
+        s.pending_started_at = time.time() - 120
+        s.active_stream_id = "duplicate_core_journal_stream"
+        s.save()
+
+        core_messages = [
+            {"role": "user", "content": "Check maintainer activity"},
+            {"role": "assistant", "content": "I will check GitHub first."},
+        ]
+        core_tool_calls = [
+            {
+                "name": "terminal",
+                "preview": "gh pr list --repo nesquena/hermes-webui",
+                "snippet": "gh pr list --repo nesquena/hermes-webui",
+                "assistant_msg_idx": 1,
+                "done": True,
+            },
+        ]
+        core_path = _write_core_transcript(
+            hermes_home,
+            s.session_id,
+            core_messages,
+            tool_calls=core_tool_calls,
+        )
+
+        append_run_event(
+            s.session_id,
+            "duplicate_core_journal_stream",
+            "token",
+            {"text": "I will check GitHub first."},
+        )
+        append_run_event(
+            s.session_id,
+            "duplicate_core_journal_stream",
+            "tool",
+            {
+                "name": "terminal",
+                "preview": "gh pr list --repo nesquena/hermes-webui",
+                "args": {"command": "gh pr list --repo nesquena/hermes-webui"},
+            },
+        )
+
+        result = _apply_core_sync_or_error_marker(
+            s,
+            core_path,
+            stream_id_for_recheck="duplicate_core_journal_stream",
+        )
+
+        assert result is True
+        contents = [m.get("content", "") for m in s.messages]
+        assert contents.count("I will check GitHub first.") == 1
+        assert len(s.tool_calls) == 1
+        assert s.tool_calls[0]["name"] == "terminal"
+        assert not [m for m in s.messages if m.get("_error")]
+
 
 class TestLastResortSyncDelegation:
     """_last_resort_sync_from_core delegates to the shared helpers
diff --git a/tests/test_smooth_text_fade.py b/tests/test_smooth_text_fade.py
index ee5f8ff725..1faad4d647 100644
--- a/tests/test_smooth_text_fade.py
+++ b/tests/test_smooth_text_fade.py
@@ -82,6 +82,7 @@ def fade_helper_script(performance_stub: str = "{_t:0,now(){return this._t;}}")
 const _STREAM_FADE_MAX_MS=350;
 const _STREAM_FADE_STAGGER_MS=16;
 const _STREAM_FADE_DONE_MAX_MS=320;
+const _STREAM_FADE_DONE_DRAIN_MAX_MS=900;
 const performance={performance_stub};
 {helpers}
 """
@@ -178,6 +179,20 @@ def test_stream_fade_uses_incremental_renderer_without_changing_default_path():
     assert "_wrapStreamingFadeWords" not in MESSAGES_JS
 
 
+def test_stream_fade_done_drain_has_hard_cap_for_large_buffered_responses():
+    drain_block = function_block(MESSAGES_JS, "_drainStreamFadeBeforeDone")
+    assert "const _STREAM_FADE_DONE_DRAIN_MAX_MS=900" in MESSAGES_JS
+    assert_contains_all(
+        drain_block,
+        [
+            "const drainStartedAt=performance.now();",
+            "performance.now()-drainStartedAt>=_STREAM_FADE_DONE_DRAIN_MAX_MS",
+            "if(_smdParser) _smdEndParser();",
+            "onDone();",
+        ],
+    )
+
+
 def test_stream_fade_css_is_opacity_only_and_hides_live_cursor():
     fade_css = STYLE_CSS[STYLE_CSS.index("OpenWebUI-style streaming word fade") :]
     assert "filter:" not in STYLE_CSS[STYLE_CSS.index("OpenWebUI-style streaming word fade") :].split(
diff --git a/tests/test_ui_card_animation.py b/tests/test_ui_card_animation.py
index 3af47bc0d3..3a84f4aca0 100644
--- a/tests/test_ui_card_animation.py
+++ b/tests/test_ui_card_animation.py
@@ -27,7 +27,8 @@ def test_tool_card_detail_uses_transitionable_collapsed_state():
 
 
 def test_thinking_card_toggle_and_body_use_animation_friendly_state():
-    assert ".thinking-card-toggle{margin-left:auto;font-size:10px;display:inline-flex;" in COMPACT_CSS
+    assert ".thinking-card-btn-row{margin-left:auto;display:inline-flex;align-items:center;gap:6px;" in COMPACT_CSS
+    assert ".thinking-card-toggle{font-size:10px;display:inline-flex;" in COMPACT_CSS
     assert ".thinking-card-header{display:flex;align-items:center;gap:8px;" in COMPACT_CSS
     # Body uses div default (display:block); canonical rule lives in the
     # consolidated block. Open state caps at 260px (intentional "quieter" sizing).
@@ -41,7 +42,18 @@ def test_thinking_card_toggle_and_body_use_animation_friendly_state():
 def test_tool_card_toggle_uses_same_chevron_icon_markup_as_thinking_card():
     assert "<span class=\"thinking-card-toggle\">${li('chevron-right',12)}</span>" in UI_JS
     assert "<span class=\"tool-card-toggle\">${li('chevron-right',12)}</span>" in UI_JS
-    assert "<div class=\"thinking-card\"><div class=\"thinking-card-header\" onclick=\"this.parentElement.classList.toggle('open')\"><span class=\"thinking-card-icon\">" in UI_JS
+    assert "<div class=\"${classes}\"><div class=\"thinking-card-header\" onclick=\"this.parentElement.classList.toggle('open')\"><span class=\"thinking-card-icon\">" in UI_JS
+
+
+def test_thinking_card_header_includes_copy_button_that_does_not_toggle_card():
+    assert "function _copyThinkingText(btn){" in UI_JS
+    assert "const copyBtn=`<button class=\"thinking-copy-btn\"" in UI_JS
+    assert "event.stopPropagation();_copyThinkingText(this)" in UI_JS
+    assert "card.querySelector('.thinking-card-body pre')" in UI_JS
+    assert "_copyText(text).then(()=>{" in UI_JS
+    assert "btn.innerHTML=li('check',12);" in UI_JS
+    assert ".thinking-copy-btn{" in COMPACT_CSS
+    assert ".thinking-copy-btn:hover,.thinking-copy-btn:focus-visible{" in COMPACT_CSS
 
 
 def test_live_thinking_updates_existing_card_body_in_place():
diff --git a/tests/test_ui_tool_call_cleanup.py b/tests/test_ui_tool_call_cleanup.py
index e647b35392..7713287f52 100644
--- a/tests/test_ui_tool_call_cleanup.py
+++ b/tests/test_ui_tool_call_cleanup.py
@@ -100,7 +100,7 @@ def test_render_messages_gates_settled_activity_grouping(self):
         fn = _function_body(UI_JS, "renderMessages")
         helper = _function_body(UI_JS, "ensureActivityGroup")
         assert "isSimplifiedToolCalling()" in fn, (
-            "Settled tool/thinking grouping should be gated by the Compact tool activity toggle."
+            "Settled compact inline activity rendering should be gated by the Compact tool activity toggle."
         )
         assert "tool-cards-toggle" in fn, (
             "The non-simplified path should preserve the upstream loose tool-card controls."
@@ -157,7 +157,7 @@ def test_live_tool_cards_use_grouping_only_when_simplified(self):
         live_fn = _function_body(UI_JS, "appendLiveToolCard")
         settled_fn = _function_body(UI_JS, "renderMessages")
         assert "isSimplifiedToolCalling()" in live_fn, (
-            "Live streaming tool cards should branch on the Compact tool activity toggle."
+            "Live streaming tool cards should branch on the Compact tool activity timeline mode."
         )
         assert "ensureActivityGroup" in live_fn, (
             "Compact live tool rendering should use the grouped activity container."
@@ -263,7 +263,7 @@ def test_settled_thinking_suppresses_visible_assistant_echoes(self):
     def test_compact_activity_keeps_thinking_cards_after_session_switch(self):
         ui_min = re.sub(r"\s+", "", UI_JS)
         assert "functionensureActivityGroup(" in ui_min, (
-            "Tool calls should still use the shared Activity disclosure helper."
+            "Tool calls should still use the shared compact Activity disclosure helper."
         )
         assert "data-agent-activity-group" in UI_JS, (
             "The Activity disclosure needs a stable data-agent-activity-group hook."
@@ -307,6 +307,13 @@ def test_live_visible_interim_text_splits_tool_bursts_not_thinking(self):
         assert "body.querySelector" in live_tool_fn and "data-live-tid" in live_tool_fn, (
             "tool_complete must still update its current live Activity burst by tool id."
         )
+        finalize_fn = _function_body(UI_JS, "finalizeThinkingCard")
+        assert "turn.querySelector('.agent-activity-thinking[data-thinking-active=\"1\"]')" in finalize_fn, (
+            "Compact Thinking cards live directly in assistant-turn blocks, so finalization must clear the active marker from the whole turn, not only the tool group."
+        )
+        assert "thinkingCards.filter" in live_thinking_fn and "setAttribute('data-thinking-active','1')" in live_thinking_fn, (
+            "Compact live thinking should reactivate the latest existing Thinking card instead of stacking a new card after every tool boundary."
+        )
         close_activity_fn = _function_body(MESSAGES_JS, "_closeCurrentLiveActivityGroup")
         assert "data-live-activity-current" in close_activity_fn, (
             "Visible interim assistant boundaries should close the previous live Activity burst."