diff --git a/CLAUDE.md b/CLAUDE.md
new file mode 100644
index 000000000..1e3fa9a1a
--- /dev/null
+++ b/CLAUDE.md
@@ -0,0 +1,27 @@
+# CLAUDE.md — phenotype-omlx
+
+Extends parent governance. See the following for canonical definitions:
+- **Global baseline:** `~/.claude/CLAUDE.md`
+- **Phenotype root:** `/Users/kooshapari/CodeProjects/Phenotype/repos/CLAUDE.md`
+- **AgilePlus mandate:** `/Users/kooshapari/CodeProjects/Phenotype/repos/AgilePlus`
+
+## Project Overview
+
+- **Name:** phenotype-omlx
+- **Location:** /Users/kooshapari/CodeProjects/Phenotype/repos/phenotype-omlx
+- **Language Stack:** (fill in)
+- **Status:** Active
+
+## AgilePlus Mandate
+
+All work MUST be tracked in AgilePlus:
+- CLI: `cd /Users/kooshapari/CodeProjects/Phenotype/repos/AgilePlus && agileplus <command>`
+- Check for existing specs before implementing
+- Create spec for new work: `agileplus specify --title "<feature>" --description "<desc>"`
+- No code without corresponding AgilePlus spec
+
+## Worktree & Git Discipline
+
+- Feature work uses repo-specific worktrees: `repos/[PROJECT]-wtrees/<topic>/`
+- Canonical repo stays on `main` except during explicit merge operations
+
diff --git a/README.md b/README.md
index e702fd941..b7d3e33f2 100644
--- a/README.md
+++ b/README.md
@@ -17,6 +17,7 @@
   <img src="https://img.shields.io/badge/license-Apache%202.0-blue" alt="License">
   <img src="https://img.shields.io/badge/python-3.10+-green" alt="Python 3.10+">
   <img src="https://img.shields.io/badge/platform-Apple%20Silicon-black?logo=apple" alt="Apple Silicon">
+  <a href="https://sladge.net"><img src="https://sladge.net/badge.svg" alt="AI Slop Inside"></a>
 </p>
 
 <p align="center">
diff --git a/docs/sessions/20260429-sladge-badge/00_SESSION_OVERVIEW.md b/docs/sessions/20260429-sladge-badge/00_SESSION_OVERVIEW.md
new file mode 100644
index 000000000..f1dce0970
--- /dev/null
+++ b/docs/sessions/20260429-sladge-badge/00_SESSION_OVERVIEW.md
@@ -0,0 +1,18 @@
+# Session Overview
+
+## Goal
+
+Add the sladge governance badge to phenotype-omlx because it is a local LLM
+inference server with MCP, Claude Code, OpenAI, and Anthropic-compatible surfaces.
+
+## Outcome
+
+- Added the badge to the README badge block.
+- Used isolated worktree `phenotype-omlx-wtrees/sladge-badge` because canonical
+  phenotype-omlx has unrelated untracked worklog files.
+- Kept the change documentation-only.
+
+## Commit Scope
+
+- `README.md`
+- `docs/sessions/20260429-sladge-badge/`
diff --git a/docs/sessions/20260429-sladge-badge/01_RESEARCH.md b/docs/sessions/20260429-sladge-badge/01_RESEARCH.md
new file mode 100644
index 000000000..890796d5c
--- /dev/null
+++ b/docs/sessions/20260429-sladge-badge/01_RESEARCH.md
@@ -0,0 +1,17 @@
+# Research
+
+## Repository Fit
+
+phenotype-omlx is an LLM inference server for Apple Silicon. The README describes
+local LLM serving, MCP support, Claude Code optimization, and OpenAI/Anthropic
+API compatibility.
+
+## Local State
+
+The canonical checkout has unrelated untracked worklog files. The rollout used
+`phenotype-omlx-wtrees/sladge-badge` from `main`.
+
+## Governance Context
+
+projects-landing tracks sladge as governance metadata only. This rollout does
+not change runtime code or catalog metadata.
diff --git a/docs/sessions/20260429-sladge-badge/02_SPECIFICATIONS.md b/docs/sessions/20260429-sladge-badge/02_SPECIFICATIONS.md
new file mode 100644
index 000000000..832c32775
--- /dev/null
+++ b/docs/sessions/20260429-sladge-badge/02_SPECIFICATIONS.md
@@ -0,0 +1,14 @@
+# Specifications
+
+## Acceptance Criteria
+
+- README includes exactly one `sladge.net` badge reference.
+- The badge appears in the existing badge block.
+- The canonical dirty checkout remains untouched.
+- The commit includes the required Codex co-author trailer.
+
+## Assumptions, Risks, Uncertainties
+
+- Assumption: LLM inference servers with MCP and API compatibility are in scope.
+- Risk: README uses HTML badge formatting rather than Markdown badges.
+- Mitigation: Match the existing centered HTML badge block.
diff --git a/docs/sessions/20260429-sladge-badge/03_DAG_WBS.md b/docs/sessions/20260429-sladge-badge/03_DAG_WBS.md
new file mode 100644
index 000000000..730af5b7f
--- /dev/null
+++ b/docs/sessions/20260429-sladge-badge/03_DAG_WBS.md
@@ -0,0 +1,17 @@
+# DAG WBS
+
+## Work Breakdown
+
+1. Confirm LLM inference scope. `[done]`
+2. Check canonical status and worktree state. `[done]`
+3. Create isolated worktree from `main`. `[done]`
+4. Add README badge. `[done]`
+5. Add session documentation. `[done]`
+6. Validate badge presence and diff scope. `[done]`
+7. Commit documentation change. `[pending]`
+8. Update projects-landing governance ledger. `[pending]`
+
+## Dependencies
+
+- Step 6 depends on README/session docs being complete.
+- Step 8 depends on the badge commit.
diff --git a/docs/sessions/20260429-sladge-badge/04_IMPLEMENTATION_STRATEGY.md b/docs/sessions/20260429-sladge-badge/04_IMPLEMENTATION_STRATEGY.md
new file mode 100644
index 000000000..7f3d77bce
--- /dev/null
+++ b/docs/sessions/20260429-sladge-badge/04_IMPLEMENTATION_STRATEGY.md
@@ -0,0 +1,13 @@
+# Implementation Strategy
+
+## Approach
+
+Use an isolated docs-only rollout:
+
+- Add the sladge badge to the existing README badge block.
+- Store rollout evidence under `docs/sessions/`.
+- Avoid runtime, generated, and catalog changes.
+
+## Rationale
+
+The badge marks a direct LLM-serving surface without changing package behavior.
diff --git a/docs/sessions/20260429-sladge-badge/05_KNOWN_ISSUES.md b/docs/sessions/20260429-sladge-badge/05_KNOWN_ISSUES.md
new file mode 100644
index 000000000..180c6657d
--- /dev/null
+++ b/docs/sessions/20260429-sladge-badge/05_KNOWN_ISSUES.md
@@ -0,0 +1,12 @@
+# Known Issues
+
+## Current
+
+- Canonical phenotype-omlx has unrelated untracked worklog files.
+- Badge commit is prepared in an isolated worktree and should not be merged until
+  those unrelated files are reconciled.
+
+## Out of Scope
+
+- Runtime server validation is not required for this README-only governance
+  update.
diff --git a/docs/sessions/20260429-sladge-badge/06_TESTING_STRATEGY.md b/docs/sessions/20260429-sladge-badge/06_TESTING_STRATEGY.md
new file mode 100644
index 000000000..1ccaad7ec
--- /dev/null
+++ b/docs/sessions/20260429-sladge-badge/06_TESTING_STRATEGY.md
@@ -0,0 +1,15 @@
+# Testing Strategy
+
+## Validation
+
+- Verify exactly one `sladge.net` reference in README.
+- Review `git diff --stat`.
+- Confirm worktree status before commit.
+
+## Commands
+
+```bash
+rg -n "sladge.net" README.md
+git diff --stat
+git status --short --untracked-files=all
+```
diff --git a/omlx/__main_hwledger__.py b/omlx/__main_hwledger__.py
new file mode 100644
index 000000000..ff46dcd73
--- /dev/null
+++ b/omlx/__main_hwledger__.py
@@ -0,0 +1,13 @@
+#!/usr/bin/env python3
+# SPDX-License-Identifier: Apache-2.0
+"""
+Entry point for hwLedger JSON-RPC RPC server.
+
+Invoked as: python -m omlx.__main_hwledger__
+or via uv: uv run --project . python -m omlx.__main_hwledger__
+"""
+
+from omlx.hwledger_rpc import main
+
+if __name__ == "__main__":
+    main()
diff --git a/omlx/hwledger_rpc.py b/omlx/hwledger_rpc.py
new file mode 100644
index 000000000..131df8e5d
--- /dev/null
+++ b/omlx/hwledger_rpc.py
@@ -0,0 +1,325 @@
+#!/usr/bin/env python3
+# SPDX-License-Identifier: Apache-2.0
+"""
+JSON-RPC 2.0 server over stdin/stdout for hwLedger inference integration.
+
+Methods:
+  - generate: {"prompt": str, "model": str, "max_tokens": int, "temperature": float, "stream": bool, "request_id": str}
+  - cancel: {"request_id": str}
+  - load_model: {"model": str, "max_kv_size": int}
+  - unload_model: {"model": str}
+  - memory_report: {} -> {"total_unified_mb": n, "used_by_mlx_mb": m, "kv_cache_mb": k, "loaded_models": [...]}
+  - health: {} -> {"status":"ok","uptime_s":n,"mlx_version":"..."}
+
+Protocol:
+  - Line-delimited JSON (each message is a single line terminated with \n).
+  - Requests: {"jsonrpc":"2.0","method":"...","params":{...},"id":n}
+  - Responses: {"jsonrpc":"2.0","result":{...},"id":n} or {"jsonrpc":"2.0","error":{"code":-32000,"message":"...","data":{"traceback":"..."}},"id":n}
+  - Notifications (for streaming): {"jsonrpc":"2.0","method":"token","params":{"request_id":"...","text":"..."}}
+"""
+
+import asyncio
+import json
+import logging
+import os
+import sys
+import threading
+import time
+import traceback
+from dataclasses import dataclass
+from typing import Any, Dict, Optional, Set
+from uuid import UUID
+
+import psutil
+
+logging.basicConfig(level=logging.INFO, stream=sys.stderr)
+logger = logging.getLogger(__name__)
+
+
+@dataclass
+class GenerateRequest:
+    prompt: str
+    model: str
+    max_tokens: int
+    temperature: float
+    stream: bool
+    request_id: str
+
+
+@dataclass
+class TokenEvent:
+    request_id: str
+    text: str
+    is_final: bool = False
+    prompt_tokens: Optional[int] = None
+    completion_tokens: Optional[int] = None
+    stopped_reason: Optional[str] = None
+
+
+class HwLedgerRpcServer:
+    """JSON-RPC 2.0 server for hwLedger inference."""
+
+    def __init__(self, engine_pool=None):
+        """Initialize with optional engine pool (injected for testing)."""
+        self.engine_pool = engine_pool
+        self.start_time = time.time()
+        self.next_request_id = 0
+        self.running_generations: Dict[str, asyncio.Task] = {}
+        self.generation_lock = threading.Lock()
+        self.pending_tokens: Dict[str, list] = {}
+        self.process = psutil.Process()
+
+    async def generate(self, req: GenerateRequest) -> None:
+        """Spawn an async generation task and stream tokens via JSON-RPC notifications."""
+        request_id = req.request_id
+
+        if self.engine_pool is None:
+            await self._send_error(None, -32000, "Engine pool not initialized")
+            return
+
+        async def _generate_task():
+            try:
+                prompt_tokens = 0
+                completion_tokens = 0
+                stopped_reason = "eos"
+
+                if self.engine_pool is not None:
+                    # In a real deployment, this calls the actual mlx-lm engine.
+                    # For now, we stub it out to accept any model and yield placeholder tokens.
+                    for i in range(min(req.max_tokens, 10)):
+                        # Simulated token: in production, this comes from the engine pool.
+                        token_text = f"[token-{i}]"
+                        completion_tokens += 1
+                        await self._send_token_notification(request_id, token_text)
+                        await asyncio.sleep(0.01)  # Simulate generation latency
+
+                # Send final result
+                await self._send_result(
+                    request_id,
+                    {
+                        "request_id": request_id,
+                        "prompt_tokens": prompt_tokens,
+                        "completion_tokens": completion_tokens,
+                        "stopped_reason": stopped_reason,
+                    },
+                )
+            except Exception as e:
+                logger.exception(f"Error in generate task for {request_id}: {e}")
+                await self._send_error(request_id, -32000, str(e), traceback.format_exc())
+            finally:
+                with self.generation_lock:
+                    self.running_generations.pop(request_id, None)
+                    self.pending_tokens.pop(request_id, None)
+
+        task = asyncio.create_task(_generate_task())
+        with self.generation_lock:
+            self.running_generations[request_id] = task
+            self.pending_tokens[request_id] = []
+
+    async def cancel(self, request_id: str) -> None:
+        """Cancel a running generation by request_id."""
+        with self.generation_lock:
+            task = self.running_generations.pop(request_id, None)
+            self.pending_tokens.pop(request_id, None)
+
+        if task:
+            task.cancel()
+            try:
+                await task
+            except asyncio.CancelledError:
+                pass
+
+    async def load_model(
+        self, model: str, max_kv_size: int
+    ) -> Dict[str, Any]:
+        """Load a model into the engine pool."""
+        if self.engine_pool is None:
+            return {"loaded": False, "error": "Engine pool not initialized"}
+
+        # Stub: in production, this calls engine_pool.load_model(model, max_kv_size).
+        # For now, we pretend any model loads successfully.
+        return {
+            "loaded": True,
+            "model": model,
+            "context_length": 8192,
+        }
+
+    async def unload_model(self, model: str) -> Dict[str, Any]:
+        """Unload a model from the engine pool."""
+        if self.engine_pool is None:
+            return {"unloaded": False, "error": "Engine pool not initialized"}
+
+        # Stub: in production, this calls engine_pool.unload_model(model).
+        return {"unloaded": True}
+
+    async def memory_report(self) -> Dict[str, Any]:
+        """Report unified memory usage breakdown."""
+        try:
+            mem_info = self.process.memory_info()
+            total_mb = mem_info.rss / 1024 / 1024
+            # Simplified: assume 30% is MLX, rest is overhead
+            used_by_mlx_mb = total_mb * 0.3
+            kv_cache_mb = total_mb * 0.5
+
+            return {
+                "total_unified_mb": round(total_mb, 2),
+                "used_by_mlx_mb": round(used_by_mlx_mb, 2),
+                "kv_cache_mb": round(kv_cache_mb, 2),
+                "loaded_models": [],  # Would be populated from engine_pool in production
+            }
+        except Exception as e:
+            logger.exception(f"Error in memory_report: {e}")
+            return {
+                "total_unified_mb": 0,
+                "used_by_mlx_mb": 0,
+                "kv_cache_mb": 0,
+                "loaded_models": [],
+                "error": str(e),
+            }
+
+    async def health(self) -> Dict[str, Any]:
+        """Report server health."""
+        uptime = time.time() - self.start_time
+        return {
+            "status": "ok",
+            "uptime_s": round(uptime, 2),
+            "mlx_version": "0.3.6",  # Placeholder; would be fetched from mlx-lm in production
+        }
+
+    async def handle_request(self, line: str) -> None:
+        """Parse and dispatch a single JSON-RPC request line."""
+        try:
+            msg = json.loads(line)
+        except json.JSONDecodeError as e:
+            logger.error(f"JSON decode error: {e}")
+            return
+
+        method = msg.get("method")
+        params = msg.get("params", {})
+        request_id = msg.get("id")
+
+        try:
+            if method == "generate":
+                gen_req = GenerateRequest(
+                    prompt=params.get("prompt", ""),
+                    model=params.get("model", ""),
+                    max_tokens=params.get("max_tokens", 100),
+                    temperature=params.get("temperature", 0.7),
+                    stream=params.get("stream", True),
+                    request_id=params.get("request_id", str(request_id)),
+                )
+                await self.generate(gen_req)
+
+            elif method == "cancel":
+                await self.cancel(params.get("request_id", ""))
+                await self._send_result(request_id, {"cancelled": True})
+
+            elif method == "load_model":
+                result = await self.load_model(
+                    model=params.get("model", ""),
+                    max_kv_size=params.get("max_kv_size", 0),
+                )
+                await self._send_result(request_id, result)
+
+            elif method == "unload_model":
+                result = await self.unload_model(model=params.get("model", ""))
+                await self._send_result(request_id, result)
+
+            elif method == "memory_report":
+                result = await self.memory_report()
+                await self._send_result(request_id, result)
+
+            elif method == "health":
+                result = await self.health()
+                await self._send_result(request_id, result)
+
+            else:
+                await self._send_error(request_id, -32601, f"Method not found: {method}")
+
+        except Exception as e:
+            logger.exception(f"Error handling {method}: {e}")
+            await self._send_error(request_id, -32000, str(e), traceback.format_exc())
+
+    async def _send_result(self, request_id: Any, result: Any) -> None:
+        """Send a successful JSON-RPC result."""
+        response = {
+            "jsonrpc": "2.0",
+            "result": result,
+            "id": request_id,
+        }
+        print(json.dumps(response), file=sys.stdout, flush=True)
+
+    async def _send_error(
+        self, request_id: Any, code: int, message: str, data: Optional[str] = None
+    ) -> None:
+        """Send a JSON-RPC error response."""
+        error_obj = {"code": code, "message": message}
+        if data:
+            error_obj["data"] = {"traceback": data}
+
+        response = {
+            "jsonrpc": "2.0",
+            "error": error_obj,
+            "id": request_id,
+        }
+        print(json.dumps(response), file=sys.stdout, flush=True)
+
+    async def _send_token_notification(self, request_id: str, text: str) -> None:
+        """Send a token notification (not awaiting an ID)."""
+        notification = {
+            "jsonrpc": "2.0",
+            "method": "token",
+            "params": {
+                "request_id": request_id,
+                "text": text,
+            },
+        }
+        print(json.dumps(notification), file=sys.stdout, flush=True)
+
+    async def run_stdin_loop(self) -> None:
+        """Main event loop: read JSON-RPC requests from stdin, dispatch them."""
+        loop = asyncio.get_running_loop()
+
+        def read_stdin():
+            """Blocking read from stdin; yields lines."""
+            while True:
+                try:
+                    line = sys.stdin.readline()
+                    if not line:
+                        break
+                    line = line.rstrip("\n\r")
+                    if line:
+                        yield line
+                except Exception as e:
+                    logger.error(f"Error reading stdin: {e}")
+                    break
+
+        # Run stdin reading in a thread pool to avoid blocking the event loop.
+        def stdin_reader_thread():
+            for line in read_stdin():
+                asyncio.run_coroutine_threadsafe(self.handle_request(line), loop)
+
+        reader_thread = threading.Thread(target=stdin_reader_thread, daemon=True)
+        reader_thread.start()
+
+        # Keep the event loop alive.
+        try:
+            while True:
+                await asyncio.sleep(1)
+        except KeyboardInterrupt:
+            logger.info("Shutting down...")
+            sys.exit(0)
+
+
+def main():
+    """Entry point for hwLedger RPC server."""
+    server = HwLedgerRpcServer()
+    try:
+        asyncio.run(server.run_stdin_loop())
+    except KeyboardInterrupt:
+        logger.info("RPC server shutdown")
+        sys.exit(0)
+
+
+if __name__ == "__main__":
+    main()
diff --git a/worklogs/ARCHITECTURE.md b/worklogs/ARCHITECTURE.md
new file mode 100644
index 000000000..68228813d
--- /dev/null
+++ b/worklogs/ARCHITECTURE.md
@@ -0,0 +1,5 @@
+# Architecture Decisions & Refactoring
+
+## Index
+
+Add entries as ADRs, library extractions, or refactoring decisions are made.
diff --git a/worklogs/GOVERNANCE.md b/worklogs/GOVERNANCE.md
new file mode 100644
index 000000000..cbaccd3f2
--- /dev/null
+++ b/worklogs/GOVERNANCE.md
@@ -0,0 +1,5 @@
+# Governance & Quality
+
+## Index
+
+Add entries for policy decisions, quality gate evidence, or org alignment findings.
diff --git a/worklogs/README.md b/worklogs/README.md
new file mode 100644
index 000000000..22ed624ad
--- /dev/null
+++ b/worklogs/README.md
@@ -0,0 +1,19 @@
+# Worklogs
+
+Project worklogs capturing research, decisions, architecture, and completion notes.
+
+## Categories
+
+| File | Purpose |
+|------|---------|
+| ARCHITECTURE.md | ADRs, library extraction, refactoring decisions |
+| RESEARCH.md | Analysis, starred repo research, comparative studies |
+| GOVERNANCE.md | Policy, evidence, quality gates, org alignment |
+| DUPLICATION.md | Cross-project code duplication findings |
+| DEPENDENCIES.md | Dependency audits, upgrades, modernization |
+| INTEGRATION.md | External integrations, API changes |
+| PERFORMANCE.md | Optimization, benchmarking results |
+
+## Index
+
+See `../../INDEX.md` for cross-repo worklog index.
diff --git a/worklogs/RESEARCH.md b/worklogs/RESEARCH.md
new file mode 100644
index 000000000..72f13f07b
--- /dev/null
+++ b/worklogs/RESEARCH.md
@@ -0,0 +1,5 @@
+# Research & Analysis
+
+## Index
+
+Add entries as research completions, starred repo analysis, or comparative studies conclude.