fierro-ltd/hermes-agent-solution-template

Hermes Agent Solution Template (HAST)

A production-ready template for building AI agent workflows with durable orchestration and human-in-the-loop review. Ships with an exam grading demo that showcases the full pattern: AI evaluation → professor review → approval.

Architecture

graph TB
    subgraph "Client"
        Browser["Browser<br/>(React SPA)"]
        LibreChat["LibreChat UI<br/>:8080"]
    end

    subgraph "Docker Compose Network"
        subgraph "Application Layer"
            API["api<br/>FastAPI + Static Files<br/>:8000"]
            Auth["auth<br/>better-auth (Hono)<br/>:3100"]
            Worker["worker<br/>Temporal Worker"]
            LC["librechat<br/>LibreChat v0.8.4<br/>:3080"]
        end

        subgraph "Infrastructure Layer"
            Temporal["temporal<br/>Temporal Server<br/>:7233"]
            TemporalUI["temporal-ui<br/>Temporal Web UI<br/>:8233"]
            Hermes["hermes-gateway<br/>AI Agent Gateway<br/>:8642"]
            Postgres["postgres<br/>PostgreSQL 15<br/>:5432"]
            MongoDB["mongodb<br/>MongoDB 8.0<br/>:27017"]
            MC["mission-control<br/>Agent Dashboard<br/>:3001"]
        end

        HermesVol[("Shared Volume<br/>(Hermes logs)")]
    end

    Browser -->|"HTTP :8000<br/>API + SPA + Auth"| API
    LibreChat -->|"HTTP :8080 (Caddy)<br/>Chat UI"| LC
    API -->|"HTTP :3100<br/>Auth proxy"| Auth

    API -->|"gRPC :7233<br/>Start workflows, signals"| Temporal
    API -->|"SQL :5432<br/>Submissions, reviews"| Postgres
    API -->|"SSE :8642<br/>Stream evaluation"| Hermes

    Auth -->|"SQL :5432<br/>Users, sessions"| Postgres

    Worker -->|"gRPC :7233<br/>Poll tasks, report results"| Temporal
    Worker -->|"HTTP :8642<br/>Chat completions + responses"| Hermes
    Worker -->|"SQL :5432<br/>Update grades"| Postgres

    Temporal -->|"SQL :5432<br/>Workflow state"| Postgres
    TemporalUI -->|"gRPC :7233"| Temporal

    LC -->|"HTTP :8642<br/>Chat completions"| Hermes
    LC -->|"MongoDB :27017<br/>Conversations"| MongoDB

    Hermes -->|"HTTPS<br/>LLM API"| LLM["External LLM Provider<br/>(OpenCode Go, OpenRouter)"]

    Hermes -->|"Write logs"| HermesVol
    MC -->|"Read logs"| HermesVol

The template provides ten Docker containers orchestrated via Docker Compose. This is the core infrastructure you get out of the box -- swap the demo-specific pieces for your own domain logic:

Container Role Port
postgres PostgreSQL 15 — data storage, auth, workflow state 5432
temporal Temporal Server — durable workflow orchestration 7233
temporal-ui Temporal Web UI — workflow visibility 8233
hermes-gateway Hermes AI Agent — OpenAI-compatible API with web, file, vision tools 8642
auth better-auth — email/password + OAuth authentication 3100
api FastAPI backend + React frontend (SPA) 8000
worker Temporal worker — executes workflow activities (no exposed port)
mongodb MongoDB 8.0 — LibreChat conversation storage 27017
librechat LibreChat v0.8.4 — direct chat UI for Hermes Agent 3080 (via Caddy on 8080)
mission-control Mission Control — agent observability dashboard 3001

Use Cases

This template implements a generic AI evaluate → human review → approve/reject pattern. The included grading demo is one example. You could fork this template to build:

  • Document review — AI summarizes and flags issues in contracts, proposals, or reports; reviewers approve or request revisions
  • Code review — AI analyzes pull requests or code submissions against style guides and best practices; senior engineers review
  • Content moderation — AI flags user-generated content; moderators make final decisions
  • Application screening — AI evaluates job applications or grant proposals against criteria; hiring managers review shortlisted candidates
  • Medical report analysis — AI extracts findings from lab results or imaging reports; physicians confirm or annotate

Each of these follows the same workflow pattern: submit content → AI evaluates against configurable criteria → human reviews AI output → approve, reject, or request re-evaluation.
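The pattern can be modeled as a small state machine. This pure-Python sketch is illustrative only -- the status names mirror the demo, but the `Submission` dataclass and helper functions are not the template's actual API:

```python
from dataclasses import dataclass, field
from enum import Enum


class Status(Enum):
    EVALUATING = "evaluating"
    REVIEW = "review"
    APPROVED = "approved"
    REJECTED = "rejected"


@dataclass
class Submission:
    content: str
    status: Status = Status.EVALUATING
    ai_feedback: dict = field(default_factory=dict)


def evaluate(sub: Submission, score: int) -> None:
    # AI evaluation produces structured feedback, then hands off to a human.
    sub.ai_feedback = {"score": score}
    sub.status = Status.REVIEW


def review(sub: Submission, approve: bool) -> None:
    # The human reviewer makes the final call; a re-evaluation request
    # would instead reset the status to EVALUATING.
    if sub.status is not Status.REVIEW:
        raise ValueError("submission is not awaiting review")
    sub.status = Status.APPROVED if approve else Status.REJECTED


sub = Submission("essay.txt")
evaluate(sub, score=87)
review(sub, approve=True)
print(sub.status.value)  # approved
```

In the template itself, the "wait for human review" step is a durable Temporal workflow rather than an in-memory object, but the transitions are the same.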

Demo: Exam Grading Workflow

The included demo implements an exam grading use case to showcase the full template pattern. Professors upload student submissions, the Hermes AI agent evaluates them against a configurable rubric (with real-time SSE streaming and agent trace visibility), and professors review and finalize grades.

sequenceDiagram
    participant P as Professor
    participant UI as Web UI
    participant API as FastAPI
    participant T as Temporal
    participant W as Worker
    participant H as Hermes Agent
    participant LLM as LLM Provider

    P->>UI: Upload submission
    UI->>API: POST /api/submissions
    API->>API: Store in PostgreSQL
    API->>T: Start GradingWorkflow
    API-->>UI: 201 Created (status: evaluating)

    T->>W: Schedule evaluate_submission
    W->>H: POST /v1/responses (rubric + submission)
    H->>LLM: Forward with tools (web, file, vision)
    LLM-->>H: Structured JSON feedback + trace
    H-->>W: {score, strengths, weaknesses, reasoning, trace}
    W->>W: Store review + agent_trace in PostgreSQL
    W-->>T: Activity complete

    T->>W: Schedule notify_reviewer
    W->>W: Update status → "review"
    W-->>T: Activity complete

    Note over T: Workflow WAITS for signal (up to 7 days)

    P->>UI: Open submission (SSE streaming panel)
    UI->>API: GET /api/submissions/:id/stream
    API-->>UI: SSE events (live Hermes output)

    P->>UI: Review agent feedback + trace accordion
    P->>UI: Click Approve
    UI->>API: POST /api/submissions/:id/review
    API->>T: Signal: review_decision(approved)

    T->>W: Schedule record_final_grade
    W->>W: Update status → "approved"
    W-->>T: Workflow COMPLETE


The workflow is durable -- if the server crashes during grading, it resumes exactly where it left off. LLM calls use heartbeats for crash detection, idempotent writes prevent duplicates on retry, and rate limits are respected automatically.
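Idempotent writes are what make retries safe: if an activity crashes after writing but before reporting success, Temporal re-runs it, and the second write must not duplicate the first. A common way to get this is an upsert keyed on the submission ID. This sketch uses SQLite for self-containment (the template uses PostgreSQL, which supports the same `ON CONFLICT ... DO UPDATE` syntax); the table and column names are illustrative, not the template's actual schema:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE reviews (submission_id TEXT PRIMARY KEY, score INTEGER)"
)

def record_score(submission_id: str, score: int) -> None:
    # Upsert keyed on the primary key: a retried activity overwrites
    # its own previous write instead of inserting a duplicate row.
    conn.execute(
        "INSERT INTO reviews (submission_id, score) VALUES (?, ?) "
        "ON CONFLICT(submission_id) DO UPDATE SET score = excluded.score",
        (submission_id, score),
    )
    conn.commit()

record_score("sub-1", 87)
record_score("sub-1", 87)  # retry after a simulated crash: no duplicate
rows = conn.execute("SELECT COUNT(*), MAX(score) FROM reviews").fetchone()
print(rows)  # (1, 87)
```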

Quick Start

# 1. Clone and enter the project
git clone https://github.com/fierro-ltd/hermes-agent-solution-template.git
cd hermes-agent-solution-template

# 2. Create your local environment file
cp infra/local/.env.example infra/shared/.env

# 3. Edit .env with your API keys
#    At minimum, set OPENCODE_GO_API_KEY, HERMES_API_KEY, and TAVILY_API_KEY

# 4. Start all services
docker compose -f infra/shared/docker-compose.yml \
               -f infra/local/docker-compose.override.yml \
               up --build

# 5. Open the application
#    Web UI:       http://localhost:8000
#    LibreChat:    http://localhost:8080
#    Temporal UI:  http://localhost:8233
#    Mission Control: http://localhost:3001
#    Auth (proxied): http://localhost:8000/api/auth

Test Credentials (Local Development)

On first run, sign up using the web UI at http://localhost:8000/login. For consistent testing:

Field Value
Email [email protected]
Password TestPassword123!
Name Professor Test

Click "Sign up" on first use, then "Sign in" on subsequent runs.

Note: In dev mode (no AUTH_SECRET set), the backend bypasses authentication entirely. The frontend still requires sign-in via the auth service.

LibreChat

LibreChat provides a full-featured chat UI for direct interaction with the Hermes Agent. It is served via Caddy on port 8080 and stores conversation history in MongoDB.

  • Access: http://localhost:8080
  • Backend: Connects to hermes-gateway on port 8642 using the OpenAI-compatible /v1/chat/completions endpoint
  • Storage: Conversation history persisted to MongoDB 8.0
  • Use case: Explore Hermes Agent capabilities, test tool calls (web search, file, vision), or prototype prompts before integrating into the grading workflow
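When `"stream": true` is set on the `/v1/chat/completions` endpoint, responses arrive as OpenAI-style server-sent events, which is how both LibreChat and the web UI render tokens live. This sketch parses such a stream; the chunk contents are hand-written sample data, not captured gateway output:

```python
import json

# OpenAI-style SSE stream: each event is a "data: <json>" line, and the
# stream terminates with a literal "data: [DONE]" sentinel.
raw_stream = (
    'data: {"choices": [{"delta": {"content": "Well"}}]}\n\n'
    'data: {"choices": [{"delta": {"content": " done."}}]}\n\n'
    "data: [DONE]\n\n"
)

text = ""
for line in raw_stream.splitlines():
    if not line.startswith("data: "):
        continue  # skip blank keep-alive lines between events
    data = line[len("data: "):]
    if data == "[DONE]":
        break
    chunk = json.loads(data)
    # Deltas carry incremental content; some chunks (e.g. role-only) omit it.
    text += chunk["choices"][0]["delta"].get("content", "")

print(text)  # Well done.
```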

Authentication

The template uses better-auth for self-hosted authentication. Three sign-in methods are supported:

  • Email + Password -- always available
  • Google OAuth -- requires GOOGLE_CLIENT_ID and GOOGLE_CLIENT_SECRET in .env
  • GitHub OAuth -- requires GITHUB_CLIENT_ID and GITHUB_CLIENT_SECRET in .env

OAuth providers are optional. If their env vars are empty, the buttons appear but won't work. See Google Cloud Console and GitHub Developer Settings to create OAuth apps.

Configuration

All configuration is managed through environment variables. Copy infra/local/.env.example to infra/shared/.env and fill in required values.

Variable Description Default
HERMES_MODEL_PROVIDER LLM provider for Hermes agent opencode-go
OPENCODE_GO_API_KEY LLM provider API key (OpenCode Go, OpenRouter, etc.) required
HERMES_API_KEY Self-assigned key for Hermes gateway auth required
TAVILY_API_KEY Tavily API key for Hermes web search tool required
AUTH_SECRET Random string for signing auth sessions required in production
AUTH_SERVICE_URL Internal URL of the better-auth service (API proxy) http://auth:3100
GOOGLE_CLIENT_ID Google OAuth client ID optional
GOOGLE_CLIENT_SECRET Google OAuth client secret optional
GITHUB_CLIENT_ID GitHub OAuth app client ID optional
GITHUB_CLIENT_SECRET GitHub OAuth app client secret optional
TASK_QUEUE Temporal task queue name grading-queue
GRADING_TIMEOUT_DAYS Days before unreviewed submissions expire 7
MC_AUTH_USER Mission Control dashboard username required
MC_AUTH_PASS Mission Control dashboard password required
MC_API_KEY Mission Control API key for agent reporting required
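The table distinguishes required, defaulted, and optional variables. A hedged sketch of how a service might load them -- the variable names match the table, but the `require` helper is illustrative, not the template's actual settings code:

```python
import os

def require(name: str) -> str:
    # Fail fast with a pointer to the env file rather than at first use.
    value = os.environ.get(name)
    if not value:
        raise RuntimeError(f"{name} is required -- set it in infra/shared/.env")
    return value

# Demo value so this sketch runs standalone; normally Docker Compose
# injects these from infra/shared/.env.
os.environ.setdefault("HERMES_API_KEY", "dev-key")

hermes_api_key = require("HERMES_API_KEY")                   # required
task_queue = os.environ.get("TASK_QUEUE", "grading-queue")   # has a default
google_id = os.environ.get("GOOGLE_CLIENT_ID")               # optional, may be None
print(task_queue)
```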

Documentation

See the docs/ directory for detailed guides:

Document Description
Architecture System design, container diagram, SSE streaming, agent trace, LibreChat integration
API Reference REST endpoint reference including SSE stream and trace endpoints
Deployment Local dev, Docker Compose, production (Hetzner/Lightsail), SSL
Workflows Temporal workflow engine, signals, human-in-the-loop pattern, trace capture
Customization How to fork the template, configure Hermes tools, and build your own use case
Data Model ER diagram, all tables, agent_trace schema, MongoDB, migration strategy

Development

Project Structure

hermes-agent-solution-template/
├── frontend/                  # React SPA (Vite + TanStack + shadcn/ui)
├── services/
│   ├── api/                   # FastAPI application
│   ├── auth/                  # better-auth service (Hono + Node.js)
│   ├── workers/               # Temporal worker processes
│   └── hermes/                # Hermes agent config, hooks, + Dockerfile
├── infra/
│   ├── shared/                # Base Docker Compose + SQL init
│   ├── local/                 # Local dev overrides
│   └── hetzner/               # Production deployment
├── src/hermes_agent_solution_template/  # Shared Python package
├── docs/                      # Documentation
├── tests/                     # Test suite
├── scripts/                   # Utility scripts
└── sample_data/               # Sample submissions for testing

Running Tests

PYTHONPATH=.:src pytest tests/ -v

Frontend Development

cd frontend && pnpm install && pnpm dev

The Vite dev server proxies API calls to the FastAPI backend at :8000.

Database

The database schema is initialized automatically by infra/shared/init-db.sql when the postgres container starts for the first time. To reset the database, remove the postgres_data volume:

docker compose -f infra/shared/docker-compose.yml down -v

Recording Demo Videos

Demo videos are recorded using WebReel, a CLI tool that drives a headless browser through scripted interactions and outputs MP4/GIF/WebM files. WebReel is installed globally (not as a project dependency) so it can be reused across projects.

Setup

# Install WebReel globally (one-time)
npm install -g webreel

# Copy the example config
cp webreel.config.json.example webreel.config.json

Configuration

Edit webreel.config.json to match your environment:

  • baseUrl — Point to your deployment (https://your-domain.com or http://localhost:8000)
  • Submission IDs — Update the navigate URLs in Acts 4 and 5 to reference completed submissions with scores and agent traces (the example config includes placeholder IDs from the Hetzner deployment)

Recording

# Preview in a visible browser (no video output)
webreel preview

# Record the demo video
webreel record

# Output: videos/hast-demo.mp4

The example config defines a single continuous video with 5 acts: Home page tour, Settings peek, Hermes Chat, text submission demo (Math Algebra Exam), and image submission demo (scanned UTN exam with vision). Each demo submission shows the real evaluating state briefly, then navigates to a pre-evaluated submission to show results without the 30-140s agent wait time.

Note: webreel.config.json and videos/ are gitignored. Only the .example template is committed.

