Lead Qualifier Agent

AI-powered lead qualification chatbot that collects patient data through a conversational funnel, extracts structured variables with LLM, and qualifies leads using RAG-based vector similarity search against a medical knowledge base.

Built with NestJS, LangGraph, Prisma, and pgvector.

What It Does

The agent simulates a WhatsApp-style conversation to qualify leads for a weight loss clinic. It guides users through a multi-step funnel, extracts structured data from natural language, and makes a qualification decision based on vector similarity matching.

Qualification flow:

collect_name → collect_birth_date → collect_weight_loss_reason → qualified / rejected

Each step uses LLM extraction to parse unstructured user messages into structured fields. The agent also detects corrections to previously collected data mid-conversation (e.g., "actually my name is Carlos").

The final qualification step compares the user's weight loss reason against a pgvector knowledge base using cosine similarity — leads with reasons that match known treatable conditions (distance ≤ 0.20) are qualified; others are rejected.

Status Lifecycle

Status	Trigger	Description
`active`	New conversation created	Funnel is in progress
`qualified`	Vector similarity ≤ 0.20	Lead matches treatable conditions
`rejected`	Vector similarity > 0.20	Reason doesn't match knowledge base
`expired`	15 min inactivity	Session timed out, new one created on next message

Tech Stack

Layer	Technology
Framework	NestJS 11 + TypeScript
Database	PostgreSQL + pgvector (Docker)
ORM	Prisma 6 with vector extension
Agent	LangGraph StateGraph
LLM	OpenAI, Google Gemini, or OpenRouter (configurable)
Embeddings	text-embedding-3-small / gemini-embedding-001
Frontend	Vanilla HTML/CSS/JS with dark theme

Architecture

src/
├── agent/
│   ├── graph/
│   │   ├── nodes/
│   │   │   ├── process-message.node.ts   # LLM extraction + correction detection
│   │   │   ├── qualify-lead.node.ts      # Vector similarity qualification
│   │   │   └── generate-response.node.ts # Contextual response generation
│   │   ├── graph.ts                      # LangGraph workflow definition
│   │   └── state.ts                      # FunnelState annotation
│   ├── llm/
│   │   ├── llm.factory.ts               # Multi-provider LLM factory
│   │   └── llm.tokens.ts                # DI tokens (CHAT_MODEL, EMBEDDINGS)
│   └── vector/
│       └── vector-store.service.ts       # pgvector similarity search
├── conversation/
│   ├── conversation.service.ts           # Funnel orchestration + session mgmt
│   └── conversation.controller.ts        # REST API endpoints
├── prisma/
│   └── prisma.service.ts                 # Database connection
└── health/
    └── health.controller.ts              # Health check endpoint

Key design decisions:

Dependency Injection via NestJS IoC — LLM provider is injected via token + factory, swappable through env config without code changes
LangGraph StateGraph — deterministic funnel flow with conditional routing to qualification node
Lazy session expiration — checked on message receipt, no background scheduler needed

Getting Started

Prerequisites

Node.js 22+
pnpm
Docker

Quick Start (Docker)

cp .env.example .env
# Edit .env — set LLM_PROVIDER and API key

make ai
# Builds and starts all containers (app + postgres + migrations)

Open http://localhost:3000 for the chat UI.

Manual Setup

# 1. Start database
docker compose up -d postgres

# 2. Install dependencies
pnpm install

# 3. Run migrations and seed vector store
pnpm db:migrate
pnpm db:seed

# 4. Start dev server
pnpm start:dev

LLM Provider Configuration

Set LLM_PROVIDER in .env to switch between providers:

Provider	Env Var	Default Model
`openai`	`OPENAI_API_KEY`	gpt-4o-mini
`google`	`GOOGLE_API_KEY`	gemini-2.5-flash
`openrouter`	`OPENROUTER_API_KEY`	openrouter/free

API

`POST /conversations/:phoneNumber/messages`

Send a message to the qualification funnel.

Request:

{ "content": "Oi, meu nome é João" }

Response:

{
  "type": "text",
  "content": "Obrigado, João! Qual é a sua data de nascimento?",
  "conversation": {
    "phoneNumber": "5511999999999",
    "status": "active",
    "funnelStep": "collect_birth_date",
    "variables": {
      "name": "João",
      "birthDate": null,
      "weightLossReason": null
    }
  }
}

`GET /conversations/:phoneNumber/status`

Get current conversation state and extracted variables.

`GET /health`

Health check endpoint.

Roadmap

Human-in-the-Loop Review

The current qualification step is fully automated — the agent qualifies or rejects a lead based solely on vector similarity score. A natural next step is adding human oversight for borderline cases.

Proposed approach using LangGraph's interrupt():

When the similarity score falls in a gray zone (e.g., 0.15 < score ≤ 0.25), the graph pauses and waits for a human decision before proceeding:

// qualify-lead.node.ts
import { interrupt } from '@langchain/langgraph';

if (topDistance > 0.15 && topDistance <= 0.25) {
  const decision = interrupt({
    message: 'Borderline lead — manual review required',
    score: topDistance,
    weightLossReason: state.weightLossReason,
  });
  return {
    qualified: decision.approved,
    funnelStep: decision.approved ? 'qualified' : 'rejected',
  };
}

What this requires:

Checkpointer — persist the paused graph state between requests. LangGraph provides a Postgres-backed checkpointer (@langchain/langgraph-checkpoint-postgres) that fits the existing stack.
Review endpoint — POST /conversations/:phoneNumber/review accepts { approved: boolean } and resumes the graph via graph.invoke(null, { thread_id }).
Review queue UI — a simple admin page listing paused conversations for human agents to approve or reject.

Why it matters: Fully automated qualification can misfire on edge cases. Human review on the boundary improves lead quality without adding friction to the majority of conversations that fall clearly inside or outside the threshold.

License

UNLICENSED

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
prisma		prisma
public		public
src		src
test		test
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.prettierrc		.prettierrc
CLAUDE.md		CLAUDE.md
Dockerfile		Dockerfile
Makefile		Makefile
README.md		README.md
docker-compose.yml		docker-compose.yml
eslint.config.mjs		eslint.config.mjs
nest-cli.json		nest-cli.json
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
tsconfig.build.json		tsconfig.build.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Lead Qualifier Agent

What It Does

Status Lifecycle

Tech Stack

Architecture

Getting Started

Prerequisites

Quick Start (Docker)

Manual Setup

LLM Provider Configuration

API

`POST /conversations/:phoneNumber/messages`

`GET /conversations/:phoneNumber/status`

`GET /health`

Roadmap

Human-in-the-Loop Review

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Lead Qualifier Agent

What It Does

Status Lifecycle

Tech Stack

Architecture

Getting Started

Prerequisites

Quick Start (Docker)

Manual Setup

LLM Provider Configuration

API

POST /conversations/:phoneNumber/messages

GET /conversations/:phoneNumber/status

GET /health

Roadmap

Human-in-the-Loop Review

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`POST /conversations/:phoneNumber/messages`

`GET /conversations/:phoneNumber/status`

`GET /health`

Packages