market-data-service

Market Data Service Documentation

Overview

The Market Data Service is a production-ready microservice designed to fetch market data, process it through a streaming pipeline, and serve it via REST APIs. It integrates with external market data providers, processes raw data into meaningful insights (e.g., moving averages), and provides real-time updates via Kafka. The service is built with FastAPI, PostgreSQL, and Apache Kafka, ensuring scalability, reliability, and modularity.

Features

Market Data Fetching:
- Fetches real-time price data for specified symbols from external providers (e.g., Yahoo Finance).
- Supports configurable polling intervals.
Streaming Pipeline:
- Publishes raw price data to Kafka (price-events topic).
- Computes 5-point moving averages and stores results in the symbol_averages table.
REST API:
- Provides endpoints for fetching the latest price and scheduling polling jobs.
- OpenAPI documentation for easy integration.
Database Integration:
- Stores raw market data, processed price points, polling job configurations, and moving averages.
- Optimized with indexes on timestamp and symbol.
Dockerized Deployment:
- Fully containerized with Docker Compose for local development and production environments.
- Includes PostgreSQL, Kafka, and FastAPI services.

Architecture

Data Flow Diagram

sequenceDiagram
    participant C as Client
    participant A as FastAPI
    participant M as Market API
    participant K as Kafka
    participant MA as MA Consumer
    participant DB as PostgreSQL
    
    C->>A: GET /prices/latest
    A->>DB: Check cache
    alt Cache miss
        A->>M: Fetch latest price
        M-->>A: Price data
        A->>DB: Store raw response
        A->>K: Produce price event
    end
    A-->>C: Return price
    
    K->>MA: Consume price event
    MA->>DB: Fetch last 5 prices
    MA->>MA: Calculate MA
    MA->>DB: Store MA result

API Documentation

Endpoints

GET /prices/latest

Fetch the latest price for a given symbol.

Query Parameters:

symbol (required): The stock symbol (e.g., AAPL).
provider (optional): The data provider (default: Yahoo Finance).

Response:

{
  "symbol": "AAPL",
  "price": 150.25,
  "timestamp": "2024-03-20T10:30:00Z",
  "provider": "yahoo_finance"
}

POST /prices/poll

Schedule a polling job for specified symbols.

Request Body:

{
  "symbols": ["AAPL", "MSFT"],
  "interval": 60,
  "provider": "yahoo_finance"
}

Response:

{
  "job_id": "poll_123",
  "status": "accepted",
  "config": {
    "symbols": ["AAPL", "MSFT"],
    "interval": 60
  }
}

Database Schema

Tables

polling_job:
- Stores configurations for polling jobs.
- Columns: id, symbols, interval, provider, created_at, status.
price_point:
- Stores raw price data fetched from external providers.
- Columns: id, symbol, timestamp, price, provider, raw_response_id.
symbol_averages:
- Stores computed moving averages for symbols.
- Columns: id, symbol, timestamp, average_price.
raw_market_data:
- Stores raw responses from external market data providers.
- Columns: id, symbol, timestamp, source, data.

Setup Instructions

Prerequisites

Docker and Docker Compose installed.
Python 3.10+ installed locally (optional for development).

Steps

Clone the repository:

git clone https://github.com/an-siuu-man/market-data-service
cd market-data-service

Build and start the services:
```
docker-compose up --build
```
Access the API documentation:
- Open http://localhost:8000/docs in your browser.

Troubleshooting

Database Connection Refused:
- Ensure the db service is running and accessible.
- Check the DATABASE_URL environment variable for correct credentials.
Kafka Topic Not Found:
- Ensure the producer has published at least one message to the price-events topic.
- Create the topic manually using Kafka CLI tools.

Repo Structure

market-data-service/
├── app/
│   ├── api/                # FastAPI routes
│   ├── core/               # Core utilities (e.g., database)
│   ├── models/             # SQLAlchemy models
│   ├── services/           # External integrations (e.g., Kafka, providers)
│   ├── schemas/            # Pydantic schemas
│   └── main.py             # FastAPI entrypoint
├── migrations/             # Alembic migration scripts
├── postgres-data/          # Local Postgres data (for Docker)
├── scripts/                # Standalone scripts (e.g., poller, init_db)
├── myenv/                  # Python virtual environment (local)
├── Dockerfile              # Docker build file
├── docker-compose.yml      # Docker Compose config
├── requirements.txt        # Project dependencies
├── README.md               # Project documentation

Future Improvements

Caching:
- Add Redis for caching frequently accessed data.
Rate Limiting:
- Implement rate limiting for API endpoints.
Monitoring:
- Integrate Prometheus and Grafana for system monitoring.
Deployment:
- Deploy to AWS or Heroku for production use.

Note on Missing Features

Due to limited time and the steep learning curve of tools such as SQLAlchemy, Pydantic, and FastAPI, this project does not currently include a Postman collection, automated testing, or Redis caching. These features are valuable and were recommended in the assignment description, but I decided to focus on core functionality and learning the main frameworks in this implementation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

market-data-service

Market Data Service Documentation

Overview

Features

Architecture

Data Flow Diagram

API Documentation

Endpoints

GET /prices/latest

POST /prices/poll

Database Schema

Tables

Setup Instructions

Prerequisites

Steps

Troubleshooting

Repo Structure

Future Improvements

Note on Missing Features

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
app		app
migrations		migrations
scripts		scripts
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

market-data-service

Market Data Service Documentation

Overview

Features

Architecture

Data Flow Diagram

API Documentation

Endpoints

GET /prices/latest

POST /prices/poll

Database Schema

Tables

Setup Instructions

Prerequisites

Steps

Troubleshooting

Repo Structure

Future Improvements

Note on Missing Features

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages