A Q-learning agent that learns to navigate a taxi in a 5×5 grid world, pick up passengers, and drop them at their destination.
Based on the classic Taxi-v3 environment. The taxi must:
- Navigate a 5×5 grid with walls
- Pick up a passenger from one of 4 locations (R, G, Y, B)
- Drop them off at another location
- Do this as efficiently as possible
Rewards:
- +20 for successful drop-off
- -1 for each step (encourages efficiency)
- -10 for illegal pickup/drop-off attempts
State space: 500 states (5×5 grid × 5 passenger locations × 4 destinations)
Actions: 6 (Up, Down, Left, Right, Pickup, Drop)
This project implements Q-learning, a model-free reinforcement learning algorithm. The agent learns a Q-table mapping state-action pairs to expected rewards through trial and error.
Q(s,a) ← Q(s,a) + α[r + γ·max(Q(s',a')) - Q(s,a)]
Hyperparameters:
- Learning rate (α): 0.1
- Discount factor (γ): 0.99
- Exploration rate (ε): 0.1
- Episodes: 5000
After training, the agent converges to a near-optimal policy and consistently solves the task in a minimal number of steps.
├── main.c # Training loop, visualization, Q-learning
├── taxi.c # Environment logic (step, reset, rewards)
├── taxi.h # Environment struct and function declarations
Requires raylib for visualization.
- Linux: raylib 5.5 is included in the repo, so no installation is needed.
- Other systems: install raylib via your package manager, or download the raylib source and build it manually.
# manually with gcc
gcc -o taxi main.c taxi.c -lraylib -lGL -lm -lpthread
# or using the nob build system
# (update `nob.c` with the appropriate file paths first)
./nob
MIT
