🏰 Escape the Castle: An RL Adventure

Welcome to Escape the Castle! This project features an AI agent, trained from scratch using Q-learning, that has one goal: navigate a dangerous, guard-filled castle and escape to freedom.

The Challenge

Our hero starts in a random location within a 7x7 castle grid. The exit is always at the far corner (6,6). But it's not that simple! The castle is a dynamic environment, patrolled by four unique guards, each with their own strengths and weaknesses. The agent must learn to deal with them, avoid hidden traps, and use healing fountains wisely to survive.

The catch? The agent has partial observability, meaning it can only see its immediate 3x3 surroundings. It must make life-or-death decisions based on limited information, just like a real adventurer!

The Agent: A Brain Built with Q-Learning

This agent's intelligence isn't hard-coded. It's built on the principles of Temporal Difference learning, specifically Q-learning. Through thousands of simulated runs, it learns the optimal strategy for any situation—when to fight, when to hide, when to heal, and when to just wait for a guard to pass. It learns from its mistakes, associating rewards and penalties with its actions until it masters the art of escape.

How to Run the Agent

Want to see it in action?

1. Train the Agent

First, you need to let the agent learn. Run the following command to start the training process. This will generate a Q_table.pickle file containing the agent's "brain."

python3 Q_learning.py train

(Note: Training can take a long time! The more episodes, the smarter the agent gets.)

2. Watch the Agent Play

Once training is complete, run the agent in evaluation mode with a GUI to see what it has learned.

python3 Q_learning.py gui

This project was built with Python, PyGame, NumPy, and Matplotlib. Dive into Q_learning.py to see how the magic happens!

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
code		code
Q_table_1000000_0.999999.pickle		Q_table_1000000_0.999999.pickle
Q_table_100000_0.99999.pickle		Q_table_100000_0.99999.pickle
Q_table_10000_0.9999.pickle		Q_table_10000_0.9999.pickle
Q_table_1000_0.99.pickle		Q_table_1000_0.99.pickle
Q_table_1000_0.999.pickle		Q_table_1000_0.999.pickle
Q_table_5000000_0.9999995.pickle		Q_table_5000000_0.9999995.pickle
README.md		README.md
action_dist_1000000_0.999999.png		action_dist_1000000_0.999999.png
action_dist_100000_0.99999.png		action_dist_100000_0.99999.png
action_dist_10000_0.9999.png		action_dist_10000_0.9999.png
action_dist_1000_0.99.png		action_dist_1000_0.99.png
action_dist_1000_0.999.png		action_dist_1000_0.999.png
action_dist_5000000_0.9999995.png		action_dist_5000000_0.9999995.png
requirements.txt		requirements.txt
rewards_plot_1000000_0.999999.png		rewards_plot_1000000_0.999999.png
rewards_plot_100000_0.99999.png		rewards_plot_100000_0.99999.png
rewards_plot_10000_0.9999.png		rewards_plot_10000_0.9999.png
rewards_plot_1000_0.99.png		rewards_plot_1000_0.99.png
rewards_plot_1000_0.999.png		rewards_plot_1000_0.999.png
rewards_plot_5000000_0.9999995.png		rewards_plot_5000000_0.9999995.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🏰 Escape the Castle: An RL Adventure

The Challenge

The Agent: A Brain Built with Q-Learning

How to Run the Agent

1. Train the Agent

2. Watch the Agent Play

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🏰 Escape the Castle: An RL Adventure

The Challenge

The Agent: A Brain Built with Q-Learning

How to Run the Agent

1. Train the Agent

2. Watch the Agent Play

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages