CrowdNav

CrowdNav is a research framework for training and evaluating socially-aware robot navigation policies in crowded environments. A robot learns to navigate to a goal among multiple humans (controlled by policies unknown to the robot, such as ORCA) while avoiding collisions and behaving in a socially acceptable way.

The project bundles an OpenAI Gym simulation environment (crowd_sim/) with a set of learning/classical navigation policies (crowd_nav/), and includes a vendored copy of the Python-RVO2 library used for human motion simulation.

Project Layout

crowdnav-dip/
├── crowd_sim/           # OpenAI Gym simulation environment (humans + robot)
│   └── envs/
├── crowd_nav/           # Policies, training, and testing code
│   ├── configs/         # env.config, policy.config, train.config
│   ├── policy/          # cadrl, lstm_rl, sarl, multi_human_rl
│   ├── utils/           # trainer, explorer, replay memory
│   ├── data/            # trained models and outputs (e.g. data/output_trained)
│   ├── train.py
│   └── test.py
├── Python-RVO2-main/    # Vendored Python-RVO2 library (ORCA solver bindings)
└── setup.py

Available Policies

Implemented in crowd_nav/policy/ — see crowd_sim/README.md for more detail.

  • ORCA — Optimal Reciprocal Collision Avoidance, a classical reaction-based baseline
  • CADRL — value network that evaluates actions against a single (most important) human
  • LSTM-RL — LSTM encoder over human states
  • SARL — pairwise interaction module + self-attention over humans
  • OM-SARL — SARL extended with a local occupancy map that encodes human-human interactions

Setup

One command sets everything up (conda env + Python-RVO2 + package in editable mode + gym pin):

./scripts/setup_env.sh
conda activate navigate

The script is idempotent — safe to re-run. Requires conda and, for video export, ffmpeg:

  • Linux: sudo apt install ffmpeg cmake build-essential
  • macOS: brew install ffmpeg cmake

Manual setup (if you prefer not to use the script)

conda create -n navigate --override-channels -c conda-forge python=3.10
conda activate navigate
pip install "cython<3"

# Python-RVO2 (CMake >= 4 requires the policy flag)
export CMAKE_POLICY_VERSION_MINIMUM=3.5
(cd Python-RVO2-main && python setup.py build && python setup.py install)

pip install -e .
pip install "gym==0.15.7" pytest

Getting Started

The repository is organized in two parts: crowd_sim/ contains the simulation environment and crowd_nav/ contains the training and testing code. Details of the simulation framework are in crowd_sim/README.md. The commands below should be run from inside crowd_nav/.

One-line commands

After ./scripts/setup_env.sh completes:

crowdnav-preflight        # verify rvo2 / ffmpeg / gym / trained model
crowdnav-gui              # launch the PyQt GUI (slide-14 demo)
crowdnav-baseline         # reproduce R1 -> exports/baseline.mp4

Advanced flag-driven usage with python test.py ... is documented below.

Visualize a test case

cd crowd_nav
python test.py --policy sarl --model_dir data/output_trained --phase test --visualize --test_case 0

Useful test.py flags (see crowd_nav/test.py):

  • --policy {orca, cadrl, lstm_rl, sarl} — policy to evaluate
  • --model_dir <path> — directory containing rl_model.pth and config files
  • --phase {train, val, test} — dataset phase
  • --test_case <int> — specific scenario id to visualize
  • --visualize — show an animated rollout
  • --video_file <path> — save the rollout to a video file
  • --traj — overlay trajectories
  • --gpu — run on GPU
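
The flags compose; for example, a trajectory-overlaid rollout of a different test case saved to a video, followed by the classical ORCA baseline on the same case for comparison (case number and output name are illustrative, and the ORCA run assumes no --model_dir is needed):

python test.py --policy sarl --model_dir data/output_trained --phase test --test_case 5 --visualize --traj --video_file sarl_case5.mp4
# classical baseline on the same scenario, no trained model required
python test.py --policy orca --phase test --test_case 5 --visualize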

Train a policy

cd crowd_nav
python train.py --policy sarl

Useful train.py flags (see crowd_nav/train.py):

  • --policy {cadrl, lstm_rl, sarl} — policy to train
  • --output_dir <path> — where checkpoints and logs are written (default data/output)
  • --resume — resume from a previous run in --output_dir
  • --gpu — train on GPU
  • --debug — enable debug logging
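
For example, a CPU training run written to a custom directory and later resumed (the directory name is illustrative):

python train.py --policy sarl --output_dir data/output_sarl
# pick up the same run again after an interruption
python train.py --policy sarl --output_dir data/output_sarl --resume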

Training parameters live in crowd_nav/configs/train.config; environment parameters (number of humans, scenario type, reward shaping) in crowd_nav/configs/env.config; policy hyperparameters in crowd_nav/configs/policy.config.
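
For orientation, the excerpt below follows the upstream CrowdNav env.config layout; key names in this fork may differ, so treat it as an illustrative sketch rather than a reference:

[sim]
# number of simulated humans and the scenario used for the test phase
human_num = 5
test_sim = circle_crossing

[reward]
# reward shaping: success bonus, collision penalty, discomfort distance
success_reward = 1
collision_penalty = -0.25
discomfort_dist = 0.2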

Reproducing R1 (baseline + video)

R1 means: install the env, re-run the given DRL policy, export a simulation video.

./scripts/run_baseline.sh
# → exports/baseline.mp4

Verify end-to-end with the smoke test:

pytest -m smoke -v

What this covers:

  • Loads the trained SARL policy from crowd_nav/data/output_trained/rl_model.pth (no retraining — per R7).
  • Runs test case 0 under seed 42 — deterministic log lines are byte-identical across runs on the same machine (NF3).
  • Writes exports/baseline.mp4 (~500 KB–1 MB, ffmpeg-encoded).

See docs/R1_VERIFICATION.md for the full manual checklist.

Troubleshooting

  • Python-RVO2 build fails — make sure cython is installed in the active env and cmake/build-essential are on the system. On CMake 4+, setup_env.sh exports CMAKE_POLICY_VERSION_MINIMUM=3.5 to keep the vendored build compatible.
  • gym API errors — this project pins gym==0.15.7; newer versions break the env interface.
  • CUDA/GPU — omit --gpu to run on CPU; all scripts work without a GPU.
  • ffmpeg not found — --video_file will error with "ffmpeg not found on PATH". Install ffmpeg (Linux apt install ffmpeg, macOS brew install ffmpeg) and re-run. The render module autodetects ffmpeg via shutil.which at import time.
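
To see what the render module will detect, run the same lookup it uses (shutil.which):

python -c "import shutil; print(shutil.which('ffmpeg') or 'ffmpeg NOT on PATH')"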
