ReLoD: The Remote-Local Distributed System for Real-time Reinforcement Learning on Vision-Based Robotics Tasks
ReLoD uses a wired local and a wireless remote computer to perform real-time learning, an appealing setting for industrial learning systems. It is a generalist RL system for learning with real robots from scratch! Check out how ReLoD learns to perform vision-based tasks on UR5 and Roomba (iRobot Create 2): Youtube video
- Soft Actor Critic (SAC)
- Proximal Policy Optimization (PPO)
N.B: All vision-based experiments use Random Augmented Data (RAD) to improve sample efficiency
|   UR-Reacher |   Franka-VisualReacher | 
|---|
|   Create-Reacher |   Vector-ChargerDetector | 
|---|
| Hyper-parameter | Value | 
|---|---|
| Replay buffer | 100K | 
| Actor step size | 3e-4 | 
| Critic step size | 3e-4 | 
| Entropy coefficient step size | 3e-4 | 
| Batch size | 256 | 
| Discount factor | 0.99 | 
| Update every | 2 | 
| Num. update epochs every | 1 | 
| Actor MLP hidden sizes | [512 512] | 
| Critic MLP hidden sizes | [512 512] | 
| Warm-up time steps | 1000 | 
| Adam optimizer betas | [0.9, 0.999] | 
| Initial temperature | 0.1 | 
| Neural network activation | ReLU | 
- Download Mujoco and license files to ~/.mujoco
- Install miniconda or anaconda
- Create a virtual environment:
conda create --name myenv python=3.6    # Python 3.6 is necessary
conda activate myenv- Add the following to ~/.bashrc:
conda activate myenv
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/home/<username>/.mujoco/mjpro210/bin   # Change based on mujoco version
export MUJOCO_GL="egl"  # System specificand run:
source ~/.bashrc- Install packages with:
pip install -r requirements.txt
pip install . python task_ur5_visual_reacher.py  --work_dir "./results" --mode 'l' --seed 0 --env_steps 200100 The code for the Franka task can be found in this branch.
Wang, Y.⋆, Vasan, G.⋆, & Mahmood, A. R. (2023). Real-time reinforcement learning for vision-based robotics utilizing local and remote computers. In Proceedings of the 2023 International Conference on Robotics and Automation (ICRA).