Reinforcement-learning ~ Off-policy reinforcement learning for optimal control ~ The DDPG (Deep Deterministic Policy Gradient) algorithm is used for self-driving