https://github.com/blakcapple/drl-pid

Automatically tuning PID parameters based on deep reinforcement learning algorithm

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
○
codemeta.json file
○
.zenodo.json file
○
DOI references
✓
Academic publication links
Links to: arxiv.org
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (13.0%) to scientific vocabulary

Last synced: 10 months ago · JSON representation

Repository

Automatically tuning PID parameters based on deep reinforcement learning algorithm

Basic Info

Host: GitHub
Owner: blakcapple
License: mit
Language: Python
Default Branch: main
Homepage:
Size: 3.93 MB

Statistics

Stars: 6
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created about 5 years ago · Last pushed over 3 years ago

https://github.com/blakcapple/DRL-PID/blob/main/

# DRL-PID

## Intro

It's a framework that focuses on the dynamic adjustment of the PID parameters of the controlled objects using **Deep Reinforcement Learning Algorithm**, which in this project, I used **Soft Actor Critic** (SAC) as the basic DRL algorithm and achieved great results. 

Specifically, this project is using DRL-PID framework in the **auto driving scenario** (track following), but this framework can expand to any scene that needs dynamically tuning PID parameters without human intervention. The **main advantage** of this framework in my opinion is it can **search the optimal PID tuning policy automatically**, especially when too many PID parameters are coupled together, and manual adjustment is difficult. But it also suffers from the unexplainable deep learning procedure and can not provide the theory guarantee.   

**It's my bachelor graduation project and I hope the open source code can benefit  the future research in the relevant domains.** 

**If you find this project helpful, star on it :star:**

## Method 

#### Overview

The training framework: 

![image](https://raw.githubusercontent.com/blakcapple/DRL-PID/main/Image/framework.png)

The execution framework:

![image](https://raw.githubusercontent.com/blakcapple/DRL-PID/main/Image/structure.png)

#### Line Detection 

![image](https://raw.githubusercontent.com/blakcapple/DRL-PID/main/Image/searching%20algorithm.png)

#### ROS Node

RL training is implemented in the **ROS** and **gazebo** platform

The training system operates using the ROS topic communication mechanism. 

The overall system mainly includes **environment detection node** and **tracking control node**. 

Environment detection node: **Subscribe to the topics released by the camera**, pre-process the images detected by the camera, and obtain state information; at the same time, judge whether the smart car deviates from the track by judging the length of the left and right boundary point sets obtained by the image search algorithm, and store these information as ROS messages to post on the Environment Feedback topic.

tracking control node: Subscribe to the environmental feedback topic and the topic **odom** released by GAZEBO that contains the pose information of the car; **output angular velocity information and linear velocity** information to the topic **cmd_vel**, and **cmd_vel** controls the movement of the smart car simulation model.
Finally, GAZEBO subscribes to the **cmd_vel** topic to visualize the motion of the smart car simulation model in real-time in the tracking environment , eventually form a **complete control closed-loop structure**.

## Installation   

* ROS (16.04 or 20.04 has been tested alright)
* Gazebo 

* Pytorch 

## Quick Start

**summit_description** contains the ROS package needed in this project, install this in your own ROS workspace first and add it to ROS path. 

**launch the environment**

```
roslaunch summit_description summit.launch
```

**env_feedback.py** contains the algorithm that detects the track using camera and processes the original environment detection into the state information.

**launch the env detection node**

```
python env_feedback.py
```

after the preparation steps, you can launch the main node to start training.

**start training**

```
python main.py
```

## Code Structure

**my_ground_plane**

the path model used in the project, put this into the .gazebo/model folder. You can change the map by replacing the **MyImage.png** in the /materials/textures

**summit description**

* contain robot description, necessary ros message, ros service file and launch file 

**DRL-PID** 

* main.py : main file for training  
* model_testing.py : test trained model 
* sac_torch.py : SAC algorithm
* network.py: network definition 
* env_feedback.py: the env pre-process file 
* line_follower.py: the environment definition 

## Experiment

**map_0** 

![image](https://raw.githubusercontent.com/blakcapple/DRL-PID/main/Image/map_0.png)

**episode reward**

![image](https://raw.githubusercontent.com/blakcapple/DRL-PID/main/Image/score_plot.png)

**episode success rate**

![image](https://raw.githubusercontent.com/blakcapple/DRL-PID/main/Image/success_rate.png)

## Reference 

[A Self-adaptive SAC-PID Control Approach based on Reinforcement Learning for Mobile Robots](https://arxiv.org/pdf/2103.10686.pdf) (this project is highly based on this paper)

Owner

Name: Ruan Yudi
Login: blakcapple
Kind: user
Location: Hangzhou, China
Company: Zhejiang University

Repositories: 2
Profile: https://github.com/blakcapple

GitHub Events

Total

Watch event: 6
Fork event: 1

Last Year

Watch event: 6
Fork event: 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

https://github.com/blakcapple/drl-pid

Science Score: 10.0%

Repository

Basic Info

Statistics

https://github.com/blakcapple/DRL-PID/blob/main/

Owner

GitHub Events

Total

Last Year