Improvement of semantic segmentation Model

We're optimizing our real-time lane detection by making the segmentation model more lightweight.

Project period

2023.03.16 ~ 2023.03.31

Team members

Kim Junhyung (https://github.com/Jun-WFI-hyung)
Jung JiHoon (https://github.com/IsaacJung210)
Lee SeWoong (https://github.com/tomy807)
Park Jin (https://github.com/parkjin0903)

Used

Carla
RTAB-Map
ROS
U-Net
E-Net
DeeplabV3
Segformer
CityScapes DataSet

Overview

1. Benchmark

Compare team members' segmentation models and select the best one for improvement.
 - U-Net
 - E-Net
 - Deeplabv3+
 - Segformer

1-1) Data

Cityscapes

1-2) Performance Results

Selected U-Net after excluding Segformer, E-Net, and Deeplabv3+ due to chipset issues, 
low mIoU, and significantly slow inference, respectively.

2. U-net performance improvement

2-1) Data

Used CARLA because its data labeling is good and, most importantly, it is easy to run in real time.

2-2) Learning with Binary Cross-Entropy (BCE) Loss

- 2,200 images were randomly sampled per epoch from the 4,375 training images, for up to 10 epochs.
- Loss converged quickly, within 22,000 images.
- IoU fluctuated, as seen in the second graph.
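
The sampling scheme described above can be sketched as follows. This is a minimal illustration using only the standard library, assuming sampling without replacement within each epoch and a fresh draw per epoch; the function name and the seed are hypothetical and not taken from the project's train.py.

```python
import random

def sample_epoch_subsets(num_images, per_epoch, num_epochs, seed=0):
    """Return one list of image indices per epoch.

    Each epoch draws `per_epoch` distinct indices from the full training
    set, so different epochs generally see different subsets.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    all_indices = range(num_images)
    return [rng.sample(all_indices, per_epoch) for _ in range(num_epochs)]

# Figures from the list above: 2,200 of 4,375 images per epoch, 10 epochs.
subsets = sample_epoch_subsets(num_images=4375, per_epoch=2200, num_epochs=10)
total_seen = sum(len(s) for s in subsets)  # 2,200 * 10 = 22,000 images
```

With these numbers the total number of images seen is 22,000, which matches the point at which the loss is reported to have converged.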

2-3) Learning with Dice Loss

- With Dice loss alone, the loss hovered without converging
- Convergence was observed when Dice loss and BCE loss were used together
- Same training environment and conditions as for BCE loss
- Slower convergence than with BCE loss
- IoU was more stable and reliable than with BCE loss, with occasional unexpected zero values
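
For reference, a combined BCE + Dice loss can be sketched as below. This is a plain-Python illustration on flat lists of per-pixel probabilities, not the project's code: the smoothing term, the `dice_weight` parameter, and the simple weighted sum are assumptions, and a real training loop would compute the same quantities on PyTorch tensors.

```python
import math

def bce_loss(probs, targets, eps=1e-7):
    """Mean binary cross-entropy over per-pixel probabilities."""
    total = 0.0
    for p, t in zip(probs, targets):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total += -(t * math.log(p) + (1 - t) * math.log(1 - p))
    return total / len(probs)

def dice_loss(probs, targets, smooth=1.0):
    """Soft Dice loss: 1 - (2*|P.T| + s) / (|P| + |T| + s)."""
    intersection = sum(p * t for p, t in zip(probs, targets))
    denom = sum(probs) + sum(targets)
    return 1.0 - (2.0 * intersection + smooth) / (denom + smooth)

def bce_dice_loss(probs, targets, dice_weight=1.0):
    """Weighted sum of BCE and Dice loss (the weighting is an assumption)."""
    return bce_loss(probs, targets) + dice_weight * dice_loss(probs, targets)

targets = [1, 0, 1, 0]
good = bce_dice_loss([0.9, 0.1, 0.8, 0.2], targets)
bad = bce_dice_loss([0.4, 0.6, 0.3, 0.7], targets)
```

The BCE term scores each pixel independently, while the Dice term scores the overlap of the whole mask, which is one common reason for combining the two.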

2-4) Comparison

- The graphs show similar patterns for BCE loss and BCE+Dice loss training
- BCE+Dice loss converged more slowly but improved the IoU metric faster, leading 
  to an overall improvement.
- Tested on 495 images
- The model trained with Dice loss showed a 10% increase in average IoU
- The averages are somewhat inaccurate because of zero-IoU cases, such as noisy images.
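
One possible way to make the averaged figure easier to interpret is to report the zero-IoU cases separately and to skip images where neither the prediction nor the label contains any foreground. The sketch below is an assumption about how this could be done, not a description of how the project computed its numbers; masks are flat lists of 0/1 values.

```python
def binary_iou(pred, target):
    """IoU of two flat 0/1 masks; returns None when both masks are empty."""
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    union = sum(1 for p, t in zip(pred, target) if p or t)
    if union == 0:
        return None  # IoU is undefined when there is no foreground at all
    return intersection / union

def mean_iou(pairs):
    """Return (mean IoU over defined cases, number of zero-IoU cases)."""
    scores = [binary_iou(pred, target) for pred, target in pairs]
    valid = [s for s in scores if s is not None]
    zero_count = sum(1 for s in valid if s == 0.0)
    mean = sum(valid) / len(valid) if valid else None
    return mean, zero_count

pairs = [
    ([1, 1], [1, 1]),  # perfect overlap, IoU 1.0
    ([1, 0], [0, 1]),  # no overlap, IoU 0.0
    ([0, 0], [0, 0]),  # both empty, skipped
]
mean, zeros = mean_iou(pairs)
```

Reporting the zero count next to the mean shows how much of a low average comes from complete misses rather than from uniformly weaker masks.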

3. Lightweighting & Quantization

3-1) Lightweighting

- A lightweight model can be obtained through either efficient model design or parameter reduction
- The U-Net structure is simple enough that we focused on parameter reduction.

3-2) Quantization

- The left graph shows the trade-off between accuracy and latency
- The right graph shows that the 4-bit quantized model has better accuracy and a smaller size than 
  the 8-bit quantized model
- Well-tuned quantization of a larger network can give better performance and a smaller model size 
  than rough quantization of a smaller network

- Applied static quantization and quantization-aware training (QAT) to the models
- Used static quantization in TensorRT and PTQ (post-training quantization)
- Attempted dynamic quantization, but it was not suitable for convolutional operations.

3-3) QAT (Quantization-Aware Training)

- Quantization parameters can be calculated based on collected statistics or learned during training
- Float values are rounded to mimic int8 values, but calculations are still performed as floating 
  point numbers
- This method typically provides higher accuracy after quantization
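
The "round to mimic int8, but keep computing in floating point" step is often called fake quantization. A minimal scalar sketch follows; the scale and zero-point values are illustrative assumptions, and a framework would apply the same operation element-wise to whole tensors.

```python
def fake_quantize(x, scale, zero_point, qmin=-128, qmax=127):
    """Quantize x to the int8 grid, then map it straight back to float.

    The returned value is still a float, but it can only take values
    that an int8 tensor with this scale and zero point could represent.
    """
    q = round(x / scale) + zero_point   # snap to the nearest integer level
    q = max(qmin, min(qmax, q))         # clamp to the int8 range
    return (q - zero_point) * scale     # dequantize back to float

snapped = fake_quantize(0.26, scale=0.1, zero_point=0)     # lands on the 0.3 level
clamped = fake_quantize(100.0, scale=0.1, zero_point=0)    # saturates at level 127
```

Because the forward pass already sees the rounding and clamping error, training can adapt the weights to it, which is the usual explanation for the higher post-quantization accuracy.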

3-4) Applied quantization model structure

- Create pth files from the improved U-Net models
- Quantize using PTQ and QAT
- Use MinMaxObserver to store output values as unsigned int8 and weight values as signed int8
- Find optimal values by collecting tensor statistics (e.g. min and max values) and fine-tuning
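
To illustrate how min and max statistics turn into quantization parameters, the sketch below derives a scale and zero point for unsigned 8-bit outputs (affine) and signed 8-bit weights (symmetric). It shows the general idea only: the function names are hypothetical, and the exact formulas used by a given framework's min/max observer may differ in detail.

```python
def affine_qparams(min_val, max_val, qmin=0, qmax=255):
    """Scale and zero point for unsigned 8-bit values (e.g. layer outputs)."""
    # Extend the range to include 0.0 so that zero is exactly representable.
    min_val = min(min_val, 0.0)
    max_val = max(max_val, 0.0)
    scale = (max_val - min_val) / (qmax - qmin)
    if scale == 0.0:
        return 1.0, qmin  # degenerate case: every observed value was zero
    zero_point = qmin - round(min_val / scale)
    zero_point = max(qmin, min(qmax, zero_point))
    return scale, zero_point

def symmetric_qparams(min_val, max_val, qmax=127):
    """Scale and zero point for signed 8-bit values (e.g. weights)."""
    max_abs = max(abs(min_val), abs(max_val))
    scale = max_abs / qmax if max_abs > 0 else 1.0
    return scale, 0  # a symmetric range keeps the zero point at 0

act_scale, act_zero_point = affine_qparams(-1.0, 3.0)
w_scale, w_zero_point = symmetric_qparams(-0.5, 1.27)
```

Here the observed output range [-1.0, 3.0] is spread over 256 levels with the zero point shifted to level 64, while the weight range is centered on zero.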

3-5) Comparison of quantization / dequantization test results

- Most time is spent on convolution and transposed convolution
- Switching to quantized layers reduced convolution time from 67 ms to 27 ms
- Overall inference speed improved by 50% or more: inferring one image took 1.6 seconds before and 720 ms after.

- Logs are stored every 100 images
- Loss values decreased, but noise appeared in the inferred images
- QAT was run for 10 to 1000 epochs
- Loss and IoU tend to get worse as the number of epochs increases
- Model size reduced from 280 MB to 93 MB
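
As a quick arithmetic check, the reported figures can be turned into relative reductions. The sketch assumes that the 1.6 s and 720 ms values are the per-image inference times before and after quantization, respectively; the helper name is hypothetical.

```python
def reduction_percent(before, after):
    """Relative reduction from `before` to `after`, in percent."""
    return (before - after) / before * 100.0

conv_time = reduction_percent(67, 27)         # convolution time, ms
inference_time = reduction_percent(1600, 720) # per-image inference, ms (assumed reading)
model_size = reduction_percent(280, 93)       # model size, MB
size_ratio = 280 / 93                         # how many times smaller the model is

print(f"convolution time: -{conv_time:.1f}%")
print(f"inference time:   -{inference_time:.1f}%")
print(f"model size:       -{model_size:.1f}% ({size_ratio:.2f}x smaller)")
```

Under that reading, the per-image time drops by 55%, consistent with the "50% or more" stated above, and the model is roughly three times smaller.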

3-6) Conclusion

How to use

Check and customize Unet_config.json
train : python train.py [T or F : whether to load an existing pth file]
- optional arg: -p unet_epoch000.pth [pth filename to load]
- e.g. python train.py T -p unet_epoch075.pth
test : python inference.py [pth filename]
- e.g. python inference.py unet_epoch075.pth

