Skip to content

brightyoun/TITS-LPST

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

56 Commits
 
 
 
 
 
 

Repository files navigation

License Plate Detection via Information Maximization (TITS-LPST)

Examples of DHD

Younkwan Lee, Jihyo Jeon, Yeongmin Ko, Moongu Jeon, Witold Pedrycz

This is the official website for "License Plate Detection via Information Maximization (submitted to T-ITS 2020)", which is a newly built real-world dataset for license plate detection and scene text detection.

  • 9.7k+ images and 110k+ instances
  • Scenes: traffic and drone, Tasks: license plate detection and scene text (excluding license plate) detection
  • Variations in tilt degrees: Horizontal tilt degree (15 ~ 45 degrees) and vertical tilt degree (15 ~ 45 degrees).
  • Variations in distance: The distance from the license plate to the camera location is relatively diverse.
  • Variations in blur: Blurry image due to motion blur and hand jitter while capturing images.
  • If you are interested in license plate detection, please refer to [our paper] or our github project.

Examples of DHD

Table of Contents

  1. Abstract
  2. Introduction
  3. License Plate Detection Benchmarks
    2.1 LPST-110K Sample
    2.2 Full dataset
  4. Benchmark
    3.1 AOLP Dataset
    3.2 PKU Dataset
    3.3 UFPR Dataset
    3.4 CCPD Dataset
    3.5 LTSP-110K Dataset
  5. Comparisons
    4.1 Ablation Study 4.2 Comparisons with SOTA
  6. Citation
  7. Evaluation on the test set
  8. Contact

Abstract

License plate (LP) detection in the wild remains challenging due to the diversity of environmental conditions. Nevertheless, prior solutions have focused on controlled environments, such as when LP images frequently emerge as from an approximately frontal viewpoint and without scene text which might be mistaken for an LP. However, even for state-of-the-art object detectors, their detection performance is not satisfactory for real-world environments, suffering from various types of degradation. To solve these problems, we propose a novel end-to-end framework for robust LP detection, designed for such challenging settings. Our contribution is threefold: (1) A novel information-theoretic learning that takes advantage of a shared encoder, an LP detector and a scene text detector (excluding LP) simultaneously; (2) Localization refinement for generalizing the bounding box regression network to complement ambiguous detection results; (3) a large-scale, comprehensive dataset, LPST-110K, representing real-world unconstrained scenes including scene text annotations. Computational tests show that the proposed model outperforms other state-of-the-art methods on a variety of challenging datasets.License plate (LP) detection in the wild remains challenging due to the diversity of environmental conditions. Nevertheless, prior solutions have focused on controlled environments, such as when LP images frequently emerge as from an approximately frontal viewpoint and without scene text which might be mistaken for an LP. However, even for state-of-the-art object detectors, their detection performance is not satisfactory for real-world environments, suffering from various types of degradation. To solve these problems, we propose a novel end-to-end framework for robust LP detection, designed for such challenging settings. Our contribution is threefold: (1) A novel information-theoretic learning that takes advantage of a shared encoder, an LP detector and a scene text detector (excluding LP) simultaneously; (2) Localization refinement for generalizing the bounding box regression network to complement ambiguous detection results; (3) a large-scale, comprehensive dataset, LPST-110K, representing real-world unconstrained scenes including scene text annotations. Computational tests show that the proposed model outperforms other state-of-the-art methods on a variety of challenging datasets.

1. Introduction

Object detection research has attracted great interest in recent years, with models being applied widely in many traffic-related applications. A variety of methods have demonstrated high accuracy in detecting license plates (LP) under controlled settings. While existing detectors successfully applied to the LP detection problem, many key challenges still remain in \textit{unconstrained wild scenarios}. For example, real-world LP detection causes the following problems: modifications of prior settings to adapt to wild, incorrect detection results, ambiguity in classifying objects associated with scene text, low-quality visual data, uneven lighting, motion blur, and others. However, such scenarios are becoming increasingly common and gaining significant popularity in a variety of applications, including civil security, crowd analytics, law enforcement, and street view images. Despite being the most common scenario, LP benchmarks still do not consider real-world cases, and therefore many problems are not adequately addressed. As a result, state-of-the-art detectors struggle with these images. we propose an end-to-end framework which is composed of a single shared feature encoder and two parallel detection branches. The single shared encoder learns a global feature across all detection tasks (LP and non-LP respectively). More specifically, due to non-LP objects (scene text but not LP), our framework is divided into 1) LP detection network and 2) non-LP detection network. Different from traditional LP detection models, we explicitly prevent learning of non-LP objects. To this end, we bring a novel information-theoretic loss to minimize mutual information between the embedding feature and non-LP distribution that interferes with LP detection. We collect a new large-scale dataset, LPST-110K, containing images captured from unconstrained scenes. To the best of our knowledge, LPST-110K is the first dataset to address LP and scene text simultaneously for LP detection. By evaluating state-of-the-art detection models on LPST-110K, we demonstrate the accuracy improvement of our proposed model compared with other approaches.

2. License Plate Detection Benchmarks

name #images #instances #LP instances/Image #ST instances/Image #Variations in tilt degrees. #Variations in distance. #Variations in blur.
AOLP (T-VT, 2013) 2,049 2,049 1 1
SSIG (JEI, 2016) 2,000 8,683 4.34 4.34
PKU (T-IP, 2016) 3,977 4,389 1.10 1.10
UFPR (IJCNN, 2018) 4,500 4,500 1 1
CD-HARD (ECCV, 2018) 102 102 1 1
CCPD (ECCV, 2018)) 250K 250K 1 1
LPST-110K (Ours) 9,795 110K 5.21 11

Annotations format

The CSV file with annotations should contain one annotation per line. Images with multiple bounding boxes should use one row per bounding box. Note that indexing for pixel values starts at 0. The expected format of each line is:

path,x1,y1,x2,y2,class_name

2.1 LPST-110K Sample

2.2 Full dataset

To see full dataset, here's the Email and request Dataset contact us.

3. License Plate Detection Results on Benchmarks

3.1 AOLP Dataset

3.2 PKU Dataset

3.3 UFPR Dataset

3.4 CCPD Dataset

3.5 LTSP-110K Dataset

  • LP Detection Results on validation (AP)

    method AP AP75
    Faster R-CNN .781 .578
    SSD300 .740 .639
    YOLO9000 .729 .625
    RetinaNet (Baseline) .863 .769
    + IF loss .905 .885
    + GRL .906 .890
    + LRM (Ours) .911 .915
  • non-LP Detection Results on validation (Precision, Recall)

    method Precision Recall
    Faster R-CNN 31.1 33.8
    SSD300 29.8 29.2
    YOLO9000 30.5 23.9
    RetinaNet (Baseline) 37.3 32.1
    + IF loss 19.1 21.1
    + GRL 21.5 21.0
    + LRM (Ours) 20.2 15.3

4. Qualitative Results

4.1 Ablation Study

  • Ablation Study Examples of DHD

4.2 Comparisons with SOTA

  • Results on Various Benchmarks Examples of DHD

  • Results on LTSP-110K

![Examples of DHD](imgs/supp_002.png) ![Examples of DHD](imgs/supp_003.png)

5. Citation

If this project help your research, please consider to cite our github page.

6. Evaluation on the test set

Ablation studies can be conducted on the validation set. If you would like to evaluate your model on the test set, you can send us ([email protected]) your detection results in the json format.

7. Contact

If you have any questions or want to add your results, please feel free to contact us.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published