This project aims to develop machine learning models for detecting and classifying road signs in images, an essential task for advanced driver assistance systems, autonomous vehicles, and traffic monitoring. Additionally, we explore generative modeling to create new road sign images using Conditional Generative Adversarial Networks (cGANs).
- Problem Description
- Model Comparison
- Training Size Analysis
- Stretch Goal: Generative Innovation
- Acknowledgments & Resources
We address the challenge of classifying road signs into four categories: Traffic Light, Stop, Speed Limit, and Crosswalk, using labeled images with bounding boxes in the PASCAL VOC format.
- Dataset Size: 877 images
- Data Split: 70% training, 15% validation, 15% testing
- Metrics: Accuracy
- Challenges: Class imbalance, with a significant majority in the
Speed Limitclass.
We evaluated three models with varying hyperparameters:
- Tested hyperparameters:
C,penalty,solver,multi_class - Best result:
C=1,penalty=L2,solver=saga,multi_class=multinomial
- Tested hyperparameters:
C,kernel,gamma - Best result:
C=10,kernel=RBF,gamma=scale
- Tested hyperparameters:
learning rate,batch size,scheduler,optimizer - Best result:
lr=0.001,batch size=8,scheduler=StepLR,optimizer=SGD
The ResNet50 model achieved nearly 100% accuracy with the optimal hyperparameters.
We analyzed the effect of training data size on model performance. Results showed significant improvements with increased training data, but marginal gains beyond 50%, indicating the simplicity of the dataset with only four broad classes.
We implemented a Conditional Generative Adversarial Network (cGAN) to generate road sign images based on class labels.
- Adapted a basic cGAN for MNIST to handle road sign images.
- Adjusted latent dimensions, transforms, and architecture for our dataset.
- Evaluated the model qualitatively by examining generated images.
- Initial results were pixelated; improvements were made by adding layers and increasing epochs.
- Our work demonstrates a shift toward generative modeling, aligning with modern AI trends.
- Refine the model for high-resolution image generation.
- Explore applications in areas like stock image generation and reconstruction from occlusions.
- Investigate combining ResNet50 with the cGAN for input-output image mapping using custom loss functions.
Here are some sample images generated by our Conditional Generative Adversarial Network (cGAN):
- Dataset: Kaggle Link
- cGAN Implementation: Adapted from CGAN-PyTorch
- External References: Additional resources are cited within the report.
- Aryan Bahl
- Richie Ma
- Ronit Anandani
This project was part of CS 441 - Final Project, and we sincerely thank our instructors and collaborators for their guidance.



