Conv2D Implementation with OpenCL

This project demonstrates various convolutional neural network (CNN) operations using OpenCL for GPU acceleration. The operations include convolution, ReLU activation, max pooling, batch normalization, tensor concatenation, upsampling and sigmoid activation.

Features

Tensor Convolution
ReLU Activation
Max Pooling
Batch Normalization
Tensor Concatenation
Upsampling

Prerequisites

OpenCL SDK
OpenCV
cnpy
Meson

Installation of OpenCL in ubuntu

foo@bar:~$ sudo apt update
foo@bar:~$ sudo apt install intel-opencl-icd

Checking the installation of OpenCL Drivers

foo@bar:~$ clinfo
foo@bar:~$
Number of platforms                               1
Platform Name                                   Intel(R) OpenCL HD Graphics
Platform Vendor                                 Intel(R) Corporation
Platform Version                                OpenCL 3.0 
Platform Profile                                FULL_PROFILE

Installation and Configuration

Install the required libraries:

sudo apt-get install opencl-headers ocl-icd-opencl-dev
sudo apt-get install libopencv-dev
sudo apt-get install libboost-all-dev
sudo apt-get install cmake,meson
pip install -r requirements.txt

Clone the repository:

git clone https://github.com/raghulrajn/OpenCL
cd gpu

Clone cnpy:

git clone https://github.com/rogersce/cnpy.git
cp cnpy gpc/src

Download pretrained-kernel from ZF_UNET_224
```
python3 extractWeights zf_unet_224.h5
```
Weights and biases are extracted from the model and will be saved in pretrainedKernels folder

Required final structure

├── pretrainedKernels
├── gpu
    ├── lib
    │   ├── Core
    │   ├── lib
    │   ├── OpenCL
    │   └── vx
    ├── src
    │   └── cnpy
    │   ├── conv2d.cl
    │   ├── conv2d.cpp
    ├── meson.build
    ├── run.sh
    ├── results
    ├── sampleImages
    └── utils

Run the code

cd gpu
python3 genImage.py
chmod +x run.sh
./run.sh <PATH_TO_IMG NPY FILE>

Results after CPU and GPU execution are stored in results folder

Code Structure

conv2d.cpp: Main source file containing the implementation of Conv2d, Maxpool, Upsampling and UNET operations using OpenCL.
conv2d.cl: Kernel code for all GPU operations
pytorch.py: Python code for convolution, Relu, maxpool etc. Results are stored in npy folder
compare.py: Python code to crosscheck CPU, GPU and Pytorch results
genImage.py: Python code to create sample images for inference. png image and npy file of the same will generated

Name		Name	Last commit message	Last commit date
Latest commit History 96 Commits
.vscode		.vscode
gpu		gpu
sampleImages		sampleImages
README.md		README.md
extractWeights.py		extractWeights.py
gitignore		gitignore
model_summary.md		model_summary.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Conv2D Implementation with OpenCL

Features

Prerequisites

Installation of OpenCL in ubuntu

Checking the installation of OpenCL Drivers

Installation and Configuration

Required final structure

Run the code

Code Structure

Performance comparison

GPU performance

On Intel Iris Xe GPU

On AMD GPU

Results

About

Uh oh!

Uh oh!

Contributors 2

Uh oh!

Languages

raghulrajn/UNET-on-GPU-using-OpenCL

Folders and files

Latest commit

History

Repository files navigation

Conv2D Implementation with OpenCL

Features

Prerequisites

Installation of OpenCL in ubuntu

Checking the installation of OpenCL Drivers

Installation and Configuration

Required final structure

Run the code

Code Structure

Performance comparison

GPU performance

On Intel Iris Xe GPU

On AMD GPU

Results

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors 2

Uh oh!

Languages