opencv_transforms

This repository is intended as a faster drop-in replacement for Pytorch's Torchvision augmentations. This repo uses OpenCV for fast image augmentation for PyTorch computer vision pipelines. I wrote this code because the Pillow-based Torchvision transforms was starving my GPU due to slow image augmentation.

Requirements

A working installation of OpenCV. Tested with OpenCV version 3.4.1, 4.1.0
Tested on Windows 10 and Ubuntu 18.04. There is evidence that OpenCV doesn't work well with multithreading on Linux / MacOS, for example num_workers >0 in a pytorch DataLoader. I haven't run into this issue yet.

Installation

Using pip

opencv_transforms is available as a pip package:

pip install opencv_transforms

Using UV (recommended for development)

This project now uses UV for dependency management. To install for development:

Install UV if you haven't already:

curl -LsSf https://astral.sh/uv/install.sh | sh

Clone the repository and install dependencies:

git clone https://github.com/jbohnslav/opencv_transforms.git
cd opencv_transforms
uv sync --all-extras  # This installs all dependencies including dev dependencies

Run commands in the UV environment:

uv run python your_script.py
# or activate the virtual environment
source .venv/bin/activate  # On Unix/macOS
# or
.venv\Scripts\activate  # On Windows

Usage

Breaking change! Please note the import syntax!

from opencv_transforms import transforms
From here, almost everything should work exactly as the original transforms.

Example: Image resizing

import numpy as np
image = np.random.randint(low=0, high=255, size=(1024, 2048, 3))
resize = transforms.Resize(size=(256,256))
image = resize(image)

Should be 1.5 to 10 times faster than PIL. See benchmarks

Performance

Most transformations are between 1.5X and ~4X faster in OpenCV. Large image resizes are up to 10 times faster in OpenCV.
To reproduce the following benchmarks, download the Cityscapes dataset.
An example benchmarking file can be found in the notebook bencharming_v2.ipynb I wrapped the Cityscapes default directories with a HDF5 file for even faster reading.

The changes start to add up when you compose multiple transformations together.

Debug Utilities

The package includes optional debug utilities for investigating differences between PIL (torchvision) and OpenCV implementations:

# Basic debugging
from opencv_transforms.debug import utils
result = utils.compare_contrast_outputs(image, contrast_factor=0.5)

# Create test summary across multiple contrast factors
summary = utils.create_contrast_test_summary(image)

# Analyze PIL precision issues
utils.analyze_pil_precision_issue(image)

Visualization (requires matplotlib)

# Install dev dependencies (includes debug utilities)
uv sync

# Create comparison figures
from opencv_transforms.debug.visualization import create_comparison_figure
create_comparison_figure(original, pil_result, cv_result, "Contrast Transform")

Dataset Testing (requires datasets library)

from opencv_transforms.debug.dataset_utils import test_with_dataset_image
results = test_with_dataset_image("beans", num_samples=3)

TODO

Initial commit with all currently implemented torchvision transforms
Cityscapes benchmarks
Debug utilities for investigating PIL/OpenCV differences
Make the resample flag on RandomRotation, RandomAffine actually do something
Speed up augmentation in saturation and hue. Currently, fastest way is to convert to a PIL image, perform same augmentation as Torchvision, then convert back to np.ndarray

Name		Name	Last commit message	Last commit date
Latest commit History 158 Commits
benchmarks		benchmarks
debug		debug
opencv_transforms		opencv_transforms
scripts		scripts
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
TEST_PLAN.md		TEST_PLAN.md
UPDATE.md		UPDATE.md
benchmark.py		benchmark.py
imagenet_benchmark_results.png		imagenet_benchmark_results.png
multi_size_benchmark_results.csv		multi_size_benchmark_results.csv
multi_size_benchmark_results.png		multi_size_benchmark_results.png
multi_size_benchmark_results_heatmap.png		multi_size_benchmark_results_heatmap.png
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

opencv_transforms

Requirements

Installation

Using pip

Using UV (recommended for development)

Usage

Example: Image resizing

Performance

Debug Utilities

Visualization (requires matplotlib)

Dataset Testing (requires datasets library)

TODO

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

jbohnslav/opencv_transforms

Folders and files

Latest commit

History

Repository files navigation

opencv_transforms

Requirements

Installation

Using pip

Using UV (recommended for development)

Usage

Example: Image resizing

Performance

Debug Utilities

Visualization (requires matplotlib)

Dataset Testing (requires datasets library)

TODO

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages