Local image binarization algorithms implemented in PyTorch. The package includes the Otsu thresholding algorithm and the algorithm by Su et al. (an extension of the Otsu algorithm). The implementations are optimized for performance: with `torch.compile`, the GPU implementation runs approximately 4125x faster than the CPU-based NumPy version.

This project was written to test the benefits of `torch.compile`. For a log of the performance improvements, refer to `optimizations.ipynb`. The notebook expects `triton==3.0.0` and `torch==2.3.0`.
Install using `pip`:

```sh
pip install git+https://github.com/nopperl/torch-image-binarization
```
The package requires `torch>=2.2.0` and optionally `triton>=2.21`, which need to be installed separately, e.g. using `pip`:

```sh
pip install torch torchvision triton
```
Read an image:

```python
from torchvision.io import ImageReadMode, read_image

# Load the image as a single-channel grayscale tensor.
img = read_image("test_image.png", mode=ImageReadMode.GRAY)
```
Binarize the image:

```python
from torch_image_binarization.thresholding import su

binarized = su(img)
```
For more information, refer to `torch_image_binarization/main.py`.
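Since the speedups reported below come from `torch.compile`, the `su` function can also be compiled before use. A minimal sketch (moving the image to a CUDA device is an assumption here, matching the CUDA-based benchmark setup):

```python
import torch
from torchvision.io import ImageReadMode, read_image

from torch_image_binarization.thresholding import su

# Compile once; repeated calls reuse the generated kernels.
su_compiled = torch.compile(su)

# Moving the image to the GPU is assumed, matching the CUDA benchmark below.
img = read_image("test_image.png", mode=ImageReadMode.GRAY).cuda()
binarized = su_compiled(img)
```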
To show the performance gains of the CUDA-based PyTorch implementation over the CPU-based NumPy implementation and the benefits of `torch.compile`, the runtime is measured across different input image sizes. For more information, refer to `optimizations.ipynb`.
Results:

```
[------------------------------------ su -------------------------------------]
                                                                    |  runtime
1 threads: --------------------------------------------------------------------
      numpy                                                         |  3548992.0
      su(img)                                                       |    10426.5
      torch.compile(su)(img)                                        |     1333.6
      torch.compile(su, mode='reduce-overhead')(img)                |      858.8
      torch.compile(su, mode='max-autotune')(img)                   |      859.6
      torch.compile(su, dynamic=True)(img)                          |      859.7
      torch.compile(su, dynamic=True, mode='reduce-overhead')(img)  |      860.0
      torch.compile(su, dynamic=True, mode='max-autotune')(img)     |      860.0

Times are in microseconds (us).
```
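The exact measurement setup lives in `optimizations.ipynb`; the following is only a minimal sketch of how such timings could be reproduced with `torch.utils.benchmark` (the image path and the use of a CUDA device are assumptions):

```python
import torch
import torch.utils.benchmark as benchmark
from torchvision.io import ImageReadMode, read_image

from torch_image_binarization.thresholding import su

# Assumed test image; any grayscale image works.
img = read_image("test_image.png", mode=ImageReadMode.GRAY).cuda()

# Compare the eager and compiled variants of `su`.
variants = {
    "su(img)": su,
    "torch.compile(su)(img)": torch.compile(su),
}

results = []
for label, fn in variants.items():
    fn(img)  # warm-up, so compilation time is excluded from the measurement
    timer = benchmark.Timer(
        stmt="fn(img)",
        globals={"fn": fn, "img": img},
        label="su",
        sub_label=label,
        description="runtime",
    )
    results.append(timer.blocked_autorange(min_run_time=1))

benchmark.Compare(results).print()
```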