In this repository, I attempt to reimplement popular operations such as convolution and linear layers from scratch, using CUDA (for GPU) and C++ (for CPU), to deepen my understanding of Deep Learning. Then I modify these operations and implement novel custom operations aimed at reducing inference time and increasing efficiency.
There is a mathematical derivation behind each of my implementations. You may want to read the C++ version first and work out the math yourself before jumping into the CUDA version, since you have to think "in parallel" and "spatially" to write CUDA code. The documentation for NVIDIA CUDA programming can be found here. Next, I will explore backends such as cuDNN and cuBLAS to optimize my code.
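As a concrete example of the kind of math you would work out before parallelizing it, here is a naive NumPy sketch of direct 2D convolution (cross-correlation, no padding, stride 1). This is an illustration only, not the repository's code; it mirrors what a straightforward C++ loop version computes:

```python
import numpy as np

def conv2d_naive(x, w):
    """Direct 2D cross-correlation, no padding, stride 1:
    y[i, j] = sum over (p, q) of x[i + p, j + q] * w[p, q]."""
    H, W = x.shape
    K, _ = w.shape
    out = np.zeros((H - K + 1, W - K + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            # Each output element is a dot product between the kernel
            # and one K x K window of the input.
            out[i, j] = np.sum(x[i:i + K, j:j + K] * w)
    return out
```

In the CUDA version, each of these independent output elements naturally maps to one thread, which is why the spatial structure of the loop matters.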
To build the `cuda_layers` or `cpp_layers` package defined in the `setup.py` file, you need either Docker or your own environment. For those who want to use Docker:

```bash
docker pull kylepaul/deeplearning:deployment
```

More information about this Docker image can be found here. Then follow my `compose.yml` file to initialize the container and run it with:

```bash
docker compose up -d
```
The source code for the convolution and linear layers lives in the `src` folder, where you can find both the `cpp` version running on the CPU and the `cuda` version running on the GPU, which exploits parallel computation through custom kernels. The `cuda_layers` package contains the backward-pass code that can be registered with PyTorch's `torch.autograd` mechanism. To install the package, go into `cuda_layers` and run:

```bash
python setup.py install
```
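For reference, a minimal `setup.py` for a PyTorch CUDA extension usually follows the pattern below, built with `torch.utils.cpp_extension`. The source file names here are illustrative, not necessarily the ones in this repository:

```python
from setuptools import setup
from torch.utils.cpp_extension import BuildExtension, CUDAExtension

setup(
    name="cuda_layers",
    ext_modules=[
        CUDAExtension(
            name="cuda_layers",
            # Illustrative file names: a C++ binding file plus CUDA kernels.
            sources=["src/bindings.cpp", "src/conv2d_kernel.cu"],
        )
    ],
    # BuildExtension handles the nvcc/g++ split when compiling .cu and .cpp files.
    cmdclass={"build_ext": BuildExtension},
)
```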
All the training code and the custom operation registration, covering both the forward and backward passes, is in the file `modules.py`; a sketch of such a registration is shown below.
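As an illustration only (not the repository's exact code), a custom linear operation is typically registered with `torch.autograd` like this, where `torch.mm` stands in for the compiled `cuda_layers` kernels:

```python
import torch

class CustomLinear(torch.autograd.Function):
    @staticmethod
    def forward(ctx, input, weight):
        # Save the tensors needed to compute gradients in backward.
        ctx.save_for_backward(input, weight)
        # Stand-in for the custom CUDA forward kernel.
        return input.mm(weight.t())

    @staticmethod
    def backward(ctx, grad_output):
        input, weight = ctx.saved_tensors
        # Stand-ins for the custom CUDA backward kernels.
        grad_input = grad_output.mm(weight)      # dL/dx = dL/dy @ W
        grad_weight = grad_output.t().mm(input)  # dL/dW = dL/dy^T @ x
        return grad_input, grad_weight

# Usage: y = CustomLinear.apply(x, w)
```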
I use the default MNIST dataset from `torchvision` to test the operations. To run the training, simply:

```bash
python train.py
```