To use this repository, follow these steps:
# Clone the repository:
git clone https://github.com/AhmedZeer/cuda-kernels.git
# Navigate into the cloned repository:
cd cuda-kernels
# Build it:
make
# Run benchmark for GEMM:
./benchmark_gemm